Associative classifier
{{Short description|Machine learning model type}}
An associative classifier (AC) is a kind of supervised learning model that uses association rules to assign a target value. The term associative classification was coined by Bing Liu et al.,{{Cite journal|last1=Liu|first1=Bing|last2=Hsu|first2=Wynne|last3=Ma|first3=Yiming|date=1998|title=Integrating Classification and Association Rule Mining|pages=80––86|citeseerx=10.1.1.48.8380}} in which the authors defined a model made of rules "whose right-hand side are restricted to the classification class attribute".
Model
The model generated by an AC and used to label new records consists of association rules, where the consequent corresponds to the class label. As such, they can also be seen as a list of "if-then" clauses: if the record matches some criteria (expressed in the left side of the rule, also called antecedent), it is then labeled accordingly to the class on the right side of the rule (or consequent).
Most ACs read the list of rules in order, and apply the first matching rule to label the new record.{{Cite journal|last=Thabtah|first=Fadi|date=2007|title=A review of associative classification mining|journal=The Knowledge Engineering Review|volume=22|issue=1|pages=37–65|doi=10.1017/s0269888907001026|s2cid=15986963 |issn=0269-8889|url=http://eprints.hud.ac.uk/id/eprint/269/1/ThabtahReview.pdf}}
Metrics
The rules of an AC inherit some of the metrics of association rules, like the support or the confidence.{{Cite book|title=Recent Advances in Data Mining of Enterprise Data: Algorithms and Applications|last1=Liao|first1=T Warren|last2=Triantaphyllou|first2=Evangelos|s2cid=34599426|date=2008|publisher=WORLD SCIENTIFIC|isbn=9789812779854|series=Series on Computers and Operations Research|doi=10.1142/6689}} Metrics can be used to order or filter the rules in the model{{Cite web|url=http://cgi.csc.liv.ac.uk/~frans/KDD/Software/CBA/cba.html|title=CBA homepage|access-date=2018-10-04}} and to evaluate their quality.
Implementations
The first proposal of a classification model made of association rules was FBM. The approach was popularized by CBA, although other authors had also previously proposed the mining of association rules for classification.{{Cite journal|last1=Ali|first1=Kamal|last2=Manganaris|first2=Stefanos|last3=Srikant|first3=Ramakrishnan|date=1997-08-14|title=Partial classification using association rules|url=http://dl.acm.org/citation.cfm?id=3001392.3001412|publisher=AAAI Press|pages=115–118|series=KDD'97}} Other authors have since then proposed multiple changes to the initial model, like the addition of a redundant rule pruning phase{{Cite book|last1=Wenmin Li|last2=Jiawei Han|last3=Jian Pei|title=Proceedings 2001 IEEE International Conference on Data Mining |chapter=CMAR: Accurate and efficient classification based on multiple class-association rules |pages=369–376|publisher=IEEE Comput. Soc|doi=10.1109/icdm.2001.989541|isbn=978-0769511191|year=2001|citeseerx=10.1.1.13.219|s2cid=2243455}} or the exploitation of Emerging Patterns.{{Citation|last1=Dong|first1=Guozhu|title=CAEP: Classification by Aggregating Emerging Patterns|date=1999|work=Discovery Science|pages=[https://archive.org/details/discoveryscience0000unse/page/30 30–42]|publisher=Springer Berlin Heidelberg|isbn=9783540667131|last2=Zhang|first2=Xiuzhen|last3=Wong|first3=Limsoon|last4=Li|first4=Jinyan|doi=10.1007/3-540-46846-3_4|citeseerx=10.1.1.37.3226|s2cid=6465422 |url-access=registration|url=https://archive.org/details/discoveryscience0000unse/page/30}}
Notable implementations include:
- CMAR{{Cite web|url=https://cgi.csc.liv.ac.uk/~frans/KDD/Software/CMAR/cmar.html|title=CMAR Implementation|website=cgi.csc.liv.ac.uk|access-date=2018-10-04}}
- CPAR{{Citation|last1=Yin|first1=Xiaoxin|title=CPAR: Classification based on Predictive Association Rules|date=2003|work=Proceedings of the 2003 SIAM International Conference on Data Mining|pages=331–335|publisher=Society for Industrial and Applied Mathematics|isbn=9780898715453|last2=Han|first2=Jiawei|doi=10.1137/1.9781611972733.40|citeseerx=10.1.1.12.7268}}{{Cite web|url=https://cgi.csc.liv.ac.uk/~frans/KDD/Software/FOIL_PRM_CPAR/foilPrmCpar.html|title=THE LUCS-KDD IMPLEMENTATIONS OF THE FOIL, PRM AND CPAR ALGORITHMS|website=cgi.csc.liv.ac.uk|access-date=2018-10-04}}
- L³{{Cite journal|last1=Baralis|first1=E.|last2=Chiusano|first2=S.|last3=Garza|first3=P.|date=2008|title=A Lazy Approach to Associative Classification|journal=IEEE Transactions on Knowledge and Data Engineering|volume=20|issue=2|pages=156–171|doi=10.1109/tkde.2007.190677|s2cid=14829459|issn=1041-4347}}{{Cite web|url=http://dbdmg.polito.it/wordpress/research/associative-classification/|title=L3 implementation|website=dbdmg.polito.it|access-date=2018-10-08}}
- CAEP
- GARC{{Cite journal|last1=Chen|first1=Guoqing|last2=Liu|first2=Hongyan|last3=Yu|first3=Lan|last4=Wei|first4=Qiang|last5=Zhang|first5=Xing|date=2006|title=A new approach to classification based on association rule mining|journal=Decision Support Systems|volume=42|issue=2|pages=674–689|doi=10.1016/j.dss.2005.03.005|issn=0167-9236}}
- ADT.{{Cite book|last1=Wang|first1=Ke|last2=Zhou|first2=Senqiang|last3=He|first3=Yu|title=Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining |chapter=Growing decision trees on support-less association rules |date=2000|pages=265–269 |location=New York, New York, USA|publisher=ACM Press|doi=10.1145/347090.347147|isbn=978-1581132335|citeseerx=10.1.1.36.9265|s2cid=8296096}}
References
{{reflist}}