Redundancy, Deduction Schemes, and Minimum-Size Bases for Association Rules

Jose L. Balcazar

doi:10.2168/LMCS-6(2:4)2010

Jose L. Balcazar - Redundancy, Deduction Schemes, and Minimum-Size Bases for Association Rules

lmcs:812 - Logical Methods in Computer Science, June 27, 2010, Volume 6, Issue 2 - https://doi.org/10.2168/LMCS-6(2:4)2010

Redundancy, Deduction Schemes, and Minimum-Size Bases for Association RulesArticle

Authors: Jose L. Balcazar

Association rules are among the most widely employed data analysis methods in the field of Data Mining. An association rule is a form of partial implication between two sets of binary variables. In the most common approach, association rules are parameterized by a lower bound on their confidence, which is the empirical conditional probability of their consequent given the antecedent, and/or by some other parameter bounds such as "support" or deviation from independence. We study here notions of redundancy among association rules from a fundamental perspective. We see each transaction in a dataset as an interpretation (or model) in the propositional logic sense, and consider existing notions of redundancy, that is, of logical entailment, among association rules, of the form "any dataset in which this first rule holds must obey also that second rule, therefore the second is redundant". We discuss several existing alternative definitions of redundancy between association rules and provide new characterizations and relationships among them. We show that the main alternatives we discuss correspond actually to just two variants, which differ in the treatment of full-confidence implications. For each of these two notions of redundancy, we provide a sound and complete deduction calculus, and we show how to construct complete bases (that is, axiomatizations) of absolutely minimum size in terms of the number of rules. We explore finally an approach to redundancy with respect to several association rules, and fully characterize its simplest case of two partial premises.

Comment: LMCS accepted paper

https://doi.org/10.2168/LMCS-6(2:4)2010

Source: arXiv.org:1002.4286

Volume: Volume 6, Issue 2

Published on: June 27, 2010

Imported on: January 13, 2009

Keywords: Computer Science - Logic in Computer Science, Computer Science - Artificial Intelligence, I.2.3, H.2.8, I.2.4, G.2.3, F.4.1

Licence: Attribution 3.0 Unported (CC BY 3.0)

Classifications

Mathematics Subject Classification 2020¹

Sources:

[1] zbMATH Open.

Bibliographic References

17 Documents citing this article

Andrés Martínez;Manuel J. Cuesta;Victor Peralta, 2021, Dependence Graphs Based on Association Rules to Explore Delusional Experiences, Multivariate Behavioral Research, 57, 2-3, pp. 458-477, 10.1080/00273171.2020.1870912.

Cristina Tîrnăucă;José L. Balcázar;Domingo Gómez-Pérez, 2020, Closed-Set-Based Discovery of Representative Association Rules, International Journal of Foundations of Computer Science, 31, 01, pp. 143-156, 10.1142/s0129054120400109.

Albert Atserias;José L. Balcázar;Marie Ely Piceno, 2019, Relative Entailment Among Probabilistic Implications, Logical Methods in Computer Science, Volume 15, Issue 1, 10.23638/lmcs-15(1:10)2019, https://doi.org/10.23638/lmcs-15(1:10)2019.

Sophie Tourret;Andrew Cropper, 2019, SLD-Resolution Reduction of Second-Order Horn Fragments, pp. 259-276, 10.1007/978-3-030-19570-0_17, https://hal.science/hal-02988015.

Wilhelmiina Hämäläinen;Geoffrey I. Webb, 2018, A tutorial on statistically sound pattern discovery, Data Mining and Knowledge Discovery, 33, 2, pp. 325-377, 10.1007/s10618-018-0590-x, https://doi.org/10.1007/s10618-018-0590-x.

Carlos Molina;Belén Prados-Suárez;Antonio Cortes-Romero, 2017, Bankruptcy Scenario Query: B-SQ, Lecture notes in computer science, pp. 295-306, 10.1007/978-3-319-67582-4_21.

Carlos Molina;Belen Prados-Suárez;Daniel Sanchez, 2016, Scenario Query Based on Association Rules (SQAR), Communications in computer and information science, pp. 537-548, 10.1007/978-3-319-40596-4_45.

Marcel Wild, 2016, The joy of implications, aka pure Horn formulas: Mainly a survey, arXiv (Cornell University), 658, pp. 264-292, 10.1016/j.tcs.2016.03.018, http://arxiv.org/abs/1411.6432.

Albert Atserias;Jose L. Balcazar, 2015, Entailment among Probabilistic Implications, QRU Quaderns de Recerca en Urbanisme, 29, pp. 621-632, 10.1109/lics.2015.63, http://hdl.handle.net/2117/79017.

José L. Balcázar, 2015, Quantitative Redundancy in Partial Implications, QRU Quaderns de Recerca en Urbanisme, pp. 3-20, 10.1007/978-3-319-19545-2_1, https://hdl.handle.net/2117/83945.

Marti Zamora;Manel Baradad;Ester Amado;Silvia Cordomi;Esther Limon;et al., 2015, Characterizing chronic disease and polymedication prescription patterns from electronic health records, QRU Quaderns de Recerca en Urbanisme, pp. 1-9, 10.1109/dsaa.2015.7344870, https://hdl.handle.net/2117/82778.

Gonzalo A. Aranda-Corral;Joaquín Borrego-Díaz;Juan Galán-Páez, 2014, Simulating Language Dynamics by Means of Concept Reasoning, Lecture notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, pp. 296-311, 10.1007/978-3-319-06944-9_21.

Gonzalo A. Aranda-Corral;Joaquín Borrego-Díaz;Juan Galán-Páez, 2013, Complex concept lattices for simulating human prediction in sport, Journal of Systems Science and Complexity, 26, 1, pp. 117-136, 10.1007/s11424-013-2288-x.

Gonzalo A. Aranda‐Corral;Joaquín Borrego‐Díaz;Juan Galán‐Páez, 2013, On the Phenomenological Reconstruction of Complex Systems—The Scale‐Free Conceptualization Hypothesis, Systems Research and Behavioral Science, 30, 6, pp. 716-734, 10.1002/sres.2240.

José L. Balcázar, 2013, Formal and computational properties of the confidence boost of association rules, ACM Transactions on Knowledge Discovery from Data, 7, 4, pp. 1-41, 10.1145/2541268.2541272.

Gonzalo A. Aranda-Corral;Joaquin Borrego-Diaz;Juan Gal´n-P´ez, 2011, Bounded Rationality for Data Reasoning Based on Formal Concept Analysis, 2011 22nd International Workshop on Database and Expert Systems Applications, 62, pp. 350-354, 10.1109/dexa.2011.18.

Gonzalo A. Aranda-Corral;Joaquín Borrego Díaz;Juan Galán Páez, 2011, Confidence-Based Reasoning with Local Temporal Formal Contexts, Lecture notes in computer science, pp. 461-468, 10.1007/978-3-642-21498-1_58.

Sources : OpenCitations, OpenAlex & Crossref

Share and export

Consultation statistics

This page has been seen 3061 times.

This article's PDF has been downloaded 788 times.