Elisabeth Georgii, Lothar Richter, Ulrich Rückert, and Stefan Kramer (2005)
Analyzing Microarray Data Using Quantitative Association Rules
Motivation: We tackle the problem of finding regularities in microarray data. A wide variety of data mining tools such as clustering, classification, Bayesian networks and association rules have been applied so far to gain insight into gene expression data. Association rule mining techniques used so far work on discretizations of the data and cannot account for cumulative effects. In this paper, we investigate the use of quantitative association rules that can operate directly on numeric data and represent cumulative effects of variables. Technically speaking, this type of quantitative association rules based on half-spaces can find non-axis-parallel regularities. Results: We performed a variety of experiments testing the utility of quantitative association rules for microarray data. First of all, the results should be statistically significant and robust against fluctuations in the data. Next, the approach should be scalable in the number of variables, which is important for such high-dimensional data. Finally, the rules should make sense biologically and be sufficiently different from rules found in regular association rule mining working with discretizations. In all of these dimensions, the proposed approach performed satisfactorily. Therefore, quantitative association rules based on half spaces should be considered as a tool for the analysis of microarray gene expression data.