To read the full version of this content please select one of the options below:

Directed extended dependency analysis for data mining

Thaddeus T. Shannon (Portland State University, Portland, Oregon, USA)
Martin Zwick (Portland State University, Portland, Oregon, USA)

Kybernetes

ISSN: 0368-492X

Article publication date: 1 June 2004

Abstract

Extended dependency analysis (EDA) is a heuristic search technique for finding significant relationships between nominal variables in large data sets. The directed version of EDA searches for maximally predictive sets of independent variables with respect to a target dependent variable. The original implementation of EDA was an extension of reconstructability analysis. Our new implementation adds a variety of statistical significance tests at each decision point that allow the user to tailor the algorithm to a particular objective. It also utilizes data structures appropriate for the sparse data sets customary in contemporary data mining problems. Two examples that illustrate different approaches to assessing model quality tests are given in this paper.

Keywords

Citation

Shannon, T.T. and Zwick, M. (2004), "Directed extended dependency analysis for data mining", Kybernetes, Vol. 33 No. 5/6, pp. 973-983. https://doi.org/10.1108/03684920410534010

Publisher

:

Emerald Group Publishing Limited

Copyright © 2004, Emerald Group Publishing Limited