Noda, Edgar and Freitas, Alex A. (2006) Discovering knowledge nuggets with a genetic algorithm. In: Triantaphyllou, Evangelos and Felici, Giovanni, eds. Data Mining and Knowledge Discovery Approaches Based on Rule Induction Techniques. Massive Computing. Springer, New York, New York (USA), pp. 395-432. ISBN 978-0-387-34294-8. (doi:10.1007/0-387-34296-6_12) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:14456)
Official URL: http://dx.doi.org/10.1007/0-387-34296-6_12
Abstract
Measuring the quality of a prediction rule is a difficult task, which can involve several criteria. The majority of the rule induction literature focuses on discovering accurate, comprehensible rules. In this chapter we also take these two criteria into account, but we go beyond them in the sense that we aim at discovering rules that are interesting (surprising) for the user. Hence, the search for rules is guided by a rule-evaluation function that considers both the degree of predictive accuracy and the degree of interestingness of candidate rules. The search is performed by two versions of a genetic algorithm (GA) specifically designed for the discovery of interesting rules, or “knowledge nuggets.” The algorithm addresses the dependence modeling task (sometimes called “generalized rule induction”), where different rules can predict different goal attributes. This task can be regarded as a generalization of the well-known classification task, where all rules predict the same goal attribute. This chapter also compares the results of the two versions of the GA with those of a simpler, greedy rule induction algorithm for discovering interesting rules.
Item Type: | Book section |
---|---|
DOI/Identification number: | 10.1007/0-387-34296-6_12 |
Uncontrolled keywords: | genetic algorithms, data mining, classification rules |
Subjects: | Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming, |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing |
Depositing User: | Mark Wheadon |
Date Deposited: | 24 Nov 2008 18:04 UTC |
Last Modified: | 05 Nov 2024 09:48 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/14456 (The current URI for this page, for reference purposes) |