Discovering knowledge nuggets with a genetic algorithm

Noda, Edgar and Freitas, Alex A. (2006) Discovering knowledge nuggets with a genetic algorithm. In: Triantaphyllou, Evangelos and Felici, Giovanni, eds. Data Mining and Knowledge Discovery Approaches Based on Rule Induction Techniques. Massive Computing . Springer, New York, New York (USA), pp. 395-432. ISBN 978-0-387-34294-8. (doi:10.1007/0-387-34296-6_12) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:14456)

The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided.
Official URL: http://dx.doi.org/10.1007/0-387-34296-6_12
Additional URLs: http://www.cs.kent.ac.uk/people/staff/aa...

Abstract

Measuring the quality of a prediction rule is a difficult task, which can involve several criteria. The majority of the rule induction literature focuses on discovering accurate, comprehensible rules. In this chapter we also take these two criteria into account, but we go beyond them in the sense that we aim at discovering rules that are interesting (surprising) for the user. Hence, the search for rules is guided by a rule-evaluation function that considers both the degree of predictive accuracy and the degree of interestingness of candidate rules. The search is performed by two versions of a genetic algorithm (GA) specifically designed to the discovery of interesting rules - or “knowledge nuggets.” The algorithm addresses the dependence modeling task (sometimes called “generalized rule induction”), where different rules can predict different goal attributes. This task can be regarded as a generalization of the very well known classification task, where all rules predict the same goal attribute. This chapter also compares the results of the two versions of the GA with the results of a simpler, greedy rule induction algorithm to discover interesting rules.

Item Type:	Book section
DOI/Identification number:	10.1007/0-387-34296-6_12
Uncontrolled keywords:	genetic algorithms, data mining, classification rules
Subjects:	Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming,
Institutional Unit:	Schools > School of Computing
Former Institutional Unit:	Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing
Depositing User:	Mark Wheadon
Date Deposited:	24 Nov 2008 18:04 UTC
Last Modified:	20 May 2025 10:05 UTC
Resource URI:	https://kar.kent.ac.uk/id/eprint/14456 (The current URI for this page, for reference purposes)

University of Kent Author Information

Freitas, Alex A..

Creator's ORCID:	https://orcid.org/0000-0001-9825-4700
CReDIT Contributor Roles:

Depositors only (login required):

Altmetric

Total Views

Total unique views of this page since July 2020. For more details click on the image.