Carvalho, Deborah R. and Freitas, Alex A. and Ebecken, Nelson (2003) A critical review of rule surprisingness measures. In: Ebecken, Nelson and Brebbia, C.A. and Zanasi, A., eds. Data Mining. Management Information Systems, IV . WIT Press, pp. 545-556. ISBN 978-1-85312-806-6. (doi:10.2495/DATA030531) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:13867)
The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided. | |
Official URL: http://dx.doi.org/10.2495/DATA030531 |
Abstract
In data mining it is usually desirable that discovered knowledge have some characteristics such as being as accurate as possible, comprehensible and surprising to the user. The vast majority of data mining algorithms produce, as part of their results, information of a statistical nature that allows the user to assess how accurate and reliable the discovered knowledge is. However, in many cases this is not enough for the user. Even if discovered knowledge is highly accurate from a statistical point of view, it might not be interesting for the user. Few data mining algorithms produce, as part of their results, a measure of the degree of surprisingness of discovered knowledge. However, these measures can be computed in a post-processing phase, as a form of additional evaluation of the quality of discovered knowledge, complementing (rather than replacing) statistical measures of discovered knowledge accuracy. This papers presents a review of four measures of classification-rule surprisingness, discussing their main characteristics, advantages and disadvantages. Hence, the main contribution of this paper is to improve our understanding of these rule surprisingness measures, which is a step towards solving the very difficult problem of selecting the "best" rule surprisingness measure for a given application domain.
Item Type: | Book section |
---|---|
DOI/Identification number: | 10.2495/DATA030531 |
Uncontrolled keywords: | data mining, rule surprisingness, rule induction |
Subjects: | Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming, |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing |
Depositing User: | Mark Wheadon |
Date Deposited: | 24 Nov 2008 18:00 UTC |
Last Modified: | 05 Nov 2024 09:47 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/13867 (The current URI for this page, for reference purposes) |
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):