A critical review of rule surprisingness measures

Carvalho, Deborah R. and Freitas, Alex A. and Ebecken, Nelson (2003) A critical review of rule surprisingness measures. In: Ebecken, Nelson and Brebbia, C.A. and Zanasi, A., eds. Data Mining. Management Information Systems, IV . WIT Press, pp. 545-556. ISBN 978-1-85312-806-6. (doi:10.2495/DATA030531) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:13867)

The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided.
Official URL: http://dx.doi.org/10.2495/DATA030531

Abstract

In data mining it is usually desirable that discovered knowledge have some characteristics such as being as accurate as possible, comprehensible and surprising to the user. The vast majority of data mining algorithms produce, as part of their results, information of a statistical nature that allows the user to assess how accurate and reliable the discovered knowledge is. However, in many cases this is not enough for the user. Even if discovered knowledge is highly accurate from a statistical point of view, it might not be interesting for the user. Few data mining algorithms produce, as part of their results, a measure of the degree of surprisingness of discovered knowledge. However, these measures can be computed in a post-processing phase, as a form of additional evaluation of the quality of discovered knowledge, complementing (rather than replacing) statistical measures of discovered knowledge accuracy. This papers presents a review of four measures of classification-rule surprisingness, discussing their main characteristics, advantages and disadvantages. Hence, the main contribution of this paper is to improve our understanding of these rule surprisingness measures, which is a step towards solving the very difficult problem of selecting the "best" rule surprisingness measure for a given application domain.

Item Type:	Book section
DOI/Identification number:	10.2495/DATA030531
Uncontrolled keywords:	data mining, rule surprisingness, rule induction
Subjects:	Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming,
Divisions:	Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing
Depositing User:	Mark Wheadon
Date Deposited:	24 Nov 2008 18:00 UTC
Last Modified:	05 Nov 2024 09:47 UTC
Resource URI:	https://kar.kent.ac.uk/id/eprint/13867 (The current URI for this page, for reference purposes)

University of Kent Author Information

Freitas, Alex A..

Creator's ORCID:	https://orcid.org/0000-0001-9825-4700
CReDIT Contributor Roles:

Depositors only (login required):

Altmetric

Total Views

Total unique views of this page since July 2020. For more details click on the image.