Holden, N. and Freitas, A.A.
(2004)
Web page classification with an ant colony algorithm.
In: Parallel Problem Solving from Nature - PPSN VIII, LNCS 3242, SEP 18-22, 2004, Univ Birmingham, Sch Comp Sci, Birmingham, ENGLAND, .
Abstract
This paper utilizes Ant-Miner - the first Ant Colony algorithm for discovering classification rules - in the field of web content mining, and shows that it is more effective than C5.0 in two sets of BBC and Yahoo web pages used in our experiments. It also investigates the benefits and dangers of several linguistics-based text preprocessing techniques to reduce the large numbers of attributes associated with web content mining.
- Depositors only (login required):