Holden, Nicholas and Freitas, Alex A. (2004) Web page classification with an ant colony algorithm. In: Parallel Problem Solving from Nature - PPSN VIII 8th International Conference. Lecture Notes in Computer Science. Springer, Berlin, Germany, pp. 1092-1102. ISBN 3-540-23092-0. (doi:10.1007/978-3-540-30217-9_110) (KAR id:14076)
Language: English
Official URL: http://dx.doi.org/10.1007/978-3-540-30217-9_110
Abstract
This paper utilizes Ant-Miner - the first Ant Colony algorithm for discovering classification rules - in the field of web content mining, and shows that it is more effective than C5.0 in two sets of BBC and Yahoo web pages used in our experiments. It also investigates the benefits and dangers of several linguistics-based text preprocessing techniques to reduce the large numbers of attributes associated with web content mining.
Item Type: | Book section |
---|---|
DOI/Identification number: | 10.1007/978-3-540-30217-9_110 |
Subjects: | Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing |
Depositing User: | Mark Wheadon |
Date Deposited: | 24 Nov 2008 18:01 UTC |
Last Modified: | 05 Nov 2024 09:48 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/14076 (The current URI for this page, for reference purposes) |