Skip to main content
Kent Academic Repository

Two methods for constructing a gene ontology-based feature network for a Bayesian network classifier and applications to datasets of aging-related genes.

Wan, Cen and Freitas, Alex A. (2015) Two methods for constructing a gene ontology-based feature network for a Bayesian network classifier and applications to datasets of aging-related genes. In: Proceedings of the 6th ACM Conference on Bioinformatics, Computational Biology and Health Informatics. BCB Bioinformatics, Computational Biology and Biomedicine . ACM, New York, USA, pp. 27-36. ISBN 978-1-4503-3853-0. (doi:10.1145/2808719.2808722) (KAR id:50807)

Abstract

In the context of the classification task of data mining or machine learning, hierarchical feature selection methods exploit hierarchical relationships among features in order to select a subset of features without hierarchical redundancy. Hierarchical feature selection is a new research area in classification research, since nearly all feature selection methods ignore hierarchical relationships among features. This paper proposes two methods for constructing a network of features to be used by a Bayesian Network Augmented Naïve Bayes (BAN) classifier, in datasets of aging-related genes where Gene Ontology (GO) terms are used as hierarchically related predictive features. One of the BAN network construction method relies on a hierarchical feature selection method to detect and remove hierarchical redundancies among features (GO terms); whilst the other BAN network construction method simply uses a conventional, flat feature selection method to select features, without removing the hierarchical redundancies associated with the GO. Both BAN network construction methods may create new edges among nodes (features) in the BAN network that did not exist in the original GO DAG (Directed Acyclic Graph), in order to preserve the generalization-specialization (ancestor-descendant) relationship among selected features. Experiments comparing these two BAN network construction methods, when using two different hierarchical feature selection methods and one at feature selection method, have shown that the best results are obtained by the BAN network construction method using one type of hierarchical feature selection method, i.e., select Hierarchical Information-Preserving features (HIP).

Item Type: Book section
DOI/Identification number: 10.1145/2808719.2808722
Uncontrolled keywords: data mining, machine learning, classification, aging genes, bioinformatics, Bayesian network classifier, gene ontology
Subjects: Q Science > Q Science (General) > Q335 Artificial intelligence
Divisions: Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing
Depositing User: Alex Freitas
Date Deposited: 07 Oct 2015 15:07 UTC
Last Modified: 05 Nov 2024 10:36 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/50807 (The current URI for this page, for reference purposes)

University of Kent Author Information

Wan, Cen.

Creator's ORCID:
CReDIT Contributor Roles:

Freitas, Alex A..

Creator's ORCID: https://orcid.org/0000-0001-9825-4700
CReDIT Contributor Roles:
  • Depositors only (login required):

Total unique views for this document in KAR since July 2020. For more details click on the image.