Skip to main content
Kent Academic Repository

Hierarchical Information Clustering by Means of Topologically Embedded Graphs

Song, Won-Min, Di Matteo, T., Aste, Tomaso (2012) Hierarchical Information Clustering by Means of Topologically Embedded Graphs. PLoS ONE, 7 (3). e31929. ISSN 1932-6203. (doi:10.1371/journal.pone.0031929) (KAR id:29167)

Abstract

We introduce a graph-theoretic approach to extract clusters and hierarchies in complex data-sets in an unsupervised and deterministic manner, without the use of any prior information. This is achieved by building topologically embedded networks containing the subset of most significant links and analyzing the network structure. For a planar embedding, this method provides both the intra-cluster hierarchy, which describes the way clusters are composed, and the inter-cluster hierarchy which describes how clusters gather together. We discuss performance, robustness and reliability of this method by first investigating several artificial data-sets, finding that it can outperform significantly other established approaches. Then we show that our method can successfully differentiate meaningful clusters and hierarchies in a variety of real data-sets. In particular, we find that the application to gene expression patterns of lymphoma samples uncovers biologically significant groups of genes which play key-roles in diagnosis, prognosis and treatment of some of the most relevant human lymphoid malignancies.

Item Type: Article
DOI/Identification number: 10.1371/journal.pone.0031929
Subjects: Q Science
Divisions: Divisions > Division of Natural Sciences > Physics and Astronomy
Depositing User: Tomaso Aste
Date Deposited: 20 Mar 2012 16:00 UTC
Last Modified: 05 Nov 2024 10:10 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/29167 (The current URI for this page, for reference purposes)

University of Kent Author Information

Aste, Tomaso.

Creator's ORCID:
CReDIT Contributor Roles:
  • Depositors only (login required):

Total unique views for this document in KAR since July 2020. For more details click on the image.