Hong, Xin, Li, Hailin, Miller, Paul, Zhou, Jianjiang, Li, Ling, Crookes, Danny, Lu, Yonggang, Li, Xuelong, Zhou, Huiyu (2019) Component-based Feature Saliency for Clustering. IEEE Transactions on Knowledge and Data Engineering, . ISSN 1041-4347. E-ISSN 1558-2191. (doi:10.1109/TKDE.2019.2936847) (KAR id:78093)
PDF
Author's Accepted Manuscript
Language: English |
|
Download this file (PDF/2MB) |
Preview |
Request a format suitable for use with assistive technology e.g. a screenreader | |
Official URL: https://doi.org/10.1109/TKDE.2019.2936847 |
Abstract
Simultaneous feature selection and clustering is a major challenge in unsupervised learning. In particular, there has been significant research into saliency measures for features that result in good clustering. However, as datasets become larger and more complex, there is a need to adopt a finer-grained approach to saliency by measuring it in relation to a part of a model. Another issue is learning the feature saliency and advanced model parameters. We address the first by presenting a novel Gaussian mixture model, which explicitly models the dependency of individual mixture components on each feature giving a new component-based feature saliency measure. For the second, we use Markov Chain Monte Carlo sampling to estimate the model and hidden variables. Using a synthetic dataset, we demonstrate the superiority of our approach, in terms of clustering accuracy and model parameter estimation, over an approach using a model-based feature saliency with expectation maximisation. We performed an evaluation of our approach with six synthetic trajectory datasets. To demonstrate the generality of our approach, we applied it to a network traffic flow dataset for intrusion detection. Finally, we performed a comparison with state-of-the-art clustering techniques using three real-world trajectory datasets of vehicle traffic.
Item Type: | Article |
---|---|
DOI/Identification number: | 10.1109/TKDE.2019.2936847 |
Subjects: | Q Science |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing |
Depositing User: | Caroline Li |
Date Deposited: | 31 Oct 2019 00:55 UTC |
Last Modified: | 05 Nov 2024 12:42 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/78093 (The current URI for this page, for reference purposes) |
- Link to SensusAccess
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):