Phan, Huy, Hertel, Lars, Maass, Marco, Koch, Philipp, Mertins, Alfred (2016) Label Tree Embeddings for Acoustic Scene Classification. In: Proceedings of the 24th ACM international conference on Multimedia. . pp. 486-490. ACM, Amsterdam, The Netherlands ISBN 978-1-4503-3603-1. (doi:10.1145/2964284.2967268) (KAR id:72682)
PDF
Author's Accepted Manuscript
Language: English |
|
Download this file (PDF/200kB) |
Preview |
Request a format suitable for use with assistive technology e.g. a screenreader | |
Official URL: http://dx.doi.org/10.1145/2964284.2967268 |
Abstract
We present in this paper an efficient approach for acoustic scene classification by exploring the structure of class labels. Given a set of class labels, a category taxonomy is automatically learned by collectively optimizing a clustering of the labels into multiple meta-classes in a tree structure. An acoustic scene instance is then embedded into a low-dimensional feature representation which consists of the likelihoods that it belongs to the meta-classes. We demonstrate state-of-the-art results on two different datasets for the acoustic scene classification task, including the DCASE 2013 and LITIS Rouen datasets.
Item Type: | Conference or workshop item (Proceeding) |
---|---|
DOI/Identification number: | 10.1145/2964284.2967268 |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing |
Depositing User: | Huy Phan |
Date Deposited: | 25 Feb 2019 16:16 UTC |
Last Modified: | 05 Nov 2024 12:35 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/72682 (The current URI for this page, for reference purposes) |
- Link to SensusAccess
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):