Pham, Lam Dang, McLoughlin, Ian Vince, Palaniappan, Ramaswamy, Mertins, Alfred (2020) Deep feature embedding and hierarchical classification for audio scene classification. In: 2020 International Joint Conference on Neural Networks (IJCNN), 19-24 July 2020, Glasgow, UK. (doi:10.1109/IJCNN48605.2020.9206866) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:91413)
The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided. (Contact us about this Publication) | |
Official URL: https://ieeexplore.ieee.org/abstract/document/9206... |
Abstract
In this work, we propose an approach that features deep feature embedding learning and hierarchical classification with triplet loss function for Acoustic Scene Classification (ASC). In the one hand, a deep convolutional neural network is firstly trained to learn a feature embedding from scene audio signals. Via the trained convolutional neural network, the learned embedding embeds an input into the embedding feature space and transforms it into a high-level feature vector for representation. In the other hand, in order to exploit the structure of the scene categories, the original scene classification problem is structured into a hierarchy where similar categories are grouped into meta-categories. Then, hierarchical classification is accomplished using deep neural network classifiers associated with triplet loss function. Our experiments show that the proposed system achieves good performance on both the DCASE 2018 Task 1A and 1B datasets, resulting in accuracy gains of 15.6% and 16.6% absolute over the DCASE 2018 baseline on Task 1A and 1B, respectively.
Item Type: | Conference or workshop item (Proceeding) |
---|---|
DOI/Identification number: | 10.1109/IJCNN48605.2020.9206866 |
Subjects: | Q Science > QA Mathematics (inc Computing science) > QA 75 Electronic computers. Computer science |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing |
Depositing User: | Palaniappan Ramaswamy |
Date Deposited: | 08 Nov 2021 10:52 UTC |
Last Modified: | 04 Mar 2024 18:08 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/91413 (The current URI for this page, for reference purposes) |
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):