Skip to main content
Kent Academic Repository

Deep feature embedding and hierarchical classification for audio scene classification

Pham, Lam Dang, McLoughlin, Ian Vince, Palaniappan, Ramaswamy, Mertins, Alfred (2020) Deep feature embedding and hierarchical classification for audio scene classification. In: 2020 International Joint Conference on Neural Networks (IJCNN), 19-24 July 2020, Glasgow, UK. (doi:10.1109/IJCNN48605.2020.9206866) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:91413)

The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided. (Contact us about this Publication)
Official URL:
https://ieeexplore.ieee.org/abstract/document/9206...

Abstract

In this work, we propose an approach that features deep feature embedding learning and hierarchical classification with triplet loss function for Acoustic Scene Classification (ASC). In the one hand, a deep convolutional neural network is firstly trained to learn a feature embedding from scene audio signals. Via the trained convolutional neural network, the learned embedding embeds an input into the embedding feature space and transforms it into a high-level feature vector for representation. In the other hand, in order to exploit the structure of the scene categories, the original scene classification problem is structured into a hierarchy where similar categories are grouped into meta-categories. Then, hierarchical classification is accomplished using deep neural network classifiers associated with triplet loss function. Our experiments show that the proposed system achieves good performance on both the DCASE 2018 Task 1A and 1B datasets, resulting in accuracy gains of 15.6% and 16.6% absolute over the DCASE 2018 baseline on Task 1A and 1B, respectively.

Item Type: Conference or workshop item (Proceeding)
DOI/Identification number: 10.1109/IJCNN48605.2020.9206866
Subjects: Q Science > QA Mathematics (inc Computing science) > QA 75 Electronic computers. Computer science
Divisions: Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing
Depositing User: Palaniappan Ramaswamy
Date Deposited: 08 Nov 2021 10:52 UTC
Last Modified: 04 Mar 2024 18:08 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/91413 (The current URI for this page, for reference purposes)

University of Kent Author Information

Pham, Lam Dang.

Creator's ORCID:
CReDIT Contributor Roles:

McLoughlin, Ian Vince.

Creator's ORCID: https://orcid.org/0000-0001-7111-2008
CReDIT Contributor Roles:

Palaniappan, Ramaswamy.

Creator's ORCID: https://orcid.org/0000-0001-5296-8396
CReDIT Contributor Roles:
  • Depositors only (login required):

Total unique views for this document in KAR since July 2020. For more details click on the image.