Skip to main content
Kent Academic Repository

Bag-of-features models based on C-DNN network for acoustic scene classification

Pham, Lam Dang, McLoughlin, Ian Vince, Palaniappan, Ramaswamy, Lang, Yue (2019) Bag-of-features models based on C-DNN network for acoustic scene classification. In: 2019 AES International Conference on Audio Forensics (June 2019). . (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) (KAR id:91420)

The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided. (Contact us about this Publication)
Official URL:
https://www.aes.org/e-lib/online/browse.cfm?elib=2...

Abstract

This work proposes bag-of-features deep learning models for acoustic scene classi?cation (ASC) – identifying recording locations by analyzing background sound. We explore the effect on classi?cation accuracy of various front-end feature extraction techniques, ensembles of audio channels, and patch sizes from three kinds of spectrogram. The back-end process presents a two-stage learning model with a pre-trained CNN (preCNN) and a post-trained DNN (postDNN). Additionally, data augmentation using the mixup technique is investigated for both the pre-trained and post-trained processes, to improve classi?cation accuracy through increasing class boundary training conditions. Our experiments on the 2018 Challenge on Detection and Classi?cation of Acoustic Scenes and Events - Acoustic Scene Classi?cation (DCASE2018-ASC) subtask 1A and 1B signi?cantly outperform the DCASE2018 reference implementation and approach state-of-the-art performance for each task. Results reveal that the ensemble of multi-spectrogram features and data augmentation is bene?cial to performance.

Item Type: Conference or workshop item (Proceeding)
Divisions: Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing
Depositing User: Palaniappan Ramaswamy
Date Deposited: 08 Nov 2021 11:24 UTC
Last Modified: 08 Nov 2021 11:24 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/91420 (The current URI for this page, for reference purposes)

University of Kent Author Information

Pham, Lam Dang.

Creator's ORCID:
CReDIT Contributor Roles:

McLoughlin, Ian Vince.

Creator's ORCID: https://orcid.org/0000-0001-7111-2008
CReDIT Contributor Roles:

Palaniappan, Ramaswamy.

Creator's ORCID: https://orcid.org/0000-0001-5296-8396
CReDIT Contributor Roles:
  • Depositors only (login required):

Total unique views for this document in KAR since July 2020. For more details click on the image.