Audio Phrases for Audio Event Recognition

Phan, Huy, Hertel, Lars, Maass, Marco, Mazur, Radoslaw, Mertins, Alfred (2015) Audio Phrases for Audio Event Recognition. In: 23rd European Signal Processing Conference (EUSIPCO 2015). pp. 2546-2550. IEEE, Nice, France. E-ISBN 978-0-9928626-3-3. (doi:10.1109/EUSIPCO.2015.7362844)

PDF - Author's Accepted Manuscript
Official URL
https://doi.org/10.1109/EUSIPCO.2015.7362844

Abstract

The bag-of-audio-words approach has been widely used for audio event recognition. In these models, a local feature of an audio signal is matched to a code word according to a learned codebook. The signal is then represented by the frequencies of the matched code words over the whole signal. In this paper, we present an improved model based on the idea of audio phrases, which are sequences of multiple audio words. By using audio phrases, we are able to capture the relationships between otherwise isolated audio words and produce more semantically meaningful descriptors. Furthermore, we propose an efficient approach to learning a compact codebook in a discriminative manner to deal with the high dimensionality of bag-of-audio-phrases representations. Experiments on the Freiburg-106 dataset show that our proposed bag-of-audio-phrases descriptor outperforms not only the baselines but also the state-of-the-art results on the dataset.
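
The representation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy codebook, and the reading of an audio phrase as a run of consecutive code words (an n-gram) are all assumptions made for illustration, and the discriminative codebook learning step is not reproduced here.

```python
import numpy as np

def quantize(features, codebook):
    """Map each local feature (row) to the index of its nearest code word."""
    # Squared Euclidean distance between every feature and every code word.
    d = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)

def bag_of_words(words, vocab_size):
    """Normalized histogram of code-word occurrences over the whole signal."""
    hist = np.bincount(words, minlength=vocab_size).astype(float)
    return hist / max(hist.sum(), 1.0)

def bag_of_phrases(words, vocab_size, length=2):
    """Normalized histogram of consecutive word sequences.

    Assumption: a phrase is an n-gram of adjacent code words.
    """
    n = len(words) - length + 1
    if n <= 0:
        return np.zeros(vocab_size ** length)
    # Encode each n-gram as a single integer in base vocab_size.
    idx = np.zeros(n, dtype=np.int64)
    for k in range(length):
        idx = idx * vocab_size + words[k:k + n]
    hist = np.bincount(idx, minlength=vocab_size ** length).astype(float)
    return hist / hist.sum()

# Hypothetical toy data: 3 code words and 4 local features in 2 dimensions.
codebook = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 0.0]])
features = np.array([[0.1, 0.0], [0.9, 1.1], [1.0, 0.9], [2.1, 0.1]])

words = quantize(features, codebook)
bow = bag_of_words(words, len(codebook))
bop = bag_of_phrases(words, len(codebook), length=2)
```

In this sketch the phrase histogram has `vocab_size ** length` bins, so it grows quickly with the codebook size and the phrase length. That growth is the dimensionality problem the abstract refers to when it motivates a compact codebook.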

Item Type: Conference or workshop item (Proceeding)
DOI/Identification number: 10.1109/EUSIPCO.2015.7362844
Uncontrolled keywords: audio phrase, bag-of-words, audio event, recognition, human activity
Divisions: Faculties > Sciences > School of Computing
Depositing User: Huy Phan
Date Deposited: 25 Feb 2019 16:42 UTC
Last Modified: 03 Jun 2019 09:28 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/72687 (The current URI for this page, for reference purposes)
Phan, Huy: https://orcid.org/0000-0003-4096-785X