Source-Aware Context Network for Single-Channel Multi-speaker Speech Separation

Li, Zengxi, Song, Yan, Dai, Li-Rong, McLoughlin, Ian Vince (2018) Source-Aware Context Network for Single-Channel Multi-speaker Speech Separation. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). pp. 681-685. IEEE ISBN 978-1-5386-4659-5. E-ISBN 978-1-5386-4658-8. (doi:10.1109/ICASSP.2018.8461578) (KAR id:67161)

PDF Author's Accepted Manuscript
Language: English
Official URL: https://doi.org/10.1109/ICASSP.2018.8461578

Abstract

Deep learning based approaches have achieved promising performance in speaker-dependent single-channel multi-speaker speech separation. However, partly due to the label permutation problem, they may encounter difficulties in speaker-independent conditions. Recent methods address this problem through assignment operations. In contrast, we propose a novel source-aware context network, which explicitly takes the speech sources as input alongside the mixture signal. By exploiting the temporal dependency and continuity of the same source signal, the permutation order of the outputs can be determined without any additional post-processing. Furthermore, a Multi-time-step Prediction Training strategy is proposed to address the mismatch between the training and inference stages. Experimental results on the benchmark WSJ0-2mix dataset show that our network achieves comparable or better results than state-of-the-art methods in both closed-set and open-set conditions, in terms of Signal-to-Distortion Ratio (SDR) improvement.
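
The label permutation problem mentioned in the abstract can be illustrated with a toy sketch. This is not the paper's source-aware context network; it shows one common kind of assignment step (an exhaustive search over output-to-target orderings) that the proposed network is designed to avoid. The function names, the mean-squared-error criterion, and the toy signals are all assumptions made for illustration.

```python
from itertools import permutations


def mse(a, b):
    """Mean squared error between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def best_assignment(outputs, targets):
    """Return (loss, perm), where perm[i] is the target index matched to outputs[i].

    Hypothetical helper: tries every output-to-target ordering and keeps
    the one with the lowest mean MSE.
    """
    best = None
    for perm in permutations(range(len(targets))):
        loss = sum(mse(outputs[i], targets[p]) for i, p in enumerate(perm)) / len(outputs)
        if best is None or loss < best[0]:
            best = (loss, perm)
    return best


# Toy data (assumed): two "speakers", with network outputs in swapped order.
targets = [[1.0, 1.0, 1.0], [-1.0, -1.0, -1.0]]
outputs = [[-0.9, -1.1, -1.0], [1.0, 0.9, 1.1]]

# Scoring with a fixed output order penalises a separation that is actually good.
fixed_order_loss = (mse(outputs[0], targets[0]) + mse(outputs[1], targets[1])) / 2

# Searching over orderings recovers the swapped assignment and a small loss.
loss, perm = best_assignment(outputs, targets)
print(fixed_order_loss, loss, perm)
```

In this sketch the fixed-order loss is large even though each output closely matches one of the speakers, which is why speaker-independent training needs either an assignment step like this or, as the abstract proposes, a network whose output order is determined by the source context itself.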

Item Type: Conference or workshop item (Proceeding)
DOI/Identification number: 10.1109/ICASSP.2018.8461578
Subjects: T Technology
Divisions: Faculties > Sciences > School of Computing
Faculties > Sciences > School of Computing > Data Science
Depositing User: Ian McLoughlin
Date Deposited: 30 May 2018 11:59 UTC
Last Modified: 29 May 2019 20:35 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/67161 (The current URI for this page, for reference purposes)
McLoughlin, Ian Vince: https://orcid.org/0000-0001-7111-2008