Skip to main content
Kent Academic Repository

Browse by Person (creator, editor, contributor, etc.)

Up a level
Export as [feed] Atom [feed] RSS 1.0 [feed] RSS 2.0
Group by: Item Type | Date | No Grouping
Number of items: 37.

Article

Li, Zheng-xi and Song, Yan and Dai, Li-Rong and McLoughlin, Ian Vince (2019) Listening and grouping: an online autoregressive approach for monaural speech separation. IEEE Transactions On Audio Speech And Language Processing, . ISSN 1558-7916. E-ISSN 2329-9304. (doi:https://doi.org/10.1109/TASLP.2019.2892241) (Access to this publication is currently restricted. You may be able to access a copy if URLs are provided)
[img]

Li, Zheng-xi and Dai, Li-Rong and Song, Yan and McLoughlin, Ian Vince (2018) A Conditional Generative Model for Speech Enhancement. Circuits, Systems, and Signal Processing, . ISSN 0278-081X. E-ISSN 1531-5878. (doi:https://doi.org/10.1007/s00034-018-0798-4) (Full text available)
[img]
Preview

Jin, Ma and Song, Yan and McLoughlin, Ian Vince and Dai, Li-Rong (2017) LID-senones and their statistics for language identification. Ieee Transactions On Audio Speech And Language Processing, 26 (1). pp. 171-183. ISSN 1558-7916. E-ISSN 2329-9304. (doi:https://doi.org/10.1109/TASLP.2017.2766023) (Access to this publication is currently restricted. You may be able to access a copy if URLs are provided)
[img]
Preview
[img]

McLoughlin, Ian Vince and Zhang, Hao-min and Xie, Zhi-Peng and Song, Yan and Xiao, Wei and Phan, Huy (2017) Continuous Robust Sound Event Classification Using Time-Frequency Features and Deep Learning. PLoS ONE, 12 (9). e0182309. ISSN 1932-6203. (doi:https://doi.org/10.1371/journal.pone.0182309) (Full text available)
[img]
Preview

McLoughlin, Ian Vince and Li, Jingjie and Song, Yan and Sharifzadeh, Hamid Reza (2017) Speech reconstruction using a deep partially supervised neural network. IET Healthcare Technology Letters, 4 (4). pp. 129-133. ISSN 2053-3713. E-ISSN 2053-3713. (doi:https://doi.org/10.1049/htl.2016.0103) (Full text available)
[img]
Preview

Xie, Zhi-Peng and McLoughlin, Ian Vince and Zhang, Hao-min and Song, Yan and Xiao, Wei (2016) A new variance-based approach for discriminative feature extraction in machine hearing classification using spectrogram features. Digital Signal Processing, . ISSN 1051-2004. (doi:https://doi.org/10.1016/j.dsp.2016.04.005) (Access to this publication is currently restricted. You may be able to access a copy if URLs are provided)
[img]
Preview
[img]

Xu, Yan and McLoughlin, Ian Vince and Song, Yan and Wu, Kui (2015) Improved i-vector representation for speaker diarization. Circuits, Systems, and Signal Processing, . pp. 1-12. ISSN 0278-081X. E-ISSN 1531-5878. (doi:https://doi.org/10.1007/s00034-015-0206-2) (Full text available)
[img]
Preview

McLoughlin, Ian Vince and Sharifzadeh, Hamid Reza and Tan, Su Lim and Li, Jingjie and Song, Yan (2015) Reconstruction of Phonated Speech from Whispers Using Formant-Derived Plausible Pitch Modulation. ACM Transactions on Accessible Computing, 6 (4). pp. 1-21. (doi:https://doi.org/10.1145/2737724) (Full text available)
[img]
Preview

McLoughlin, Ian Vince and Zhang, Hao-min and Xie, Zhi-Peng and Song, Yan and Xiao, Wei (2015) Robust Sound Event Classification using Deep Neural Networks. Audio, Speech, and Language Processing, IEEE/ACM Transactions on, 23 (3). pp. 540-552. ISSN 2329-9290. (doi:https://doi.org/10.1109/TASLP.2015.2389618) (Full text available)
[img]
Preview

McLoughlin, Ian Vince and Song, Yan (2014) Mouth State Detection From Low-Frequency Ultrasonic Reflection. Circuits, Systems, and Signal Processing, 34 (4). pp. 1279-1304. ISSN 0278-081X. E-ISSN 1531-5878. (doi:https://doi.org/10.1007/s00034-014-9904-4) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

Jiang, Bing and Song, Yan and Wei, Si and Liu, Jun-Hua and McLoughlin, Ian Vince and Dai, Li-Rong (2014) Deep bottleneck features for spoken language identification. PloS one, 9 (7). e100795-e100795. (doi:https://doi.org/10.1371/journal.pone.0100795) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

Song, Yan and McLoughlin, Ian Vince and Dai, Li-Rong (2014) Local coding based matching kernel method for image classification. PloS one, 9 (8). e103575-e103575. (doi:https://doi.org/10.1371/journal.pone.0103575) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

Jiang, Bing and Song, Yan and Wei, Si and McLoughlin, Ian Vince and Dai, Li-Rong (2014) Task-Aware Deep Bottleneck Features for Spoken Language Identification. Interspeech 2014, . (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

McLoughlin, Ian Vince and Li, Jingjie and Song, Yan (2013) Reconstruction of Continuous Voiced Speech from Whispers. Proc. Interspeech 2013, (August). pp. 1022-1026. (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

Conference or workshop item

Li, Zengxi and Song, Yan and Dai, Li-Rong and McLoughlin, Ian Vince (2018) Source-Aware Context Network for Single-Channel Multi-speaker Speech Separation. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE pp. 681-685. ISBN 978-1-5386-4659-5. E-ISBN 978-1-5386-4658-8. (doi:https://doi.org/10.1109/ICASSP.2018.8461578) (Full text available)
[img]
Preview

Tang, Jian and Song, Yan and Dai, Li-Rong and McLoughlin, Ian Vince (2018) Acoustic Modeling with Densely Connected Residual Network for Multichannel Speech Recognition. In: ISCA Conference. (doi:https://doi.org/10.21437/Interspeech.2018-1089) (Full text available)
[img]
Preview

Li, Pengcheng and Song, Yan and McLoughlin, Ian Vince and Guo, Wu and Dai, Li-Rong (2018) An Attention Pooling based Representation Learning Method for Speech Emotion Recognition. In: ISCA Conference. International Speech Communication Association (doi:https://doi.org/10.21437/Interspeech.2018-1242) (Full text available)
[img]
Preview

McLoughlin, Ian Vince and Song, Yan and Pham, Lam Dang and Palaniappan, Ramaswamy and Phan, Huy and Lang, Yue (2018) Early detection of continuous and partial audio events using CNN. In: Proceedings of Interspeech. International Speech Communication Association pp. 3314-3318. (doi:https://doi.org/10.21437/Interspeech.2018-1821) (Full text available)
[img]
Preview

Gao, Zhifu and Song, Yan and McLoughlin, Ian Vince and Guo, Wu and Dai, Li-Rong (2018) An Improved Deep Embedding Learning Method for Short Duration Speaker Verification. In: ISCA Conference. International Speech Communication Association (doi:https://doi.org/10.21437/Interspeech.2018-1515) (Full text available)
[img]
Preview

Song, Yan and Wang, Peiseng and Hong, Xinhai and McLoughlin, Ian Vince (2018) Fisher Vector based CNN architecture for Image Classification. In: 2017 IEEE International Conference on Image Processing. IEEE ISBN 978-1-5090-2175-8. (doi:https://doi.org/10.1109/ICIP.2017.8296344) (Access to this publication is currently restricted. You may be able to access a copy if URLs are provided)
[img]

Jin, Ma and Song, Yan and McLoughlin, Ian Vince and Guo, Wu and Dai, Li-Rong (2017) End-to-End Language Identification Using High-Order Utterance Representation with Bilinear Pooling. In: The proceedings of Interspeech 2017. International Speech Communication Society pp. 2571-2575. (doi:https://doi.org/10.21437/Interspeech.2017-44) (Full text available)
[img]
Preview

Jin, Ma and Song, Yan and McLoughlin, Ian Vince (2017) End-to-end DNN-CNN Classification for Language Identification. In: Proceedings of The World Congress on Engineering 2017. IAENG pp. 119-203. ISBN 978-988-14-0474-9. (Full text available)
[img]
Preview

Song, Yan and Hong, Xinhai and McLoughlin, Ian Vince and Dai, Li-Rong (2017) Image Classification with CNN-based Fisher Vector Coding. In: Proceedings of the IEEE International Conference on Visual Communications and Image Processing 2016. IEEE ISBN 978-1-5090-5317-9. E-ISBN 978-1-5090-5316-2. (doi:https://doi.org/10.1109/VCIP.2016.7805494) (Full text available)
[img]
Preview

Zhang, Hao-min and McLoughlin, Ian Vince and Song, Yan (2016) Robust Sound Event Detection in Continuous Audio Environments. In: Understanding speech processing in humans and machines : 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016). Red Hook, NY Curran Associates ISBN 978-1-5108-3313-5. (doi:https://doi.org/10.21437/Interspeech.2016-392) (Full text available)
[img]
Preview

Song, Yan and Cui, Ruilian and Hong, Xinhai and McLoughlin, Ian Vince and Shi, Jiong and Dai, Lirong (2016) Improved language identification using deep bottleneck network. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, South Brisbane, QLD pp. 4200-4204. (doi:https://doi.org/10.1109/ICASSP.2015.7178762) (Full text available)
[img]
Preview

Song, Yan and Cui, Ruilian and McLoughlin, Ian Vince and Dai, Li-Rong (2016) Improvements on Deep Bottleneck Network based I-Vector Representation for Spoken Language Identification. In: Odyssey 2016: The Speaker and Language Recognition Workshop. pp. 140-145. (Full text available)
[img]
Preview

Ma, Jin and Song, Yan and McLoughlin, Ian Vince and Dai, Li-Rong and Ye, Zhong-Fu (2016) LID-senone Extraction via Deep Neural Networks for End-to-End Language Identification. In: Odyssey 2016: The Speaker and Language Recognition Workshop. pp. 210-216. (Full text available)
[img]
Preview

Li, Zheng-xi and Song, Yan and McLoughlin, Ian Vince and Dai, Li-Rong (2016) Compact Convolutional Neural Network Transfer Learning For Small-Scale Image Classification. In: Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on. Proceedings 2016 International Conference on Acoustics, Speech and Signal Processing. IEEE E-ISBN 978-1-4799-9988-0. (doi:https://doi.org/10.1109/ICASSP.2016.7472175) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

Zhang, Haomin and McLoughlin, Ian Vince and Song, Yan (2016) Robust sound event recognition using convolutional neural networks. In: IEEE International Conference on Acoustics Speech and Signal Processing. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers, South Brisbane, QLD pp. 559-563. (doi:https://doi.org/10.1109/ICASSP.2015.7178031) (Full text available)
[img]
Preview

Song, Yan and Hong, Xinhai and Jiang, Bing and Cui, Ruilian and McLoughlin, Ian Vince and Dai, Lirong (2015) Deep Bottleneck Network based i-vector representation for Language Identification. In: Proc. Interspeech 2015. (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

McLoughlin, Ian Vince and Song, Yan (2015) Low Frequency Ultrasonic Voice Activity Detection using Convolutional Neural Networks. In: Proc. Interspeech 2015. (Unpublished) (Full text available)
[img]
Preview

Song, Yan and McLoughlin, Ian Vince and Dai, Li-Rong (2015) Deep Bottleneck Feature for Image Classification. In: ACM International Conference on Multimedia Information Retrieval (ICMR), June 2015, Shanghai. (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

Song, Yan and McLoughlin, Ian Vince and Dai, Lirong (2015) Deep Bottleneck Feature for Image Classification. In: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval. ACM, New York, NY, USA pp. 491-494. ISBN 978-1-4503-3274-3. (doi:https://doi.org/10.1145/2671188.2749314) (Full text available)
[img]
Preview

Jiang, Bing and Song, Yan and Wei, Si and Wang, Meng-Ge and McLoughlin, Ian Vince and Dai, Li-Rong (2014) Performance evaluation of deep bottleneck features for spoken language identification. In: Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on. Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on. IEEE pp. 143-147. ISBN 978-1-4799-4219-0. (doi:https://doi.org/10.1109/ISCSLP.2014.6936580) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

Li, Jingjie and McLoughlin, Ian Vince and Song, Yan (2014) Reconstruction of pitch for whisper-to-speech conversion of Chinese. In: , The 9th International Symposium on Chinese Spoken Language Processing. IEEE pp. 206-210. ISBN 978-1-4799-4219-0. (doi:https://doi.org/10.1109/ISCSLP.2014.6936709) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

McLoughlin, Ian Vince and Xu, Yan and Song, Yan (2014) Tone confusion in spoken and whispered Mandarin Chinese. In: , The 9th International Symposium on Chinese Spoken Language Processing. 12-14 Sept. 2014. IEEE pp. 313-316. ISBN 978-1-4799-4219-0. (doi:https://doi.org/10.1109/ISCSLP.2014.6936708) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

Song, Yan and Guo, Wu and Dai, Li-Rong and McLoughlin, Ian Vince (2014) A spectral based visual matching method for image classification. In: , 2014 International Conference on Audio, Language and Image Processing. 7-9 July 2014. IEEE pp. 666-670. ISBN 978-1-4799-3903-9. (doi:https://doi.org/10.1109/ICALIP.2014.7009878) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided)

This list was generated on Sun May 26 21:45:24 2019 BST.