Browse by Person (creator, editor, contributor, etc.)
Up a level |
Jump to: Article | Conference or workshop item
Number of items: 10.
Article
Phan, Huy and Hertel, Lars and Maass, Marco and Koch, Philipp and Mazur, Radoslaw and Mertins, Alfred (2017) Improved Audio Scene Classification based on Label-Tree Embeddings and Convolutional Neural Networks. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25 (6). pp. 1278-1290. ISSN 2329-9290. E-ISSN 2329-9304. (doi:https://doi.org/10.1109/TASLP.2017.2690564) (Full text available) |
Phan, Huy and Hertel, Lars and Maass, Marco and Mazur, Radoslaw and Mertins, Alfred (2016) Learning Representations for Nonspeech Audio Events through Their Similarities to Speech Patterns. IEEE/ACM Transactions on Audio, Speech and Language Processing, 24 (4). pp. 807-822. ISSN 2329-9290. E-ISSN 2329-9304. (doi:https://doi.org/10.1109/TASLP.2016.2530401) (Full text available) |
Conference or workshop item
Phan, Huy and Koch, Philipp and Hertel, Lars and Maass, Marco and Mazur, Radoslaw and Mertins, Alfred (2017) CNN-LTE: A Class of 1-X Pooling Convolutional Neural Networks on Label Tree Embeddings for Audio Scene Classification. In: 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing Proceedings. IEEE, New Orleans, USA pp. 136-140. ISBN 978-1-5090-4117-6. (doi:https://doi.org/10.1109/ICASSP.2017.7952133) (Full text available) |
Phan, Huy and Hertel, Lars and Maass, Marco and Koch, Philipp and Mertins, Alfred (2016) Label Tree Embeddings for Acoustic Scene Classification. In: Proceedings of the 24th ACM international conference on Multimedia. ACM, Amsterdam, The Netherlands pp. 486-490. ISBN 978-1-4503-3603-1. (doi:https://doi.org/10.1145/2964284.2967268) (Full text available) |
Phan, Huy and Hertel, Lars and Maass, Marco and Mertins, Alfred (2016) Robust Audio Event Recognition with 1-Max Pooling Convolutional Neural Networks. In: Proceedings of Interspeech. ISCA, San Francisco, USA pp. 3653-3657. (doi:https://doi.org/10.21437/Interspeech.2016-123) (Full text available) |
Hertel, Lars and Phan, Huy and Mertins, Alfred (2016) Comparing Time and Frequency Domain for Audio Event Recognition Using Deep Learning. In: IEEE International Joint Conference on Neural Networks (IJCNN 2016). IEEE, Vancouver, BC, Canada pp. 3407-3411. ISBN 978-1-5090-0619-9. (doi:https://doi.org/10.1109/IJCNN.2016.7727635) (Full text available) |
Phan, Huy and Maass, Marco and Hertel, Lars and Mazur, Radoslaw and McLoughlin, Ian Vince and Merins, Alfred (2016) Learning Compact Structural Representations For Audio Events Using Regressor Banks. In: 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing Proceedings. IEEE ISBN 978-1-4799-9988-0. (doi:https://doi.org/10.1109/ICASSP.2016.7471667) (The full text of this publication is not currently available from this repository. You may be able to access a copy if URLs are provided) |
Phan, Huy and Maass, Marco and Hertel, Lars and Mazur, Radoslaw and Mertins, Alfred (2015) A Multi-Channel Fusion Framework for Audio Event Detection. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2015). IEEE, New York, USA pp. 1-5. ISBN 978-1-4799-7450-4. (doi:https://doi.org/10.1109/WASPAA.2015.7336889) (Full text available) |
Phan, Huy and Hertel, Lars and Maass, Marco and Mazur, Radoslaw and Mertins, Alfred (2015) Representing Nonspeech Audio Signals through Speech Classification Models. In: 16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015). ISCA, Dresden, Germany pp. 3441-3445. (Full text available) |
Phan, Huy and Hertel, Lars and Maass, Marco and Mazur, Radoslaw and Mertins, Alfred (2015) Audio Phrases for Audio Event Recognition. In: 23rd European Signal Processing Conference (EUSIPCO 2015). IEEE, Nice, France pp. 2546-2550. E-ISBN 978-0-9928626-3-3. (doi:https://doi.org/10.1109/EUSIPCO.2015.7362844) (Full text available) |