Howard, Sam, Silla Jr, Carlos N., Johnson, Colin G. (2011) Automatic Lyrics-based Music Genre Classification in a Multilingual Setting. In: Proceedings of the Thirteenth Brazilian Symposium on Computer Music. . (KAR id:33266)
PDF
Language: English |
|
Download this file (PDF/130kB) |
Preview |
Request a format suitable for use with assistive technology e.g. a screenreader | |
Official URL: http://compmus.ime.usp.br/sbcm/2011/en/index.html |
Abstract
A large amount of research has been undertaken with regard to the classification of lyrics into genres, but most of this work has featured solely English lyrics. This study investigates the implications of classifying a multilingual database and the effectiveness of a number of techniques and algorithms for doing so. Part of this involves the creation of a high-quality dataset for use in this research. This paper finds that there are significant challenges in preprocessing multilingual text, and that traditional techniques like stemming and stop words may actually do more harm than good in such circumstances. It also finds that classes with strong language bias may be more likely to perform better than those with multiple languages.
Item Type: | Conference or workshop item (Paper) |
---|---|
Subjects: | Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming, |
Divisions: | Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing |
Depositing User: | Colin Johnson |
Date Deposited: | 21 Feb 2013 18:54 UTC |
Last Modified: | 05 Nov 2024 10:16 UTC |
Resource URI: | https://kar.kent.ac.uk/id/eprint/33266 (The current URI for this page, for reference purposes) |
- Link to SensusAccess
- Export to:
- RefWorks
- EPrints3 XML
- BibTeX
- CSV
- Depositors only (login required):