6-12 - "Butter Lyrics Over Hominy Grit": Comparing Audio and Psychology-based Text Features in MIR Tasks
Jaehun Kim, Andrew M. Demetriou, Sandy Manolios, M. Stella Tavella, Cynthia C. S. Liem
Keywords: MIR fundamentals and methodology, Lyrics and other textual data, web mining, and natural language , Applications, Music recommendation and playlist generation, Domain knowledge, Machine learning/Artificial intelligence for music, Evaluation, datasets, and reproducibility, MIR tasks, Automatic classification
Abstract:
Psychology research has shown that song lyrics are a rich source of data, yet they are often overlooked in the field of MIR compared to audio. In this paper, we provide an initial assessment of the usefulness of features drawn from lyrics for various fields, such as MIR and Music Psychology. To do so, we assess the performance of lyric-based text features on 3 MIR tasks, in comparison to audio features. Specifically, we draw sets of text features from the field of Natural Language Processing and Psychology. Further, we estimate their effect on performance while statistically controlling for the effect of audio features, by using a hierarchical regression statistical model. Lyric-based features show a small but statistically significant effect, that anticipates further research. Implications and directions for future studies are discussed.