Ambiguity Modelling with Label Distribution Learning for Music Classification

Authors

Buisson M, Alonso-Jiménez P, Bogdanov D

Type

Conference proceedings

Book Authors

Li H, Furui S (eds,)

Book title

2022 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Publisher

IEEE Signal Processing Society

Publication year

2022

Pages

611-615

ISBN

978-1-6654-0540-9

Abstract

An important amount of work has been devoted to the task of music classification. Despite promising results achieved by convolutional neural networks, there still exists a gap left to be filled for such models to perform well in real-world applications. In this work, we address the issue of ambiguity that can arise in many classification problems. We propose a method based on adaptive label smoothing that aims at implicitly modelling perceptual vagueness among classes to improve both training and testing performances. We assess our method using two state-of-the-art CNN architectures for audio classification on a variety of music mood and genre classification tasks. We show that the proposed strategy brings consistent improvements over the traditional approach, significantly improves generalization to external audio collections and emphasizes how crucial information carried by labels can be in an ambiguous music classification context.

Complete citation

Buisson M, Alonso-Jiménez P, Bogdanov D. Ambiguity Modelling with Label Distribution Learning for Music Classification. In: Li H, Furui S (eds,). 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 1 ed. Singapur: IEEE Signal Processing Society; 2022. p. 611-615.

Bibliometric indicators

1 times cited

0 times cited