Publications-ppc
BAF: an audio fingerprinting dataset for broadcast monitoring
Authors
Cortès G, Ciurana A, Molina E, Miron M, Meyers O, Six J, Serra X
UPF authors
Type
Conference proceedings
Book Authors
AA. VVV.
Book title
Proceedings 23rd International Society for Music Information Retrieval Conference (ISMIR 2022)
Publisher
ISMIR
Publication year
2022
Pages
908-916
ISBN
978-1-7327299-2-6
Abstract
Audio Fingerprinting (AFP) is a well-studied problem in music information retrieval for various use-cases e.g. content-based copy detection, DJ-set monitoring, and music excerpt identification. However, AFP for continuous broadcast monitoring (e.g. for TV & Radio), where music is often in the background, has not received much attention despite its importance to the music industry. In this paper (1) we present BAF, the first public dataset for music monitoring in broadcast. It contains 74 hours of production music from Epidemic Sound and 57 hours of TV audio recordings. Furthermore, BAF provides cross-annotations with exact matching timestamps between Epidemic tracks and TV recordings. Approximately, 80% of the total annotated time is background music. (2) We benchmark BAF with public state-of-the-art AFP systems, together with our proposed baseline PeakFP: a simple, non-scalable AFP algorithm based on spectral peak matching. In this benchmark, none of the algorithms obtain a F1-score above 47%, pointing out that further research is needed to reach the AFP performance levels in other studied use cases. The dataset, baseline, and benchmark framework are open and available for research.
Complete citation
Cortès G, Ciurana A, Molina E, Miron M, Meyers O, Six J, Serra X. BAF: an audio fingerprinting dataset for broadcast monitoring. In: AA. VVV.. Proceedings 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). 1 ed. Bengaluru: ISMIR; 2022. p. 908-916.