Dzhambazov G, Miron M, Serra X. Lyrics to audio alignment for karaoke in pop music. Music Information Retrieval Evaluation eXchange (MIREX 2017)
We develop a large number of software tools and hosting infrastructures to support the research developed at the Department. We will be detailing in this section the different tools available. You can take a look for the moment at the offer available within the UPF Knowledge Portal, the innovations created in the context of EU projects in the Innovation Radar and the software sections of some of our research groups:
Artificial Intelligence |
Nonlinear Time Series Analysis |
Web Research |
Music Technology |
Interactive Technologies |
Barcelona MedTech |
Natural Language Processing |
Nonlinear Time Series Analysis |
UbicaLab |
Wireless Networking |
Educational Technologies |
Dzhambazov G, Miron M, Serra X. Lyrics to audio alignment for karaoke in pop music. Music Information Retrieval Evaluation eXchange (MIREX 2017)
Dzhambazov G, Miron M, Serra X. Lyrics to audio alignment for karaoke in pop music. Music Information Retrieval Evaluation eXchange (MIREX 2017)
In this paper we describe an algorithm for automatic lyricsto-audio alignment. It has as a goal the automatic detection of word boundaries in multi-instrumental English pop songs. We rely on a phonetic recognizer based on hidden Markov models: a widely-used method for tracking phonemes in speech processing problems. Tracking lyrics in music audio is harder than tracking text in speech because, unlike speech, the singing voice is mixed with multiple instruments. To address this obstacle we apply a convolution neural networks-based method for singing voice separation. We present a prototype of a practical application based on the alignment method - the highliting of lyrics in a karaoke-like fashion
Additional material:
- Software https://github.com/georgid