Slizovskaia O, Kim L, Haro G, Gomez E. End-to-End Sound Source Separation Conditioned On Instrument Labels. arXiv pre-print
We develop a large number of software tools and hosting infrastructures to support the research developed at the Department. We will be detailing in this section the different tools available. You can take a look for the moment at the offer available within the UPF Knowledge Portal, the innovations created in the context of EU projects in the Innovation Radar and the software sections of some of our research groups:
Artificial Intelligence |
Nonlinear Time Series Analysis |
Web Research |
Music Technology |
Interactive Technologies |
Barcelona MedTech |
Natural Language Processing |
Nonlinear Time Series Analysis |
UbicaLab |
Wireless Networking |
Educational Technologies |
Slizovskaia O, Kim L, Haro G, Gomez E. End-to-End Sound Source Separation Conditioned On Instrument Labels. arXiv pre-print
Slizovskaia O, Kim L, Haro G, Gomez E. End-to-End Sound Source Separation Conditioned On Instrument Labels. arXiv pre-print
Can we perform an end-to-end sound source separation (SSS) with a variable number of sources using a deep learning model? This paper presents an extension of the Wave-U-Net model which allows end-to-end monaural source separation with a non-fixed number of sources. Furthermore, we propose multiplicative conditioning with instrument labels at the bottleneck of the Wave-U-Net and show its effect on the separation results. This approach can be further extended to other types of conditioning such as audio-visual SSS and score-informed SSS.
Code and datasets: https://github.com/Veleslavia/vimss