Slizovskaia O, Kim L, Haro G, Gomez E. End-to-End Sound Source Separation Conditioned On Instrument Labels. arXiv pre-print

Can we perform end-to-end sound source separation (SSS) with a variable number of sources using a deep learning model? This paper presents an extension of the Wave-U-Net model that allows end-to-end monaural source separation with a non-fixed number of sources. Furthermore, we propose multiplicative conditioning with instrument labels at the bottleneck of the Wave-U-Net and show its effect on the separation results. This approach can be further extended to other types of conditioning, such as audio-visual SSS and score-informed SSS.
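To illustrate what multiplicative conditioning at a bottleneck can look like, here is a minimal NumPy sketch. It is not taken from the paper's code: the function name `multiplicative_conditioning`, the linear projection `weights`, and all tensor shapes are assumptions made for illustration. The sketch projects a multi-hot instrument label vector to one scale factor per bottleneck channel and multiplies the bottleneck features by those factors.

```python
import numpy as np

def multiplicative_conditioning(bottleneck, labels, weights):
    """Scale bottleneck features channel-wise using instrument labels.

    bottleneck: (channels, time) feature map at the network bottleneck
    labels:     (num_instruments,) multi-hot vector of instruments present
    weights:    (channels, num_instruments) hypothetical learned projection
    """
    # Project the label vector to one multiplicative factor per channel.
    scale = weights @ labels                      # shape: (channels,)
    # Broadcast the per-channel factors across the time axis.
    return bottleneck * scale[:, None]

# Toy data; every shape and value here is an assumption for illustration.
rng = np.random.default_rng(0)
num_instruments, channels, time_steps = 4, 8, 16
bottleneck = rng.standard_normal((channels, time_steps))
weights = rng.standard_normal((channels, num_instruments))
labels = np.array([1.0, 0.0, 1.0, 0.0])           # instruments 0 and 2 present

conditioned = multiplicative_conditioning(bottleneck, labels, weights)
```

In this sketch, an all-zero label vector zeroes the bottleneck entirely. A real model might add a bias or a nonlinearity to the projection; the summary above does not specify such details, so they are left out here.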

Code and datasets: https://github.com/Veleslavia/vimss

Link: https://arxiv.org/abs/1811.01850

DTIC MdM Strategic Program: Artificial and Natural Intelligence for ICT and beyond
