Back Slizovskaia O, Haro G, Gómez E. Conditioned Source Separation for Music Instrument Performances. arXiv preprint

Slizovskaia O, Haro G, Gómez E. Conditioned Source Separation for Music Instrument Performances

Separating different music instruments playing the same piece is a challenging task since the different audio sources are synchronized and playing in harmony. Moreover, the number of sources may vary for each piece and some of the sources may belong to the same family of instruments, thus sharing timbral characteristics and making the sources more correlated.

This paper proposes a source separation method for multiple musical instruments sounding simultaneously and explores how much additional information apart from the audio stream can lift the quality of source separation. We explore conditioning techniques at different levels of a primary source separation network and utilize two extra modalities of data, namely presence or absence of instruments in the mixture, and the corresponding video stream data.