Software & Datasets

Open Science and Reproducibility are core goals of the MTG, promoting collaborations by making sure that our research results can be used by other researchers and by society at large. Here we highlight some of the software tools and datasets developed as part of our research and maintained by MTG researchers. All our open source projects are available from our GitHub repository, and all our open datasets are available from Zenodo. Please review the terms and conditions of use stated in each software tool and dataset to make sure that they allow your intended use.

Apart from research collaborations, we are also interested in technology transfer, offering commercial licenses for exploiting our technology portfolio in industrial applications. Contact us for any further information.

Software

ESSENTIA:

Software library and AI models for audio and music analysis, description, and synthesis.

ESSENTIA.JS:

JavaScript (JS) library for music/audio signal analysis and processing powered by Essentia.

MTG Toolbox:

Audio & MIDI Applications.

SMS TOOLS:

Sound analysis/synthesis tools for music applications.

DUNYA DESKTOP:

Modular and extensible desktop application to explore the Dunya corpora.

GAIA:

Software library to apply similarity measures and classifications on the results of audio analysis.

...more software

Corpora

DUNYA:

Music corpora of several non-western music repertoires and related software tools.

FREESOUND:

Collaborative database of Creative Commons Licensed sounds.

ACOUSTICBRAINZ:

Crowdsourced acoustic information of songs available under open licenses.

Dataset platforms

FREESOUND ANNOTATOR:

Platform for the collaborative creation of open audio collections from Freesound.

MIRDATA:

Library of tools for accessing and working with datasets of relevance to the field of Music Information Retrieval (MIR).

SOUNDATA:

Python library for downloading, loading & working with sound datasets.

Datasets

COMPMUSIC datasets:

Collection of datasets related to the Dunya/Compmusic corpora.

Freesound datasets:

Datasets using sounds from Freesound.

AdoVoc Pro:

Monophonic and polyphonic audio files of a set of common Flamenco singing.

Choral Singing Dataset:

Audio recordings of 3 a cappella pieces with their associated MIDI files.

Dagstuhl ChoirSet:

Multitrack dataset of a cappella choral music.

Da-TACOS:

A Dataset for Cover Song Identification and Understanding.

DREANSS:

Annotations of drum events within known music audio recordings datasets.

EEP:

Multimodal recordings of string quartet performances.

FLABASE:

Knowledge Base of flamenco music.

GIANTSTEPS Key:

Key annotations of a music audio collection.

GIANTSTEPS Tempo:

Tempo annotations of a music audio collection.

GOOD-SOUNDS:

Recordings of single notes and scales played by several instruments.

HAYDN QUARTETS:

Scores and harmonic annotations of Haydn's String Quartets Op. 20.

IRMAS:

Musical audio excerpts with annotations of the predominant instruments.

ISMIR04 genre:

ISMIR 2004 Genre Identification task dataset.

JAAH:

Audio-aligned jazz harmony dataset.

KBSF:

Knowledge Base automatically extracted from songfacts.com.

Last.fm Dataset 360k users - Last.fm Dataset 1k users:

<user, artist-mbid, artist-name, total-plays> tuples from Last.fm.

MARD:

Text and accompanying metadata of Amazon customer reviews.

MASS:

Multi-track recordings for audio source separation research.

MAST:

Rhythmic pattern reproductions with grades for a subset of performances.

MEDIAEVAL ACOUSTICBRAINZ GENRE:

AcousticBrainz music features and genre/subgenre annotations extracted from AllMusic, Discogs, Last.fm, and Tagtraum.

MELON PLAYLIST:

148,826 playlists, with 649,091 songs. Genre, tag information plus mel-spectrograms for each song.

MTG-Jamendo:

55,000 full audio tracks with 195 tags from genre, instrument, and mood/theme categories.

MTG-QBH:

Recordings of sung melodies for Query-by-Humming research.

MusAV:

Benchmark dataset of relative arousal/valence annotations for validation of audio models for music emotion recognition.

OpenBMAT:

Open dataset for the tasks of music detection and relative music loudness estimation.

ORCHSET:

Orchestral music excerpts with annotations for melody extraction research.

PHENICX Anechoic:

Denoised recordings and note annotations for the Aalto anechoic orchestral database.

PHENICX conduct dataset:

Different Motion Capture (MoCap) recordings of conducting movements.

PHENICX emotion:

Excerpts of the Eroica Symphony by Beethoven plus audio descriptors.

PHENICX Symphonies Recordings:

Multimodal recordings of orchestra performances.

QUARTET:

Multimodal data of string quartet performances.

SAS:

List of artists and biographical information for semantic artist similarity research.

Song Describer Dataset:

A corpus of audio captions for music and language evaluation.

TONAS:

Flamenco a cappella sung melodies with manual transcriptions.

... more datasets