DCASE-models: a Python library for computational environmental sound analysis using deep-learning models

  • Authors
  • Zinemanas P, Hounie I, Cancela P, Font F, Rocamora M, Serra X
  • UPF authors
  • ZINEMANAS FRIET, PABLO; SERRA CASALS, FRANCESC XAVIER; FONT CORBERA, FREDERIC
  • Authors of the book
  • Ono N, Harada N, Kawaguchi Y, Mesaros A, Imoto K, Koizumi Y, Komatsu T (eds.)
  • Book title
  • Proceedings of the Fifth Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2020)
  • Publisher
  • Zenodo
  • Publication year
  • 2020
  • Pages
  • 240-244
  • ISBN
  • 978-4-600-00566-5
  • Abstract
  • This document presents DCASE-models, an open-source Python library for rapid prototyping of environmental sound analysis systems, with an emphasis on deep-learning models. Together with a collection of functions for dataset handling, data preparation, feature extraction, and evaluation, it includes a model interface to standardize the interaction of machine learning methods with the other system components. This also provides an abstraction layer that allows the use of different machine learning backends. The package includes Python scripts, Jupyter Notebooks, and a web application, to illustrate its usefulness. The library seeks to alleviate the process of releasing and maintaining the code of new models, improve research reproducibility, and simplify comparison of methods. We expect it to become a valuable resource for the community.
  • Complete citation
  • Zinemanas P, Hounie I, Cancela P, Font F, Rocamora M, Serra X. DCASE-models: a Python library for computational environmental sound analysis using deep-learning models. In: Ono N, Harada N, Kawaguchi Y, Mesaros A, Imoto K, Koizumi Y, Komatsu T (eds.). Proceedings of the Fifth Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2020). 1 ed. Tokyo: Zenodo; 2020. p. 240-244.