Pons J, Serrà J, Serra X. Training neural audio classifiers with few data. arXiv pre-print.

Thesis linked to the implementation of the María de Maeztu Strategic Research Program.

Open access to PhD thesis carried out at the Department can be found at TDX

Please visit these pages for information on our PhD, MSc and BSc programs.

Back Pons J, Serrà J, Serra X. Training neural audio classifiers with few data. arXiv pre-print.

Pons J, Serrà J, Serra X. Training neural audio classifiers with few data. arXiv pre-print.

We investigate supervised learning strategies that improve the training of neural network audio classifiers on small annotated collections. In particular, we study whether (i) a naive regularization of the solution space, (ii) prototypical networks, (iii) transfer learning, or (iv) their combination, can foster deep learning models to better leverage a small amount of training examples. To this end, we evaluate (i-iv) for the tasks of acoustic event recognition and acoustic scene classification, considering from 1 to 100 labeled examples per class. Results indicate that transfer learning is a powerful strategy in such scenarios, but prototypical networks show promising results when one does not count with external or validation data.

Code: https://github.com/jordipons/neural-classifiers-with-few-audio

Entry at the author’s blog: http://www.jordipons.me/arxiv-article-training-neural-audio-classifiers-with-few-data/

Link: https://arxiv.org/abs/1810.10274

DTIC MdM Strategic Program: Artificial and Natural Intelligence for ICT and beyond

Pons J, Serrà J, Serra X. Training neural audio classifiers with few data. arXiv pre-print.

Related Assets