No results founds

Perez M.; Kirchhoff H.; Grosche P.; Serra X.. Improving Singing Voice Transcription Generalization with AI Generated Accompaniments. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2025; 15521.

Serra X. Studying a Musical Repertoire with Computational Approaches: The Case of Carnatic Music. In: ICASSP, K. V. S. Hari, V. John Mathews, Institute of Electrical and Electronics Engineers. 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing proceedings 06-11 April 2025, Hyderabad, India. IEEE; 2025. p. 1-27.

Ramoneda P.; Jeong D.; Eremenko V.; Tamer N.C.; Miron M.; Serra X.. Combining piano performance dimensions for score difficulty classification. Expert Systems with Applications 2024; 238(PartB): 1-15.

Fuentes M, Plaja-Roglans G, Cortès-Sebastià G, Khandelwal T, Miron M, Serra X, Bello JP, Salamon J. Soundata: Reproducible use of audio datasets. Journal of Open Source Software 2024; 9(98): 1-5.

Morsi A, Zhang H, Maezawa A, Dixon S, Serra X. Simulating piano performance mistakes for music learning. In: Campos G, Salselas I, Vieira J, Conceição M, Fonseca N, Penha R, Ponte A, Andrikopoulos D (eds.). 21st Sound and Music Computing Conference SMC 2024. ESMAE; 2024. p. 1-8.

Batlle-Roca R, Tyman D, Meléndez-Catalán B, Molina E, Serra X, Herrera-Boyer P. The elusive nature of inattentional deafness: assessing the influence of visual attention on background music perception in TV programmes. In: Campos G, Salselas I, Vieira J, Conceição M, Fonseca N, Penha R, Ponte A, Andrikopoulos D (eds.). 21st Sound and Music Computing Conference SMC 2024. ESMAE; 2024. p. 1-7.

Shankar A, Plaja-Roglans G, Nuttall T, Rocamora M, Serra X. Saraga Audiovisual: a large multimodal open data collection for the analysis of carnatic music. In: Kim H, Serra X (eds.). 25th International Society for Music Information Retrieval Conference (ISMIR2024). ISMIR; 2024. p. 61-69.

Ramoneda P, Eremenko VE, D'Hooge A, Parada-Cabaleiro E, Serra X. Towards Explainable and Interpretable Musical Difficulty Estimation: A Parameter-efficient Approach. In: Kim H, Serra X (eds.). 25th International Society for Music Information Retrieval Conference (ISMIR2024). ISMIR; 2024.

Batlle-Roca R, Liao WH, Serra X, Mitsufuji Y, Gómez E. Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio. In: Kim H, Serra X (eds.). 25th International Society for Music Information Retrieval Conference (ISMIR2024). ISMIR; 2024.

Narang J, Tamer NC, de la Vega V, Serra X. Automatic Estimation of Singing Voice Musical Dynamics. In: Kim H, Serra X (eds.). 25th International Society for Music Information Retrieval Conference (ISMIR2024). ISMIR; 2024.

Kim H, Serra X. A method for MIDI velocity estimation for piano performance by a U-Net with attention and FiLM. In: Kim H, Serra X (eds.). 25th International Society for Music Information Retrieval Conference (ISMIR2024). ISMIR; 2024.

Weck B, Kirchhoff H, Grosche P, Serra X. WikiMuTe: A Web-Sourced Dataset of Semantic Descriptions for Music Audio. In: Rudinac S (ed.). International Conference on Multimedia Modeling (MMM 2024). Springer Cham; 2024. p. 42-56.

Nuttall T, Serra X, Pearson L. Svara-forms and coarticulation in carnatic music: an investigation using deep clustering. In: Weigl DM (ed.). Proceedings of the 11th International Conference on Digital Libraries for Musicology (DLfM '24). ACM Association for Computer Machinery; 2024. p. 15-22.

Araz RO, Bogdanov D, Serra X. Discogs-VI: a musical version identification dataset based on public editorial metadata. In: Kaneshiro B, Mysore G, Nieto O, Donahue C, Huang CZA, Lee JH, McFee B, McCallum M [editors]. Proceedings of the 25th International Society for Music Information Retrieval Conference (ISMIR2024); 2024 November 10-14; San Francisco, USA. 2024. p. 478-485.

Ramoneda P, Parada-Cabaleiro E, Weck B, Serra X. The Role of Large Language Models in Musicology: Are We Ready to Trust the Machines?. In: Kruspe A, Oramas S, Epure, EV, Sordo M, Weck B, Doh S, Won M, Manco I, Meseguer-Brocal, G (eds.). Proceedings of the 3rd Workshop on NLP for Music and Audio (NLP4MusA). Association for Computational Lingustics; 2024. p. 81-86.

Alonso-Jimenez P, Pepino L, Batlle-Roca R, Zinemanas P, Bogdanov D, Serra X, Rocamora M. Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio. In: -. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing Workshops (ICASSPW). IEEE Xplore; 2024. p. 833-837.

Narang J, De La Vega V, Lizarraga X, Mayor O, Parra H, Janer J, Serra X. ChoralSynth: Synthetic Dataset of Choral Singing. arxiv.org; 2023.

Valero-Mas JJ, Gallego AJ, Alonso-Jimenez P, Serra X. Multilabel Prototype Generation for data reduction in K-Nearest Neighbour classification. Pattern Recognition 2023; 135.

Plaja-Roglans G, Nuttall T, Pearson L, Serra X, Miron M. Repertoire-Specific Vocal Pitch Data Generation for Improved Melodic Analysis of Carnatic Music. Transactions of the International Society for Music Information Retrieval 2023; 6(1): 13-26.

Batlle-Roca R, Herrera-Boyer P, Melendez-Catalan B, Molina E, Serra X. Towards a characterization of background music audibility in broadcasted TV. International Journal of Environmental Research and Public Health 2023; 20(1): 1-15.