Publications
2026 (3)Ibáñez-Martínez L; Nkama C; Poltronieri A; Serra X; Rocamora M. Evaluating disentangled representations for controllable music generation. In: IEEE. ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2026. p. 15092-15096. |
|
López-Ayala D, Cabello A, Zinemanas P, Molina E, Rocamora M. AI-generated music detection in broadcast monitoring. In: IEEE. ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE; 2026. p. 12342-12346. |
|
Serra X; Araz RO; Batlle-Roca R; Juvela L; López D; Rocamora M. Technical solutions for marking and detecting AI-generated audio content in the context of article 50(2) AI Act: final study report. European Comission; 2026. |
2025 (10)Anzibar Fialho M.; Rocamora M.; Ziegler L.. Detection of anthropogenic noise pollution as a possible chronic stressor in Antarctic Specially Protected Area N°150, Ardley Island. Ecological Informatics 2025; 87(0). |
|
Shankar A; Schweinitz S; Plaja-Roglans G; Serra X; Rocamora M. Disentangling Overlapping Sources: Improving Vocal and Violin Source Separation in Carnatic Music. In: -. 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops. Piscataway: IEEE; 2025. |
|
Acevedo E, Espinosa MN, Stolovas I, Rocamora M, Steinfeld L. Enhancing the recording and analysis of Antarctic soundscapes. In: -. 2025 IEEE Latin Conference on IoT (LCIoT). IEEE; 2025. p. 125-128. |
|
Azziz J, Lema J, Steinfeld L, Acevedo E, Rocamora M. Selective audio recording device for wildlife research using embedded machine learning. In: -. 2025 IEEE Latin Conference on IoT (LCIoT). IEEE; 2025. p. 65-68. |
|
Batlle-Roca R; Ibáñez-Martínez L; Serra X; Gómez E; Rocamora M. MusGO: a community-driven framework for assessing openness in music-generative AI. In: -. 26th International Society for Music Information Retrieval Conference (ISMIR 2025). 2025. |
|
Poltronieri A; Serra X; Rocamora M. From discord to harmony: decomposed consonance-based training for improved audio chord estimation. In: -. 26th International Society for Music Information Retrieval Conference (ISMIR 2025). 2025. |
|
Plaja-Roglans G; Serra X; Rocamora M. Leveraging Carnatic live recordings for singing voice separation using regression-guided latent diffusion. In: -. 26th International Society for Music Information Retrieval Conference (ISMIR 2025). 2025. |
|
Acevedo E, Rocamora M, Fuentes M. Domain Adaptation Method and Modality Gap Impact in Audio-Text Models for Prototypical Sound Classification. In: -. Proceedings of Interspeech 2025. International Speech Communication Association; 2025. p. 1328-1332. |
|
Azziz J, Lema J, Anzibar M, Ziegler L, Steinfeld L, Rocamora M. Assessing a domain-adaptive deployment workflow for selective audio recording in wildlife acoustic monitoring. In: Benetos E, Font F, Fuentes M, Martin Morato I, Rocamora M (eds.). Proceedings of the 10th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2025). DCASE; 2025. p. 200-204. |
|
Mishra A, Prabhu S, Haki B, Rocamora M. Learning Microrhythm in Uruguayan Candombe using Transformers. In: -. Proceedings of the International Computer Music Conference (ICMC). 2025. |
2024 (7)Ozaki Y.; Tierney A.; Pfordresher P.Q.; McBride J.M.; Benetos E.; Proutskova P.; Chiba G.; Liu F.; Jacoby N.; Purdy S.C.; Opondo P.; Tecumseh Fitch W.; Hegde S.; Rocamora M.; Thorne R.; Nweke F.; Sadaphal D.P.; Sadaphal P.M.; Hadavi S.; Fujii S.; Choo S.; Naruse M.; Ehara U.; Sy L.; Parselelo M.L.; Anglada-Tort M.; Chr. Hansen N.; Haiduk F.; Færøvik U.; Magalhães V.; Krzy¿anowski W.; Shcherbakova O.; Hereld D.; Barbosa B.S.; Varella M.A.C.; van Tongeren M.; Dessiatnitchenko P.; Zar S.Z.; Kahla I.E.; Muslu O.; Troy J.; Lomsadze T.; Kurdova D.; Tsope C.; Fredriksson D.; Arabadjiev A.; Sarbah J.P.; Arhine A.; Meachair T.; Silva-Zurita J.; Soto-Silva I.; Millalonco N.E.M.; Ambrazevi¿ius R.; Loui P.; Ravignani A.; Jadoul Y.; Larrouy-Maestri P.; Bruder C.; Teyxokawa T.P.; Kuikuro U.; Natsitsabui R.; Sagarzazu N.B.; Raviv L.; Zeng M.; Varnosfaderani S.D.; Gomez-Cañon J.S.; Kolff K.; der Nederlanden C.V.B.; Chhatwal M.; David R.M.; Putu Gede Setiawan I.; Lekakul G.; Borsan V.N.; Nguqu N.; Savage P.E.. Globally, songs and instrumental melodies are slower and higher and use more stable pitches than speech: A Registered Report. Science Advances 2024; 10(20). |
|
Jacoby N.; Polak R.; Grahn J.A.; Cameron D.J.; Lee K.M.; Godoy R.; Undurraga E.A.; Huanca T.; Thalwitzer T.; Doumbia N.; Goldberg D.; Margulis E.H.; Wong P.C.M.; Jure L.; Rocamora M.; Fujii S.; Savage P.E.; Ajimi J.; Konno R.; Oishi S.; Jakubowski K.; Holzapfel A.; Mungan E.; Kaya E.; Rao P.; Rohit M.A.; Alladi S.; Tarr B.; Anglada-Tort M.; Harrison P.M.C.; McPherson M.J.; Dolan S.; Durango A.; McDermott J.H.. Commonality and variation in mental representations of music revealed by a cross-cultural comparison of rhythm priors in 15 countries. Nature Human Behaviour 2024; 8: 846-877. |
|
Maia, Lucas S.; Rocamora, Martín; Biscainho, Luiz W.P.; Fuentes, Magdalena. Selective annotation of few data for beat tracking of Latin American music using rhythmic features. Transactions of the International Society for Music Information Retrieval 2024; 7(1): 99-112. |
|
Shankar A; Plaja-Roglans G; Nuttall T; Rocamora M; Serra X. Saraga Audiovisual: a large multimodal open data collection for the analysis of carnatic music. In: Kim H, Serra X (eds.). 25th International Society for Music Information Retrieval Conference (ISMIR2024). ISMIR; 2024. p. 61-69. |
|
Ramoneda P, Rocamora M, Akama T. Music Proofreading with RefinPaint: Where and How to Modify Compositions given Context. In: Kim H, Serra X (eds.). 25th International Society for Music Information Retrieval Conference (ISMIR2024). ISMIR; 2024. |
|
Poltronieri A, Presutti V, Rocamora M. ChordSync: Conformer-Based Alignment of Chord Annotations to Music Audio. In: -. Proceedings of the 21st Sound and Music Computing Conference. Porto: -; 2024. p. 561-568. |
|
Alonso-Jiménez P; Pepino L; Batlle-Roca R; Zinemanas P; Bogdanov D; Serra X; Rocamora M. Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio. In: -. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing Workshops (ICASSPW). IEEE Xplore; 2024. p. 833-837. |
2023 (2)Rumbo V, Mordecki E, Rocamora M. Minimum description length for selection of models of musical rhythm. Journal of Mathematics and Music 2023; 17(3): 433-451. |
|
Irigaray I, Rocamora M, Biscainho LWP. Noise reduction in analog tape audio recordings with deep learning models. In: -. AES 2023 International Conference on Audio Archiving, Preservation & Restoration. -; 2023. p. 1-6. |
2022 (3)Jakubowski, Kelly; Polak, Rainer; Rocamora, Martín; Jure, Luis; Jacoby, Nori. Aesthetics of musical timing: culture and expertise affect preferences for isochrony but not synchrony. Cognition 2022; 227(0). |
|
Fuentes M, Steers B, Zinemanas P, Rocamora M, Bondi L, Wilkins J, Shi Q, Hou Y, Das S, Serra X, Bello JP. Urban sound & sight: dataset and benchmark for audio-visual urban scene understanding. In: Meng H, Watanabe S, Qian Y, Xie L, Wu Z, Li J, Li H, Yu D, Su D, Eldar Y, Segarra S, Shlezinger N, Wang L, Yu K, Li W, Wang DL, Li B, Yoshioka T, Zhang Y, Dang J, Ma Z, Wang C, Chen Z, Ogawa T, Chng E. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2022). Singapur: IEEE; 2022. p. 141-145. |
|
Simões Maia L, Rocamora M, Biscainho LWP, Fuentes M. Adapting meter tracking models to Latin American music. In: Rao P, Murthy H, Srinivasamurthy A, Bittner R, Repetto R, Goto M, Serra X, Miron M. Proceedings 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). Bengaluru: ISMIR; 2022. p. 361-368. |
2021 (3)Zinemanas P, Rocamora M, Miron M, Font F, Serra X. An Interpretable Deep Learning Model for Automatic Sound Classification. Electronics 2021; 10(7): 1-23. |
|
Clayton, Martin; Tarsitani, Simone; Jankowsky, Richard; Jure, Luis; Leante, Laura; Polak, Rainer; Poole, Adrian; Rocamora, Martín; Alborno, Paolo; Camurri, Antonio; Eerola, Tuomas; Jacoby, Nori; Jakubowski, Kelly. The interpersonal entrainment in music performance data collection. Empirical Musicology Review 2021; 16(1): 65-84. |
|
Zinemanas P, Rocamora M, Fonseca E, Font F, Serra X. Toward interpretable polyphonic sound event detection with attention maps based on local prototypes. In: Font F, Mesaros AM, PW Ellis D, Fonseca E, Fuentes M, Elizalde B (eds.). Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2021). Barcelona: Music Technology Group, Universitat Pompeu Fabra; 2021. p. 50-54. |
2020 (2)Alimenti Bel, Demian; Rocamora, Martín; Martínez, Isabel Cecilia. Análisis de interpretaciones de tango usando herramientas: el estilo de ejecución de Aníbal Troilo interpretando Mi refugio. Per Musi 2020; 0(40): 1-26. |
|
Zinemanas P, Hounie I, Cancela P, Font F, Rocamora M, Serra X. DCASE-models: A Python library for computational environmental sound analysis using deep-learning models. In: Ono N, Harada N, Kawaguchi Y, Mesaros A, Imoto K, Koizumi Y, Komatsu T (eds.). Proceedings of the Fifth Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2020). Tokio: Zenodo; 2020. p. 1-5. |
2019 (9)Zinemanas P, Cancela P, Rocamora M. End-to-end convolutional neural networks for sound event detection in urban environments. Proceeding of the Conference of FRUCT Association 2019; 2019-April(0): 533-539. |
|
Rocamora M, Cancela P, Biscainho LWP. Information theory concepts applied to the analysis of rhythm in recorded music with recurrent rhythmic patterns. AES: Journal of the Audio Engineering Society 2019; 67(4). |
|
Rocamora M, Jure L. carat A toolbox for computer-aided rhythm analysis. In: -. 1st Analytical Approaches to World Music Special Topics Symposium. 2019. |
|
Fuentes M, Maia LS, Rocamora M, Biscainho LWP, Crayencour HC, Essid S, Bello JP. Tracking beats and microtiming in Afro-latin American music using conditional random fields and deep learning. In: -. 20th Conference of the International Society for Music Information Retrieval (ISMIR 2019). Society for Music Information Retrieval (ISMIR 2019); 2019. p. 251-258. |
|
Maia LS, Fuentes M, Biscainho LWP, Rocamora M, Essid S. Sambaset: A dataset of historical samba de enredo recordings for computational music analysis. In: -. 20th Conference of the International Society for Music Information Retrieval (ISMIR 2019). Society for Music Information Retrieval (ISMIR 2019); 2019. p. 628-635. |
|
Rocamora M., Jure L., Fuentes M., Simões Maia, Biscainho LWP. CARAT Computer-aided Rhythmic Analysis Toolbox. In: -. 20th Conference of the International Society for Music Information Retrieval (ISMIR 2019). Society for Music Information Retrieval (ISMIR 2019); 2019. p. 1-2. |
|
Zinemanas P, Rocamora M, Jure L. Improving Csounds Ambisonics decoders. In: -. ICSC 5th International Csound Conference. 2019. p. 1-6. |
|
Zinemanas P, Cancela P, Rocamora M. End-to-end convolutional neural networks for sound event detection in urban environments. In: -. Proceedings of the 24th Conference of Open Innovations Association FRUCT. 2019. p. 533-539. |
|
Zinemanas P, Cancela P, Rocamora M. MAVD: A Dataset for Sound Event Detection in Urban Environments. In: AA. VV.. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019). New York: 2019. p. 236-267. |
2018 (7)Simões Maia L, de Tomaz Junior PD, Fuentes M, Rocamora M, Biscainho LWP, da Costa MVM, Cohen S. A novel dataset of Brazilian rhythmic instruments and some experiments in computational rhythm analysis. In: -. AES LAC 2018 - Congreso Latinoamericano de Ingeniería de Audio. 2018. p. 53-60. |
|
De Cola G, Chalupa G, Rocamora M, Cancela P. Análisis automático de voz hablada para detección de dificultades en el aprendizaje de la lectura. In: -. AES LAC 2018 - Congreso Latinoamericano de Ingeniería de Audio. 2018. p. 105-106. |
|
Marenco B, Rocamora M. Reconocimiento de patrones rítmicos en música de percusión a partir de señales de audio. In: -. AES LAC 2018 - Congreso Latinoamericano de Ingeniería de Audio. 2018. p. 87-89. |
|
Agorio L, Corchs A, Pereyra H, Zinemanas P, Rocamora M. UrbanEar Monitoreo de entorno sonoro urbano. In: -. AES LAC 2018 - Congreso Latinoamericano de Ingeniería de Audio. 2018. p. 96-98. |
|
Rumbo V, Mordecki E, Rocamora M. Generación automática de melodías usando cadenas de Markov con restricciones. In: -. AES LAC 2018 - Congreso Latinoamericano de Ingeniería de Audio. 2018. p. 102-104. |
|
Massaferro Saquieres P, Rocamora M, Cancela P. Influencia del acompañamiento en la identificación automática de cantante en música polifónica. In: -. AES LAC 2018 - Congreso Latinoamericano de Ingeniería de Audio. 2018. p. 37-44. |
|
Jure L, Rocamora M. Subir la llamada Negotiating tempo and dynamics in Afro-Uruguayan candombe drumming. In: Holzapfel A, Pikrakis A. International Workshop on Folk Music Analysis 2018. Aristotle University of Thessaloniki, Greece; 2018. p. 1-6. |
2017 (1)Jure L, Rocamora M. Clave patterns in Uruguayan Candombe drumming. In: -. 16th Rhythm Production and Perception Workshop (RPPW 2017). 2017. |
2016 (1)Jure L, Rocamora M. Microtiming in the rhythmic structure of Candombe drumming patterns. In: -. Fourth International Conference On Analytical Approaches To World Music (AAWM 2016) Proceedings. Nova York: 2016. p. 1-6. |
2015 (8)Marenco B, Fuentes M, Lanzaro F, Rocamora M, Gomez A. A multimodal approach for percussion music transcription from audio and video. Lecture Notes in Computer Science 2015; 9423(0): 92-99. |
|
Magnone L, Bessonart M, Rocamora M, Gadea J, Salhi M. Diet estimation of Paralichthys orbignyanus in a coastal lagoon via quantitative fatty acid signature analysis. Journal of Experimental Marine Biology and Ecology 2015; 462(0): 36-49. |
|
Rocamora M, Biscainho L. Modeling onset spectral features for discrimination of drum sounds. Lecture Notes in Computer Science 2015; 9423(0): 100-107. |
|
Apolinário IF, Biscainho LWP, Rocamora M, Cancela P. Fan Chirp Transform with nonlinear time warping. In: -. Brazilian AES Audio Engineering Congress. 2015. p. 62-68. |
|
Rocamora M, Jure L, Marenco B, Fuentes M, Lanzaro F, Gómez A. An audio-visual database of candombe performances for computational musicological studies. In: -. CICTeM 2015 - II Congreso Internacional de Ciencia y Tecnología Musical. 2015. p. 17-24. |
|
Nunes L, Rocamora M, Jure L, Biscainho LWP. Beat and downbeat tracking based on rhythmic patterns applied to the Uruguayan candombe drumming. In: Müller M, Wiering F (eds.). Proceedings of the 16th International Society for Music Information Conference, ISMIR 2015. Málaga: dblp; 2015. p. 264-270. |
|
Rocamora M, Biscainho LWP. Modeling onset spectral features for discrimination of drum sounds. In: -. Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications 20th Iberoamerican Congress, CIARP 2015. 2015. p. 100-107. |
|
Marenco B, Fuentes M, Lanzaro F, Rocamora M, Gómez A. A multimodal approach for percussion music transcription from audio and video. In: -. Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications 20th Iberoamerican Congress, CIARP 2015. 2015. p. 92-99. |
2014 (3)Rocamora M, Cancela P, Pardo A. Query by humming: Automatically building the database from music recordings. Pattern Recognition Letters 2014; 36(1): 272-280. |
|
Rocamora M. Tecnologías para el análisis del contenido musical de grabaciones de audio. In: -. Primera Semana del Sonido. 2014. p. 1-12. |
|
Rocamora M, Jure L, Biscainho LWP. Tools for detection and classification of piano drum patterns from candombe recordings. In: -. Proceedings of the 9th Conference on Interdisciplinary Musicology - CIM14. 2014. p. 382-387. |
2012 (2)Rocamora M, Pardo A. Separation and classification of harmonic sounds for singing voice detection. Lecture Notes in Computer Science 2012; 7441(0): 707-714. |
|
Jure L, Lopez E, Rocamora M, Cancela P, Sponton H, Irigaray I. Pitch content visualization tools for music performance analysis. In: Gouyon F, Herrera P, Gustavo L, Müller M. Proceedings of 13th International Society for Music Information Retrieval Conference. Porto: 2012. p. 493-498. |
2011 (2)Rocamora M, Cancela P. Pitch tracking in polyphonic audio by clustering local fundamental frequency estimates. In: -. Brazilian AES Audio Engineering Congress, 9th. 2011. p. 80-87. |
|
Cancela P, López E, Rocamora M. Fan chirp transform for music representation. In: Zölzer, U. (ed.). DAFX Digital Audio Effects. John Wiley; 2011. p. 330-337. |
2010 (1)Cancela P, Lopez E, Rocamora M. Fan chirp transform for music representation. Proceedings of the International Conference on Digital Audio Effects 2010; 0(0). |
2009 (2)Cancela P, Rocamora M, Lopez E. An efficient multi-resolution spectral transform for music analysis. In: VV.AA.. Proceedings of the 10th International Society for Music Information Retrieval Conference ISMIR 2009. Kobe: International Society for Music Information Retrieval; 2009. p. 309-314. |
|
Rocamora M, López E, Jure L. Wind instruments synthesis toolbox for generation of music audio signals with labeled partials. In: -. Proceedings of the 12th Brazilian Symposium on Computer Music (SBCM 2009). 2009. p. 69-80. |
2007 (1)Rocamora M, Herrera P. Comparing audio descriptors for singing voice detection in music audio files. In: -. Brazilian Symposium on Computer Music, 11th. 2007. p. 187-196. |
2006 (1)López E, Rocamora M. Tararira Query by singing system. In: -. 2nd Annual Music Information Retrieval Evaluation eXchange (MIREX). 2006. p. 1-4. |
2005 (1)López E, Rocamora M. Tararira Sistema de búsqueda de música por melodía cantada. In: -. Proceedings of the 10th Brazilian Symposium on Computer Music (SBCM 2005). 2005. p. 142-153. |