No results founds

2023 (12)

Alonso-Jiménez P, Favory X, Foroughmand H, Bourdalas G, Serra X, Lidy T, Bogdanov D. Training Strategies Using Contrastive Learning and Playlist Information for Music Classification and Similarity. In: AA. VV.. Proceedings 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023). 1 ed. Island of Rhodes: IEEE Signal Processing Society; 2023. p. 1-5.

Batlle-Roca R, Herrera-Boyer P, Meléndez B, Molina E, Serra X. Towards a characterization of background music audibility in broadcasted TV. International Journal of Environmental Research and Public Health 2023; 20(1): 1-15.

Hupont I, Tolan S, Frau P, Porcaro L, Gomez E.. Measuring and fostering diversity in Affective Computing research. IEEE Transactions on Affective Computing 2023; ( ): 1-16.

Kim H, Miron M, Serra X. Score-Informed MIDI Velocity Estimation for Piano Performance by FiLM Conditioning. In: AA. VV.. Proceedings of he 2023 Sound & Music Computing Conference (SMC Conference 2023). 1 ed. Stockholm: 2023. p. 1-9.

Plachouras C, Miron M. Music Rearrangement Using Hierarchical Segmentation. In: AA. VV.. Proceedings 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023). 1 ed. Island of Rhodes: IEEE Signal Processing Society; 2023. p. 1-5.

Schedl M, Gómez E, Lex E.. Trustworthy Algorithmic Ranking Systems. In: ACM SIGIR. WSDM 2023 - Proceedings of the 16th ACM International Conference on Web Search and Data Mining. 1 ed. Association for Computing Machinery, Inc; 2023. p. 1240-1243.

Singh R, Zinemanas P, Serra X, Bello JP, Fuentes M. FlowGrad: Using Motion for Visual Sound Source Localization. In: AA. VV.. Proceedings 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023). 1 ed. Island of Rhodes: IEEE Signal Processing Society; 2023. p. 1-5.

Srinivasamurthy A, Gulati S, Caro Repetto R, Serra X. Getting started on computational musicology and music information research: an indian art music perspective. In: Rao P.; Murthy H. A.; Prasann S. R. M.. Indian art music: a computational perspective. 1 ed. Sriranga Digital Software Technologies Pvt. Ltd., 2023; 2023. p. 3-38.

Tamer NC, Özer Y, Müller M, Serra X. TAPE: An End-to-End Timbre-Aware Pitch Estimator. In: AA. VV.. Proceedings 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023). 1 ed. Island of Rhodes: IEEE Signal Processing Society; 2023. p. 1-5.

Weck B, Serra X. Data leakage in cross-modal retrieval training: A case study. In: AA. VV.. Proceedings 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023). 1 ed. Island of Rhodes: IEEE Signal Processing Society; 2023. p. 1-5.

2022 (34)

Alonso-Jiménez P, Serra X, Bogdanov D. Music representation learning based on editorial metadata from discogs. In: AA. VVV.. Proceedings 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). 1 ed. Bengaluru: ISMIR; 2022. p. 825-833.

Bogdanov D, Lizarraga Seijas X, Alonso-Jiménez P, Serra X. MusAV: A dataset of relative arousal-valence annotations for validation of audio models. In: AA. VVV.. Proceedings 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). 1 ed. Bengaluru: ISMIR; 2022. p. 650-658.

Buisson M, Alonso-Jiménez P, Bogdanov D. Ambiguity Modelling with Label Distribution Learning for Music Classification. In: Li H, Furui S (eds,). 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 1 ed. Singapur: IEEE Signal Processing Society; 2022. p. 611-615.

Correya Albin A, Bogdanov D, Alonso-Jimenez P, Serra X. Essentia API: a web API for music audio analysis. UPF; 2022.

Cortès G, Ciurana A, Molina E, Miron M, Meyers O, Six J, Serra X. BAF: an audio fingerprinting dataset for broadcast monitoring. In: AA. VVV.. Proceedings 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). 1 ed. Bengaluru: ISMIR; 2022. p. 908-916.

Cuesta H, Gómez E. Voice assignment in vocal quartets using deep learning models based on pitch salience. Transactions of the International Society for Music Information Retrieval 2022; 5(1): 99-112.

Drysdale J, Hockman J, Ramires A, Serra X. Improved automatic instrumentation role classification and loop activation transcription. In: Evangelista G, Holighaus N (eds.). Proceedings - Digital Audio Effects 20's Vienna (eDAFx2020) 1 ed. Viena: IEEE; 2022. p. 264-271.

Fonseca E, Favory X, Pons J, Font F, Serra X. FSD50K: an Open Dataset of Human-Labeled Sound Events. IEEE-ACM transactions on audio, speech, and language processing 2022; (30): 829-852.

Haki B, Nieto M, Pelinski T, Jordà S. Real-Time Drum Accompaniment Using Transformer Architecture. In: AA. VV.. Proceedings of the 3rd Conference on AI Music Creativity (AIMC 2022). 1 ed. AIMC; 2022. p. 1-10.

Ilaria Manco, Benno Weck, Philip Tovstogan, Minz Won, Dmitry Bogdanov. Song Describer: a Platform for Collecting Textual Descriptions of Music Recordings. In: AA. VVV.. Proceedings 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). 1 ed. Bengaluru: ISMIR; 2022. p. 1-4.

Kim H, Ramoneda P, Miron M, Serra X. An overview of automatic piano performance assessment within the music education context. In: Cukurova M, Rummel N, Gillet D, McLaren B, Uhomoibhi J. Proceedings of the 14th International Conference on Computer Supported Education (online streaming). 1 ed. Setúbal: Sciteppress; 2022. p. 465-474.

Marcos-Fernández J, Joglar-Ongay L, Serra X, Bogdanov D. Audio Analysis Applications in the Browser with Essentia.js. In: AA. VV.. Proceedings WAC2022. 7th International Web Audio conference. 1 ed. Cannes: 2022.

Morsi A, Serra X. Bottlenecks and solutions for audio to score alignment research. In: AA. VVV.. Proceedings 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). 1 ed. Bengaluru: ISMIR; 2022. p. 272-279.

Narang J, Miron M, Srinivasamurthy A, Serra X. Analysis of musical dynamics in vocal performances using loudness measures. In: Evangelista G, Holighaus N (eds.). Proceedings - Digital Audio Effects 20's Vienna (eDAFx2020) 1 ed. Viena: IEEE; 2022. p. 33-39.

Nuttall T, Plaja-Roglans G, Pearson L, Sierra X. In search of Sañcaras: tradition-informed repeated melodic pattern recognition in carnatic music. In: AA. VVV.. Proceedings 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). 1 ed. Bengaluru: ISMIR; 2022. p. 337-344.

Perez M, Kirchhoff H, Serra X. A Comparison of Pitch Chroma Extraction Algorithms. In: Michon R, Pottier L, Orlarey Y (eds.). Proceedings of the SMC 2022 Music technology and design. 1 ed. Saint-Étienne: Université Jean Monnet of Saint-Étienne (CLLA); 2022. p. 224-231.

Plaja-Roglans G, Miron M, Serra X. A diffusion-inspired training strategy for singing voice extraction in the waveform domain. In: AA. VVV.. Proceedings 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). 1 ed. Bengaluru: ISMIR; 2022. p. 685-693.

Porcaro L, Gómez E, Castillo C. Diversity in the music listening experience: insights from focus group interviews. In: AA. VV.. CHIIR '22: ACM SIGIR Conference on Human Information Interaction and Retrieval;. 1 ed. Regensburg: ACM; 2022. p. 272-276.

Porcaro L, Gómez-Gutierrez E, Castillo C. Perceptions of Diversity in Electronic Music: the Impact of Listener, Artist, and Track Characteristics. Proceedings of the ACM on Human-Computer Interaction 2022; 6(109): 1-26.

Ramires A, Juras J, Parker JD, Serra X. A study of control methods for percussive sound synthesis based on gans. In: Evangelista G, Holighaus N (eds.). Proceedings - Digital Audio Effects 20's Vienna (eDAFx2020) 1 ed. Viena: IEEE; 2022. p. 224-231.

Ramirez-Melendez R, Matamoros E, Hernandez Leo D, Mirabel J, Sanchez E, Escude N. Music-Enhanced Emotion Identification of Facial Emotions in Autistic Spectrum Disorder Children: A Pilot EEG Study. Brain Sciences 2022; 12(6): 1-11.

Ramoneda P, Can Tamer N, Eremenko V, Serra X, Miron M. Score difficulty analysis for piano performance education based on fingering. In: Li H, Furui S (eds,). 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 1 ed. Singapur: IEEE Signal Processing Society; 2022. p. 201-205.

Ramoneda P, Jeong D, Nakamura E, Serra X, Miron M. Automatic Piano Fingering from Partially Annotated Scores using Autoregressive Neural Networks. In: AA. VV.. Proceedings 30th ACM International Conference on Multimedia. MM'22 1 ed. Lisboa: ACM Multimedia; 2022. p. 1-9.

Schedl M, Gómez E, Lex E. Retrieval and Recommendation Systems at the Crossroads of Artificial Intelligence, Ethics, and Regulation. In: ACM SIGIR. SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1 ed. Association for Computing Machinery, Inc; 2022. p. 3420-3424.

Schmidt N, Pons J, MironM. PodcastMix: A dataset for separating music and speech in podcasts. In: Ko H, Hansen JHL. Proceedings of Interspeech 2022. 1 ed. International Speech Communication Association; 2022. p. 231-235.

Tamer NC, Ramoneda P, Serra X. Violin etudes: a comprehensive dataset for f0 estimation and performance analysis. In: AA. VVV.. Proceedings 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). 1 ed. Bengaluru: ISMIR; 2022. p. 517-524.

Tamer NC, Ramoneda P, Serra X. Violin etudes: a comprehensive dataset for f0 estimation and performance analysis. Spanish Ministerio de Ciencia e Innovación and the Agencia Estatal de Investigación; 2022.

Tovstogan P, Serra X, Bogdanov D. Similarity of nearest-neighbor query results in deep latent spaces. In: Michon R, Pottier L, Orlarey Y (eds.). Proceedings of the SMC 2022 Music technology and design. 1 ed. Saint-Étienne: Université Jean Monnet of Saint-Étienne (CLLA); 2022. p. 287-294.

Tovstogan P, Serra X, Bogdanov D. Visualization of deep audio embeddings for music exploration and rediscovery. In: Michon R, Pottier L, Orlarey Y (eds.). Proceedings of the SMC 2022 Music technology and design. 1 ed. Saint-Étienne: Université Jean Monnet of Saint-Étienne (CLLA); 2022. p. 493-500.

Weck B, Pérez M, Kirchhoff H, Serra X. Matching Text and Audio Embeddings: Exploring Transfer-learning Strategies for Language-based Audio Retrieval. In: Lagrange M, Mesaros AM, Pellegrini T, Richard G, Serizel R, Stowell D (eds.). Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2022). 1 ed. Nancy: Dcase; 2022. p. 206-211.

Yesiler F, Miron M, Serrà J, Gómez E. Assessing algorithmic biases for musical version identification. In: K Selcuk Candan. WSDM 2022 - Proceedings of the 15th ACM International Conference on Web Search and Data Mining. 1 ed. Association for Computing Machinery, Inc; 2022. p. 1284-1290.

2021 (48)

Bonada J, Blaauw M. Semi-supervised Learning for Singing Synthesis Timbre. In: AA. VV.. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021). 1 ed. arXiv.org; 2021. p. 783-787.

Charisi V, Merino L, Escobar M, Caballero F, Gomez R, Gómez E. The effects of robot cognitive reliability and social positioning on child-robot team dynamics. Proceedings IEEE International Conference on Robotics and Automation 2021; ( ): 9439-9445.

Correya A, Marcos-Fernández J, Joglar-Ongay L, Alonso-Jiménez P, Serra X, Bogdanov D. Audio and Music Analysis on the Web using Essentia.js. Transactions of the International Society for Music Information Retrieval 2021; 4(1): 167-181.

Dalmazzo D, Waddell G, Ramírez R. Applying deep learning techniques to estimate patterns of musical gesture. Frontiers in Psychology 2021; (11).

Favory X, Drossos K, Virtanen T, Serra X. Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags. In: Androutsos, Dimitri; Plataniotis, Kostas; Zhang, Xiao-Ping. Proceedings of the 46th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 1 ed. 2021. p. 596-600.

Ferraro A, Bogdanov D, Serra X, Jeon JH, Yoon J. How Low Can You Go? Reducing Frequency and Time Resolution in Current CNN Architectures for Music Auto-tagging. In: Heusdens R, Richard C. Proceedings 28th European Signal Processing Conference (EUSIPCO 2020). 1 ed. Amsterdam: European Association for Signal Processing (EURASIP); 2021. p. 131-135.

Ferraro A, Favory X, Drossos K, Kim Y, Bogdanov D. Enriched Music Representations with Multiple Cross-modal Contrastive Learning. IEEE Signal Processing Letters 2021; ( ): 1-6.

Ferraro A, Kim Y, Lee S, Kim B, Jo N, Lim S, Lim S, Jang J, Kim S, Serra X, Bogdanov D. Melon Playlist Dataset: A Public Dataset For Audio-Based Playlist Generation And Music Tagging. In: Androutsos, Dimitri; Plataniotis, Kostas; Zhang, Xiao-Ping. Proceedings of the 46th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 1 ed. 2021. p. 536-540.

Ferraro A, Serra X, Bauer C. What is fair? Exploring the artists' perspective on the fairness of music streaming platforms.. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2021; ( ): 562-584.

Ferraro A, Serra X, Bauer C. Break the Loop: Gender Imbalance in Music Recommenders. In: Scholer F, Thomas P (eds.). CHIIR '21: Proceedings of the 2021 Conference on Human Information Interaction and Retrieval. 1 ed. New York: Association for Computing Machinery; 2021. p. 249-254.

Fonseca E, Jansen A, Ellis DPW, Wisdom S, Tagliasacchi M, Hershey JR, Plakal M, Hershey S, Channing Moore, Serra X. Self-Supervised Learning from Automatically Separated Sound Scenes. In: IEEE Workshop. WASPAA 2021. 1 ed. Nova York: IEEE Workshop; 2021. p. 251-255.

Fonseca E, Ortego D, McGuinness K, O'Connor NE, Serra X. Unsupervised Contrastive Learning Of Sound Event Representations. In: Androutsos, Dimitri; Plataniotis, Kostas; Zhang, Xiao-Ping. Proceedings of the 46th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 1 ed. 2021. p. 371-375.

Font F. SOURCE: a Freesound community music sampler. In: Audio mostly, A conference on interaction with sound. In: AA. VV.. AM '21: Audio Mostly 2021. 1 ed. Trento: Association for Computing Machinery (ACM); 2021. p. 182-187.

Font F, Mesaros AM, PW Ellis D, Fonseca E, Fuentes M, Elizalde B (eds.). Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2021). 1 ed. Barcelona: Music Technology Group, Universitat Pompeu Fabra; 2021.

Gómez-Cañón JS, Cano E, Yang YH, Herrera P, Gomez E. Let's agree to disagree: Consensus Entropy Active Learning for Personalized Music Emotion Recognition. In: AA. VV.. Proceedings 22nd International Society for Music Information Retrieval Conference (ISMIR 2021) Online. 1 ed. 2021. p. 237-245.

Gómez-Cañón JS, Cano E, Eerola T, Herrera P, Hu X, Yang YH, Gómez E. Music Emotion Recognition: Toward new, robust standards in personalized and context-sensitive applications. IEEE signal processing magazine 2021; 38(6): 106-114.

Gómez-Cañón JS, Cano E, Herrera P, Gómez E. Transfer learning from speech to music: towards language-sensitive emotion recognition models. European Signal Processing Conference 2021; ( ): 136-140.

Gómez-Cañón JS, Cano E, Herrera P, Gómez E. Transfer learning from speech to music: towards language-sensitive emotion recognition models. In: Heusdens R, Richard C. Proceedings 28th European Signal Processing Conference (EUSIPCO 2020). 1 ed. Amsterdam: European Association for Signal Processing (EURASIP); 2021. p. 136-140.

Gómez-Cañón JS, Cano E, Pandrea AG, Herrera P, Gómez E. Language-Sensitive Music Emotion Recognition Models: Are We Really There Yet?. In: Androutsos, Dimitri; Plataniotis, Kostas; Zhang, Xiao-Ping. Proceedings of the 46th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 1 ed. 2021. p. 576-580.

Gómez-Cañón JS, Herrera P, Cano E, Gómez E. Personalized musically induced emotions of not-so-popular Colombian music. In: Ranzato MA. 35th Conference on Neural Information Processing Systems (NeurIPS 2021). 1 ed. Neural Information Processing Systems; 2021. p. 1-7.

Gover M, Sarasua Á, Parra H, Janer J, Mayor O, Cuesta H, Pascual MP, Gkiokas A, Gómez E. Choir Singers Pilot. An online platform for choir singers practice. In: TBD. Proceedings of the Web Audio Confernece 2021. 1 ed. 2021. p. 1-6.

Gutiérrez Páez NF, Gómez-Cañón JS, Porcaro L, Santos P, Hernández-Leo D, Gómez E. Emotion annotation of music: a citizen science approach. In: Hernández-Leo D, Hishiyama R, Zurita G, Weyers B, Nolte A, Ogata H. Collaboration Technologies and Social Computing. Proceedings of the 27th International Conference, CollabTech 2021. 1 ed. Cham: Springer; 2021. p. 51-66.

Hershey S, Ellis D PH, Fonseca E, Jansen A, Liu C, Channing Moore R, Plakal M. The Benefit of Temporally-Strong Labels In Audio Event Classification. arXiv.org; 2021.

Hupont I, Tolan S, Freire A, Porcaro L, Estevez S, Gómez E. How diverse is the ACII community? Analysing gender, geographical and business diversity of affective computing research. International Conference on Affective Computing and Intelligent Interaction 2021; ( ).

Jorda Puig, Sergi. Sonigraphical Instruments: From FMOL To the reacTable. In: Minsky M, Xia G (eds.). Proceedings International Conference on New Interfaces for Musical Expression (Nime 21). 1 ed. Shangay: 2021. p. 89-106.

Lu WT, Wang JCH, Won M, Choi K, Song X. SpecTNT: a Time-Frequency Transformer for Music Audio. In: AA. VV.. Proceedings 22nd International Society for Music Information Retrieval Conference (ISMIR 2021) Online. 1 ed. 2021. p. 396-403.