Publications
2023 (2)
Montesinos JF, Michelsanti D, Haro G, Tan ZH, Jensen J. Speech inpainting: Context-based speech synthesis guided by video. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023; 4459-4463.
Montesinos JF, Michelsanti D, Haro G, Tan ZH, Jensen J. Speech inpainting: Context-based speech synthesis guided by video. In: AA. VV. Proceedings Interspeech 2023. Dublin: ICP; 2023. p. 4459-4463.
2022 (3)
Cartas A, Ballester C, Haro G. A Graph-Based Method for Soccer Action Spotting Using Unsupervised Player Classification. In: Rainer L, Thomas BM, Hideo S. Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports. Association for Computing Machinery; 2022. p. 93-102.
Kadandale VS, Montesinos JF, Haro G. VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices. In: AA. VV. Proceedings Interspeech 2022. Annual conference of the International Speech Communication Association. Korea: International Speech Communication Association; 2022. p. 3128-3132.
Montesinos JF, Kadandale VS, Haro G. VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer. In: AA. VV. ECCV 2022 Proceedings European Conference on Computer Vision. Tel Aviv: 2022.
2021 (6)
Montesinos F, Kadandale VS, Haro G. A cappella: Audio-visual Singing Voice Separation. In: AA. VV. Proceedings of 32nd British Machine Vision Conference (BMVC 2021 - Virtual). -: British Machine Vision Association; 2021. p. 1-14.
Montesinos JF, Kadandale V, Haro G. A cappella: Audio-visual Singing Voice Separation. arxiv.org; 2021.
Slizovskaia O.; Haro G.; Gomez E. Conditioned Source Separation for Musical Instrument Performances. IEEE/ACM Transactions on Audio Speech and Language Processing 2021; 29: 2083-2095.
Batard T.; Haro G.; Ballester C. DIP-VBTV: A Color Image Restoration Model combining a Deep Image Prior and a Vector Bundle Total Variation. SIAM Journal on Imaging Sciences 2021; 14(4): 1816-1847.
Arbués-Sangüesa A, Martín A, Granero P, Ballester C, Haro G. Learning Football Body-Orientation as a Matter of Classification. In: AA. VV. 30th International Joint Conference on Artificial Intelligence (IJCAI-21). Montreal: IJCAI; 2021. p. 1-8.
Arbués-Sangüesa A, Haro G, Ballester C. Towards Video Summarization: A Temporal Multimodal Method for Action Spotting in Sports Videos. In: AA. VV. Proceedings IROS Workshop: Egocentric vision for interactive perception, learning, and control - online - (EgoVIP 2021). Spanish Network of Machine Learning and Computer Vision for Human Analysis and Robotic Perception (ReAViPeRo). Spanish Ministerio de Ciencia e Innovación (RED2018-102511-T); 2021. p. 1-3.
2020 (9)
Arbues-Sanguesa A.; Martin A.; Fernandez J.; Rodriguez C.; Haro G.; Ballester C. Always Look on the Bright Side of the Field: Merging Pose and Contextual Data to Estimate Orientation of Soccer Players. Proceedings IEEE ICIP 2020; 2020-Octo(0): 1506-1510.
Arbués-Sangüesa A, Martín A, Fernández J, Rodríguez C, Haro G, Ballester C. Always Look on the Bright Side of the Field: Merging Pose and Contextual Data to Estimate Orientation of Soccer Players. In: -. Proceedings Congrès ICIP 2020. ICIP; 2020. p. 1506-1510.
Kadandale V.S.; Montesinos J.F.; Haro G.; Gomez E. Multi-channel U-Net for Music Source Separation. In: AA. VV. IEEE MMSP 2020. IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP). IEEE; 2020. p. 1-6.
Raad L.; Oliver M.; Ballester C.; Haro G.; Meinhardt E. On anisotropic optical flow inpainting algorithms. Image Processing On Line 2020; 10: 78-104.
Hurault S.; Ballester C.; Haro G. Self-Supervised Small Soccer Player Detection and Tracking. In: -. Proceedings of the International ACM Workshop on Multimedia Content Analysis in Sports. 2020. Washington: ACM/SIGMM; 2020. p. 9-18.
Montesinos JF, Slizovskaia O, Haro G. Solos: A Dataset for Audio-Visual Music Analysis. 22nd IEEE International Workshop on Multimedia Signal Processing (MMSP), September 21-24, 2020. arxiv.org; 2020.
Arbues-Sanguesa A.; Martin A.; Fernandez J.; Ballester C.; Haro G. Using player's body-orientation to model pass feasibility in soccer. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition workshops 2020; 2020-June(0): 3875-3884.
Arbues-Sanguesa A, Martín A, Fernández J, Ballester C, Haro G. Using Player's Body-Orientation to Model Pass Feasibility in Soccer. In: AA. VV. Proceedings 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Seattle: The Institute of Electrical and Electronics Engineers (IEEE); 2020. p. 886-887.
Michelsanti D.; Slizovskaia O.; Haro G.; Gomez E.; Tan Z.H.; Jensen J. Vocoder-based speech synthesis from silent videos. Proceedings of Interspeech 2020; 3530-3534.