Publications
2026 (1)Barreiro A.; Mari R.; Redondo R.; Haro G.; Bosch C.. ShinyNeRF: Digitizing Anisotropic Appearance in Neural Radiance Fields. The international archives of photogrammetry, remote sensing and spatial information sciences 2026; 48: 33-40. |
2025 (5)Juanola X, Haro G, Fuentes M. A Critical Assessment of Visual Sound Source Localization Models Including Negative Audio [IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)]. arxiv.org 2025. |
|
Juanola X; Morais G; Fuentes M; Haro G. Learning from Silence and Noise for Visual Sound Source Localization. In: AA.VV.. Proceedings of the 36th British Machine Vision Conference 2025. 2025. |
|
Barreiro A.; Mari R.; Redondo R.; Haro G.; Bosch C.; Berga D.. Specularity in NeRFs: A Comparative Study of Ref-NeRF and NRFF. Image Processing On Line 2025; 15: 32-44. |
|
Phillips A.; Grandes Rodríguez D.; Sánchez-Manzano M.; Salvadó-Romero A.; Garin M.; Haro G.; Ballester C. Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods. In: Del Bue A; Canton C; Pont-Tuset J; Tommasi T (eds.). Computer Vision - ECCV 2024 Workshops. Springer; 2025. p. 345-361. |
|
Phillips A.; Grandes Rodriguez D.; Sanchez-Manzano M.; Salvado A.; Garin M.; Haro G.; Ballester C.. Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods. Lecture Notes in Computer Science 2025; 15628: 345-361. |
2024 (4)Juanola, Xavier; Haro, Gloria. A Brief Analysis of SLAVC method for Sound Source Localization. Image Processing On Line 2024; 159-172. |
|
Diaz-Juan A, Ballester C, Haro G. SoccerHigh: A Benchmark Dataset for Automatic Soccer Video Summarization. In: -. -. -; 2024. p. 121-130. |
|
Cartas A, Ballester C, Haro G. Two Weakly Supervised Approaches for Role Classification of Soccer Players. In: AA.VV.. MMSports '24: Proceedings of the 7th ACM International Workshop on Multimedia Content Analysis in Sports. ACM Digital Library; 2024. p. 81-89. |
|
Phillips, Grandes Rodriguez D, Sánchez-Manzano M, Salvadó A, Garin M, Haro G, Ballester C. Visual Motif Identification: Elaboration of a Curated Comparative Dataset and Classification Methods. In: Leonardis A, Ricci E, Roth S, Russakovsky O, Sattler, Varol TG. Proceedings of the European Conference on Computer Vision (ECCV) Workshops 2024. Springer Cham; 2024. |
2023 (2)Montesinos JF, Michelsanti D, Haro G, Tan ZH, Jensen J. Speech inpainting: Context-based speech synthesis guided by video. Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023; 4459-4463. |
|
Montesinos, JF, Michelsanti D, Haro G, Tan ZH, Jensen J. Speech inpainting: Context-based speech synthesis guided by video. In: AA. VV.. Proceedings Interspeech 2023. Dublin: ICP; 2023. p. 4459-4463. |
2022 (3)Cartas A, Ballester C, Haro G. A Graph-Based Method for Soccer Action Spotting Using Unsupervised Player Classification. In: Rainer L,Thomas BM, Hideo S. Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports. Association for Computing Machinery; 2022. p. 93-102. |
|
Kadandale VS, Montesinos JF, Haro G. VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices. In: AA. VV.. Proceedings Interspeech 2022. Annual conference of the International Speech Communication Association. Korea: International Speech Communication Association; 2022. p. 3128-3132. |
|
Montesinos JF, Kadandale VS, Haro G. VoViT: Low Latency Graph-based Audio- Visual Voice Separation Transformer. In: AA. VV.. ECCV 2022 Proceedings European Conference on Computer Vision. Tel Aviv: 2022. |
2021 (7)Montesinos F, Kadandale VS, Haro G. A cappella: Audio-visual Singing Voice Separation. In: AA. VV.. Proceedings of 32nd British Machine Vision Conference (BMVC 2021- Virtual). -: British Machine Vision Association; 2021. p. 1-14. |
|
Montesinos JF, Kadandale V, Haro G. A cappella: Audio-visual Singing Voice Separation. arxiv.org; 2021. |
|
Slizovskaia O.; Haro G.; Gomez E.. Conditioned Source Separation for Musical Instrument Performances. IEEE/ACM Transactions on Audio Speech and Language Processing 2021; 29: 2083-2095. |
|
Batard T, Haro G, Ballester C. DIP-VBTV: A Color Image Restoration Model combining a Deep Image Prior and a Vector Bundle Total Variation. Journal of Imaging Science 2021; 14(4): 1816-1847. |
|
Arbués-Sangüesa A, Martín A, Granero P, Ballester C, Haro G. Learning Football Body-Orientation as a Matter of Classification. In: AA. VV.. 30th International Joint Conference on Artificial Intelligence (IJCAI-21). Montreal: IJCAI; 2021. p. 1-8. |