Publications

17 documents

Antonio Almudévar, José Miguel Hernández-Lobato, Sameer Khurana, Ricard Marxer, Alfonso Ortega. Aligning Multimodal Representations through an Information Bottleneck. 2026. ⟨hal-05265540⟩
Paul Best, Angela Dassow, Arik Kershenbaum, Tho Duc Nguyen, Megan Pogson, et al. Spatial validation of acoustic individual identification models without ground truths: a case study with the cao-vit gibbon population. PeerJ, 2026, 14, pp.e20655. ⟨10.7717/peerj.20655⟩. ⟨hal-05533845⟩
Sergio Burdisso, Séverin Baroudi, Yanis Labrak, David Grünert, Pawel Cyrta, et al. SDialog: A Python Toolkit for End-to-End Agent Building, User Simulation, Dialog Generation, and Evaluation. Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations), Mar 2026, Rabat, France. pp.320-340, ⟨10.18653/v1/2026.eacl-demo.23⟩. ⟨hal-05583080⟩
Paul Best, Marion Poupard, Ricard Marxer, Paul Spong, Helena Symonds, et al. Analysing vocal complexity in relation to sociality in orcas of British Columbia: An application of long-term computational passive acoustics. Ecological Informatics, 2025, 90, pp.103211. ⟨10.1016/j.ecoinf.2025.103211⟩. ⟨hal-05265578⟩
Paul Best, Marcelo Araya-Salas, Axel Ekström, Bárbara Freitas, Frants Jensen, et al. Bioacoustic fundamental frequency estimation: a cross-species dataset and deep learning baseline. Bioacoustics, 2025, 34 (4), pp.419-446. ⟨10.1080/09524622.2025.2500380⟩. ⟨hal-05265455⟩
Joonas Kalda, Séverin Baroudi, Martin Lebourdais, Clément Pagés, Ricard Marxer, et al. Design Choices for PixIT-based Speaker-Attributed ASR: Team ToTaTo at the NOTSOFAR-1 Challenge. Computer Speech and Language, 2025, 95, pp.101824. ⟨10.1016/j.csl.2025.101824⟩. ⟨hal-05084070⟩
Md Ether Deowan, Md Shamin Yeasher Yousha, Tihan Mahmud Hossain, Shahriar Hassan, Ricard Marxer. Optimizing Underwater Robot Navigation: A Study of DRL Algorithms and Multi-Modal Sensor Fusion. IEEE International Conference on Robotics & Automation (ICRA), May 2025, Atlanta, GA, United States. ⟨hal-05004039⟩
Santiago Cuervo, Ricard Marxer. Scaling Properties of Speech Language Models. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024, Miami, United States. pp.351-361, ⟨10.18653/v1/2024.emnlp-main.21⟩. ⟨hal-04832692⟩
Arik Kershenbaum, Çağlar Akçay, Lakshmi Babu-Saheer, Alex Barnhill, Paul Best, et al. Automatic detection for bioacoustic research: a practical guide from and for biologists and computer scientists. Biological Reviews, 2024, ⟨10.1111/brv.13155⟩. ⟨hal-04741895⟩
Paul Best, Santiago Cuervo, Ricard Marxer. Transfer Learning from Whisper for Microscopic Intelligibility Prediction. Interspeech 2024, Sep 2024, Kos, Greece. pp.3839-3843, ⟨10.21437/Interspeech.2024-2258⟩. ⟨hal-04683361⟩
