Publications

15 documents

  • Paul Best, Marion Poupard, Ricard Marxer, Paul Spong, Helena Symonds, et al. Analysing vocal complexity in relation to sociality in orcas of British Columbia: An application of long-term computational passive acoustics. Ecological Informatics, 2025, 90, pp.103211. ⟨10.1016/j.ecoinf.2025.103211⟩. ⟨hal-05265578⟩
  • Antonio Almudévar, José Miguel Hernández-Lobato, Sameer Khurana, Ricard Marxer, Alfonso Ortega. Aligning Multimodal Representations through an Information Bottleneck. International Conference on Machine Learning (ICML), Jul 2025, Vancouver, Canada. ⟨10.48550/arXiv.2506.04870⟩. ⟨hal-05265540⟩
  • Paul Best, Marcelo Araya-Salas, Axel Ekström, Bárbara Freitas, Frants Jensen, et al. Bioacoustic fundamental frequency estimation: a cross-species dataset and deep learning baseline. Bioacoustics, 2025, 34 (4), pp.419-446. ⟨10.1080/09524622.2025.2500380⟩. ⟨hal-05265455⟩
  • Joonas Kalda, Séverin Baroudi, Martin Lebourdais, Clément Pagés, Ricard Marxer, et al. Design Choices for PixIT-based Speaker-Attributed ASR: Team ToTaTo at the NOTSOFAR-1 Challenge. Computer Speech and Language, 2025, 95, pp.101824. ⟨10.1016/j.csl.2025.101824⟩. ⟨hal-05084070⟩
  • Md Ether Deowan, Md Shamin Yeasher Yousha, Tihan Mahmud Hossain, Shahriar Hassan, Ricard Marxer. Optimizing Underwater Robot Navigation: A Study of DRL Algorithms and Multi-Modal Sensor Fusion. IEEE International Conference on Robotics & Automation (ICRA), May 2025, Atlanta, GA, United States. ⟨hal-05004039⟩
  • Santiago Cuervo, Ricard Marxer. Scaling Properties of Speech Language Models. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024, Miami, United States. pp.351-361, ⟨10.18653/v1/2024.emnlp-main.21⟩. ⟨hal-04832692⟩
  • Arik Kershenbaum, Çağlar Akçay, Lakshmi Babu-Saheer, Alex Barnhill, Paul Best, et al. Automatic detection for bioacoustic research: a practical guide from and for biologists and computer scientists. Biological Reviews, 2024, ⟨10.1111/brv.13155⟩. ⟨hal-04741895⟩
  • Paul Best, Santiago Cuervo, Ricard Marxer. Transfer Learning from Whisper for Microscopic Intelligibility Prediction. Interspeech 2024, Sep 2024, Kos, Greece. pp.3839-3843, ⟨10.21437/Interspeech.2024-2258⟩. ⟨hal-04683361⟩
  • Santiago Cuervo, Ricard Marxer. Speech Foundation Models on Intelligibility Prediction for Hearing-Impaired Listeners. ICASSP 2024 – 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2024, Seoul, South Korea. pp.1421-1425, ⟨10.1109/ICASSP48485.2024.10447907⟩. ⟨hal-04592508⟩
  • Santiago Cuervo, Ricard Marxer. On the Benefits of Self-supervised Learned Speech Representations for Predicting Human Phonetic Misperceptions. INTERSPEECH 2023, Aug 2023, Dublin, Ireland. pp.1788-1792, ⟨10.21437/Interspeech.2023-1476⟩. ⟨hal-04194225⟩