Joan Serrà – Sony AI

Joan
Serrà

Google Scholar

Profile

Joan did an MSc and PhD in machine learning for audio at the Music Technology Group of Universitat Pompeu Fabra (2006-2011) and a postdoc in artificial intelligence at IIIA-CSIC (2011-2015). After that, he joined Telefónica R&D as a machine learning researcher (2015-2019) and Dolby Laboratories as an AI researcher and research manager (2019-2024). He is currently with Sony AI, where he performs research on machine learning, focusing on audio and multimedia analysis, synthesis, and retrieval.

Publications

A Comprehensive Real-World Assessment of Audio Watermarking Algorithms: Will They Survive Neural Codecs?

Interspeech, 2025
Yigitcan Özer, Woosung Choi, Joan Serrà, Mayank Kumar Singh*, Wei-Hsiang Liao, Yuki Mitsufuji

We introduce the Robust Audio Watermarking Benchmark (RAW-Bench), a benchmark for evaluating deep learning-based audio watermarking methods with standardized and systematic comparisons. To simulate real-world usage, we introduce a comprehensive audio attack pipeline with var…

Supervised Contrastive Learning from Weakly-labeled Audio Segments for Musical Version Matching

ICML, 2025
Joan Serrà, R. Oguz Araz, Dmitry Bogdanov, Yuki Mitsufuji

Detecting musical versions (different renditions of the same piece) is a challenging task with important applications. Because of the ground truth nature, existing approaches match musical versions at the track level (e.g., whole song). However, most applications require to …

HOME
People
Joan Serrà

JOIN US

Shape the Future of AI with Sony AI

We want to hear from those of you who have a strong desire
to shape the future of AI.

LEARN MORE