Overview of the ICASSP 2026 Cadenza Challenge: Predicting Lyric Intelligibility

Roa-Dabike, Gerardo; Barker, Jon P.; Cox, Trevor J.; Akeroyd, Michael A.; Bannister, Scott; Fazenda, Bruno; Firth, Jennifer; Graetzer, Simone; Greasley, Alinka; Vos, Rebecca R.; Whitmer, William M.

doi:10.1109/icassp55912.2026.11463231

Article · Apr 21, 2026 · Closed access

Gerardo Roa-Dabike · Jon P. Barker · Trevor J. Cox · Michael A. Akeroyd · Scott Bannister

University of Sheffield · University of Salford · +2 more institutions

Indexed in Crossref

Abstract

We present the first open challenge on predicting lyric intelligibility. A new dataset, CLIP1, was introduced, comprising audio samples of popular Western music paired with listener intelligibility scores. To model diverse listening profiles, samples were processed with no, mild, or moderate simulated hearing loss. A total of 27 systems were submitted by 22 teams. Most systems used foundation models to extract encoder embeddings as high-level acoustic representations, often complemented by signal features and perceptual metrics. Twenty-five systems outperformed the STOI baseline, and 16 outperformed a Whisper-based baseline.

Citation impact

Total citations: 5

FWCI: 30.77
Percentile: 100%
References: 14

Authors: 11

Topics & keywords

Keywords

Intelligibility (philosophy)
Acoustic phonetics
Phonetics
Prosody

Funding

Engineering and Physical Sciences Research Council