articleApr 21, 2026Closed access

Overview of the ICASSP 2026 Cadenza Challenge: Predicting Lyric Intelligibility

University of Sheffield · University of Salford · +2 more institutions

Indexed incrossref

Abstract

We present the first open challenge on predicting lyric intelligibility. A new dataset, CLIP1, was introduced, comprising audio samples of popular western music paired with listener intelligibility scores. To model diverse listening profiles, samples were processed with no, mild and moderate simulated hearing loss. A total of 27 systems were submitted by 22 teams. Most systems used foundation models to extract encoder embeddings as high-level acoustic representations, often complemented by signal features and perceptual metrics. Twenty-five systems outperformed the STOI baseline, and 16 outperformed a Whisper-based baseline.

No related works found for this paper.

Funding