Efficient and Robust Speaker Diarization via Structured Pruning of Self-Supervised Models

Han, Jiangyu; Pálka, Petr; Delcroix, Marc; Landini, Federico; Rohdin, Johan; Černocký, Jan; Burget, Lukáš

doi:10.1109/taslpro.2026.3675801

articleIEEE Transactions on Audio Speech and Language ProcessingJan 1, 2026Closed access

Efficient and Robust Speaker Diarization via Structured Pruning of Self-Supervised Models

JHJiangyu Han PPPetr Pálka MDMarc Delcroix FLFederico Landini JRJohan Rohdin

Brno University of Technology · NTT (Japan) · +1 more institution

Indexed incrossref

Abstract

This work presents a framework for compressing self-supervised models for speaker diarization through structured pruning guided by knowledge distillation. We investigate pruning objectives that target both model parameters and computational complexity, and analyze alternative strategies, showing that a simple overall pruning approach provides the best balance between efficiency and accuracy. Our method achieves up to 80% model size reduction and 4x faster inference without performance degradation. Comprehensive experiments across eight public diarization datasets demonstrate that the pruned models consistently match or surpass the performance of their uncompressed counterparts. Furthermore, we show strong…

Citation impact

5

total citations

FWCI: 176.93
Percentile: 100%
References: 0

Too recent for citation history.

Authors

7

Topics & keywords

Topics

Keywords

Pruning
Inference
Generalization
Reduction (mathematics)
Speaker diarisation
Domain (mathematical analysis)
Speech processing

No related works found for this paper.