Efficient and Robust Speaker Diarization via Structured Pruning of Self-Supervised Models
Brno University of Technology · NTT (Japan) · +1 more institution
Abstract
This work presents a framework for compressing self-supervised models for speaker diarization through structured pruning guided by knowledge distillation. We investigate pruning objectives that target both model parameters and computational complexity, and analyze alternative strategies, showing that a simple overall pruning approach provides the best balance between efficiency and accuracy. Our method achieves up to 80% model size reduction and 4x faster inference without performance degradation. Comprehensive experiments across eight public diarization datasets demonstrate that the pruned models consistently match or surpass the performance of their uncompressed counterparts. Furthermore, we show strong…
Citation impact
- FWCI
- 176.93
- Percentile
- 100%
- References
- 0
Authors
7Topics & keywords
- Pruning
- Inference
- Generalization
- Reduction (mathematics)
- Speaker diarisation
- Domain (mathematical analysis)
- Speech processing