Deep Scattering Spectrum
Centre de Mathématiques Appliquées · École Polytechnique · +3 more institutions
Indexed inarxivcrossref
Abstract
A scattering transform defines a locally translation invariant representation which is stable to time-warping deformation. It extends MFCC representations by computing modulation spectrum coefficients of multiple orders, through cascades of wavelet convolutions and modulus operators. Second-order scattering coefficients characterize transient phenomena such as attacks and amplitude modulation. A frequency transposition invariant representation is obtained by applying a scattering transform along log-frequency. State-the-of-art classification results are obtained for musical genre and phone classification on GTZAN and TIMIT databases, respectively.
Citation impact
651
total citations
- FWCI
- 24.22
- Percentile
- 100%
- References
- 64
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Wavelet transform
- Scattering
- Mathematics
- Invariant (physics)
- Wavelet
- Mel-frequency cepstrum
- Speech recognition
- Computer science
No related works found for this paper.