The Natural Statistics of Audiovisual Speech
Princeton University · Grenoble Images Parole Signal Automatique
Abstract
Humans, like other animals, are exposed to a continuous stream of signals, which are dynamic, multimodal, extended, and time varying in nature. This complex input space must be transduced and sampled by our sensory systems and transmitted to the brain where it can guide the selection of appropriate actions. To simplify this process, it's been suggested that the brain exploits statistical regularities in the stimulus space. Tests of this idea have largely been confined to unimodal signals and natural scenes. One important class of multisensory signals for which a quantitative input space characterization is unavailable is human speech. We do not understand what signals our brain has to actively piece together…
Citation impact
- FWCI
- 15.02
- Percentile
- 100%
- References
- 91
Authors
5Topics & keywords
- Percept
- Computer science
- Speech recognition
- Natural sounds
- Stimulus (psychology)
- Context (archaeology)
- Envelope (radar)
- Natural (archaeology)