Unit selection in a concatenative speech synthesis system using a large speech database

Hunt, Andrew J.; Black, Alan W.

doi:10.1109/icassp.1996.541110

articleDec 24, 2002Closed access

Unit selection in a concatenative speech synthesis system using a large speech database

AJAndrew J. Hunt AWAlan W. Black

Advanced Telecommunications Research Institute International

Indexed incrossref

Abstract

One approach to the generation of natural-sounding synthesized speech waveforms is to select and concatenate units from a large speech database. Units (in the current work, phonemes) are selected to produce a natural realisation of a target phoneme sequence predicted from text which is annotated with prosodic and phonetic context information. We propose that the units in a synthesis database can be considered as a state transition network in which the state occupancy cost is the distance between a database unit and a target, and the transition cost is an estimate of the quality of concatenation of two consecutive units. This framework has many similarities to HMM-based speech recognition. A pruned Viterbi…

Citation impact

1,192

total citations

FWCI: 55.43
Percentile: 100%
References: 5

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Computer science
Selection (genetic algorithm)
Speech synthesis
Speech recognition
Natural language processing
Artificial intelligence

No related works found for this paper.