Speech Research >> Acoustical modeling of vocal tract

by aolney » Thu, 14 Oct 2004 06:42:01 GMT

I just ran across the following reference

"Acoustical Klein-Gordon Equation: A Time-Independent Perturbation
Analysis," Physical Review Letters, July 30, 2004

which pupports to model vowels with high fidelity using a smallish
number of parameters.

Could someone more knowledgable on TTS than myself comment as to the
applicability of this approach to delivering high quality TTS with a
smaller memory footprint?

Speech Research >> Acoustical modeling of vocal tract

by Olivier Galibert » Fri, 15 Oct 2004 05:49:57 GMT



Facts of life in TTS synthesis:
- vowels are easy-ish
- consonants are harder
- transitions are the real pain
- a large part of the intelligibility is in the transitions

OG.

Speech Research >> Acoustical modeling of vocal tract

by James Salsman » Fri, 15 Oct 2004 06:40:02 GMT


Yes, which is the reason why the best concatinative synthesizers are
based on diphones or larger units instead of just phonemes.

Sincerely,
James Salsman
--
www.readsay.com - maker of the ReadSay PROnounce English literacy system
400 MHz PDA included: $499 -- http://www.readsay.com/PROnounce.html