I just ran across the following reference:
"Acoustical Klein-Gordon Equation: A Time-Independent Perturbation
Analysis," Physical Review Letters, July 30, 2004
which purports to model vowels with high fidelity using a smallish
number of parameters.
Could someone more knowledgeable about TTS than I am comment on the
applicability of this approach to delivering high-quality TTS with a
smaller memory footprint?