Thanks James for your reply. I'm attempting to train a semi-continuous
small vocabulary model using SphinxTrain-0.9.1-beta on x86. I am following
the document at
http://www-2.cs.cmu.edu/~rsingh/sphinxman/s3manual.html
which has links to "Instruction set for training" and "Troubleshooting:
tools and logfiles".
I use /usr/bin/record to record wav files at 16000 Hz and wave2feat to
extract MFCCs. I think the feature files are correct because cepview shows
the vectors having values in the acceptable ranges according to the
troubleshooting doc, i.e. the first component typically starts off at about
12 and then continues in the range 5 - 10. The other components are in the
range -1 to +1. However, I just wanted to double check. For wave2feat you
have to specify the audio file as either -nist or -raw. I'm using -raw.
Training data comprises 189 utterances or 43M of wav files resulting in
133087 frames.
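
As a rough sanity check on those numbers, here is a minimal sketch that
estimates how many frames 43M of audio should give. It assumes 16-bit mono
samples and 100 frames per second, neither of which is stated above, and it
ignores wav headers and edge effects, so only approximate agreement is
expected. The function name is just for illustration.

```python
def expected_frames(num_bytes, sample_rate=16000, bytes_per_sample=2,
                    channels=1, frames_per_sec=100):
    """Estimate the number of feature frames for a given amount of audio.

    Assumptions (not from the post): 16-bit mono PCM, 100 frames/s.
    """
    seconds = num_bytes / (sample_rate * bytes_per_sample * channels)
    return int(seconds * frames_per_sec)


# "43M" is taken as 43,000,000 bytes; the real total is only known roughly.
estimate = expected_frames(43_000_000)
reported = 133087
print(estimate, reported, reported / estimate)
```

Under those assumptions the estimate lands within about 1% of the 133087
frames reported, which suggests that the overall amount of feature data is
about right.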
I have previously gone through the entire training sequence once (CI,
CD-untied, CD-tied training and conversion to Sphinx2 format). But I don't
get any recognition when using sphinx2-continuous, so I'm trying again.
One problem was that I wasn't getting any indication of convergence, so I
ran the bw/norm step 7 times for each stage of training.
My main question is regarding the 8 "empty cluster" messages at
kmeans_init. The troubleshooting log suggests that this can happen if the
feature files are very small, byte swapped or contain garbage. This is why
I'm asking whether I'm doing the wave2feat extraction correctly. Also, are
the "never observed" messages at normalization a consequence of the "empty
clusters"?
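
To rule out the byte-swapping possibility, something like the following
sketch could be run on a feature file. It assumes the feature file starts
with a 4-byte integer giving the number of 32-bit floats that follow; that
layout is my understanding of the Sphinx cepstrum format and should be
checked against the documentation. The function name and the file name in
the usage comment are hypothetical.

```python
import os
import struct
import sys


def header_byte_order(path):
    """Classify a feature file as 'native', 'swapped' or 'bad'.

    Assumed layout (verify for your version): a 4-byte integer header
    holding the count of 4-byte floats that make up the rest of the file.
    """
    size = os.path.getsize(path)
    with open(path, "rb") as f:
        header = f.read(4)
    if len(header) < 4 or (size - 4) % 4 != 0:
        return "bad"
    n_floats = (size - 4) // 4
    # Interpret the header both ways and see which one matches the file size.
    as_little = struct.unpack("<i", header)[0]
    as_big = struct.unpack(">i", header)[0]
    if as_little == n_floats:
        order = "little"
    elif as_big == n_floats:
        order = "big"
    else:
        return "bad"
    return "native" if order == sys.byteorder else "swapped"


# Usage (hypothetical file name):
#   print(header_byte_order("utt001.mfc"))
```

A result of "swapped" or "bad" would point at the extraction step, whereas
"native" on every file would suggest that the empty clusters have another
cause.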
As for calculating convergence ratios, I was thinking of writing my own
C program to do that from successive iterations of the means/vars/mixw/tmat
files. What exactly do I have to calculate?
Surely this has been done already.
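
For reference, a minimal sketch of one possible definition: the relative
change in total log likelihood between successive bw iterations. That this
is the quantity the manual means is an assumption on my part, as is the
availability of a total log likelihood per iteration. The likelihood values
and the 0.001 threshold below are made up purely for illustration.

```python
def convergence_ratio(prev_loglik, curr_loglik):
    """Relative improvement in log likelihood from one iteration to the next.

    Assumed definition, not taken from the manual:
        (current - previous) / |previous|
    """
    return (curr_loglik - prev_loglik) / abs(prev_loglik)


# Made-up total log likelihoods for four successive iterations.
logliks = [-1.50e6, -1.20e6, -1.15e6, -1.149e6]
ratios = [convergence_ratio(a, b) for a, b in zip(logliks, logliks[1:])]

THRESHOLD = 0.001  # arbitrary illustrative stopping threshold
for i, r in enumerate(ratios, start=2):
    status = "converged" if r < THRESHOLD else "keep going"
    print(f"iteration {i}: ratio {r:.5f} ({status})")
```

With this definition no comparison of the means/vars/mixw/tmat files is
needed, only one number per iteration.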
The specs for my microphone are: Sensitivity -54 +/-3dB; Output impedance
2.2K ohms; Frequency response 20Hz ~ 20KHz; Operating voltage 1.5V ~ 10V;
Sensitivity reduction within -3dB at 1V.
Thanks, Jack.