Mehdi Souregi
Mehdi Souregi

Reputation: 3265

HTK: Understanding scores in resulting .mlf file

I am trying to understand the file result recout.mlf, so I have the following lines in that file:

enter image description here

Which of 'as' was well prononced: the one with -524.427185 or -1054.774536

Upvotes: 0

Views: 221

Answers (1)

Dmytro Prylipko
Dmytro Prylipko

Reputation: 5084

The acoustic scores obtained during decoding are usually very tiny. To prevent underflow, log likelihoods are used instead of likelihoods: 1.5 Recognition and Viterbi Decoding.

Smaller argument values correspond to larger negative values of logarithms:

Natural log

Thus, the first 'as' obtained a higher (-524.427185) acoustic score. Logarithm is a monotonic function (the larger is argument - the larger is the value), so you can compare the log-likelihoods directly: -524 > -1054.

BTW, it does not necessarily mean the first 'as' was better pronounced. The acoustic score depends on many factors, including model topology and the data the model was trained on.

Upvotes: 1

Related Questions