Google Cloud Speech-to-Text: grammar to narrow results to a number?

Question

I very simply want to pass in a tiny audio clip (8 kHz telephony) containing a single-digit number and get back that digit as text, with recognition narrowed down to numbers.

File in > number as text out. Preferably via the Python command-line API.

The problem is that, by default, it recognises things like 1, 2, 3, 4, 5 as "won", "too", "free", "fore", 5 ... no good!

I believe I want what is called a grammar? Or something like the number slot types Amazon uses in Alexa? I've looked over the Cloud Speech docs and can't find it. The only thing I could think of is looping over the alternatives returned and checking whether any of them is an int rather than a word. And if none are, then what?

Thanks.

A.Queue · Accepted Answer

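Try adding speechContexts to the recognition config. It lets you supply a few phrases that you think are most probable (here, the digits), which biases recognition toward them. Note that these are hints rather than a strict grammar, so other words can still come back.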
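As a rough sketch of how that could look, the snippet below builds a JSON request body for the v1 REST speech:recognize method with the digits as speechContexts phrases, then scans the returned alternatives for the first one that resolves to a digit. Several things here are assumptions, not part of the answer above: the helper names (build_request, to_digit, pick_digit), the LINEAR16 encoding (use whatever your telephony audio actually is, e.g. MULAW), the en-US language code, and the HOMOPHONES table, which is a hypothetical fallback you would tune to your own misrecognitions. Sending the request and authenticating are left out.

```python
import base64

# Spoken forms of 0-9, sent together with the numerals as recognition hints.
DIGIT_WORDS = ["zero", "one", "two", "three", "four",
               "five", "six", "seven", "eight", "nine"]

# Hypothetical homophone table for the fallback; extend it with whatever
# misrecognitions show up in your own audio.
HOMOPHONES = {"oh": 0, "won": 1, "too": 2, "to": 2, "free": 3,
              "fore": 4, "for": 4, "ate": 8}


def build_request(audio_bytes, encoding="LINEAR16", sample_rate=8000):
    """Build the JSON body for POST https://speech.googleapis.com/v1/speech:recognize.

    The encoding default is an assumption; match it to the real audio format.
    """
    phrases = DIGIT_WORDS + [str(n) for n in range(10)]
    return {
        "config": {
            "encoding": encoding,
            "sampleRateHertz": sample_rate,
            "languageCode": "en-US",
            "maxAlternatives": 10,
            "speechContexts": [{"phrases": phrases}],
        },
        "audio": {"content": base64.b64encode(audio_bytes).decode("ascii")},
    }


def to_digit(transcript):
    """Map one transcript to an int 0-9, or None if it is not a single digit."""
    text = transcript.strip().lower().strip(".,!?")
    if len(text) == 1 and text in "0123456789":
        return int(text)
    if text in DIGIT_WORDS:
        return DIGIT_WORDS.index(text)
    return HOMOPHONES.get(text)


def pick_digit(response):
    """Return the first alternative that resolves to a digit, else None."""
    for result in response.get("results", []):
        for alt in result.get("alternatives", []):
            digit = to_digit(alt.get("transcript", ""))
            if digit is not None:
                return digit
    return None
```

If pick_digit returns None, nothing in the response looked like a digit; in a telephony flow the simplest option is probably to re-prompt the caller rather than guess.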
