Google Speech API Single Utterance

Question

How does Google Speech API's SingleUtterance work? According to the docs, it is Google's way of determining when a speaker has spoken a single utterance. I understand what it does, but I would like to know how? Does the API simply wait for a certain duration of "speechless" audio? If so, how long a duration of voiceless audio will trigger the end of an utterance?

Does it have some other sort of AI algorithm that helps determine when someone has stopped speaking?

Thanks

Google Speech API Single Utterance

Answers (1)

Related Questions