NedStarkOfWinterfell
NedStarkOfWinterfell

Reputation: 5203

Google Speech API doesn't give correct result when audio is sent in file

I chanced upon the article at Google Speech API which suggested a mechanism for extracting text from audio file through Perl. Now I have recorded a audio file, which you will find at http://vocaroo.com/i/s0lPN5d3YQJj. It is a simple piece of audio, reading I love you. When I go to the Google speech API in Chrome, and speak those words, I get the right result. When I try the code at the above mentioned link with the audio file I pointed out, it returns strange results, like logan. How can I make it more accurate? This is just a sample audio, what I am generally doing is extracting the audio from a video file through FFMpeg using something like ffmpeg -i input.avi -vn -ar 44100 -ac 2 -ab 192 -f mp3 output.mp3, followed by ffmpeg -i input.mp3 output.flac.

Upvotes: 1

Views: 484

Answers (1)

Borodin
Borodin

Reputation: 126742

Have you tried playing the audio files you are creating?

You are setting an audio bitrate of 192 bits/second which is ridiculously low.

For 192Kbps you need -ab 196608.

Upvotes: 2

Related Questions