Reputation: 10828
What is the solution to detect specific words from audio file?
I have a lot of audio file (same codec) and each file is only about 15 seconds long. (Note: all audio files are the same person / same accent)
For example:
test1.mp3 play Hello Tom, what are you doing today?
test2.mp3 play Hello Paul, what are you doing today?
test3.mp3 play Good morning John - It is lovely weather today
I need a way to detect lovely weather or what are you words from each audio file.
I may have 100 audio files say "what are you doing today" and other files say "what are you doing today?" - I just need to know what what the status/type of each file are..
What is the solution to check frequency bits exist rather than using Voice Recognition tool.
Upvotes: 2
Views: 1820
Reputation: 35663
You are essentially asking "How can I do general purpose speech recognition"?
The solutions are:
If your platform provides speech-recognition out of the box, use that. Microsoft Windows does, for example. http://msdn.microsoft.com/en-us/library/hh323805.aspx
If your platform does not, then you need to integrate a third party speech recognition package, such as Lernaut & Hauspie (now Nuance), Dragon, etc. This will likely involve paying money.
Edit: I have flagged this as a duplicate of Text-to-speech (voice generation) and speech-to-text (voice recognition) APIs?, which has a comprehensive answer to "how can I do speech recognition".
Upvotes: 2