Win Coder
Win Coder

Reputation: 6766

Extending Sphinx to support a local language

Initially i was thinking of writing a speech recognition engine from scratch(with support of 50-100 words)to support my native language.

However after some research it has become clear that it is not possible to make a speech recognition engine even with very limited support in 1.5 years.

Now i was thinking of extending the sphinx engine to support my language. Is it possible to do in 1.5 years or even this is too much work for a final year project.

I am anxiously waiting for your experiences with regards to this matter.

Upvotes: 3

Views: 1126

Answers (2)

vjaivox
vjaivox

Reputation: 41

Yes, it is certainly possible. I made a similar recognizer for Chatino for my senior thesis. (Chatino is an indigenous language from Oaxaca in southern Mexico). The recognizer includes both an isolated word recognizer and a continuous speech recognizer built using Sphinx4.

See http://www.jaimalayalam.com/papers/chatinoVoiceRecognition09.pdf for details.

Upvotes: 2

Luis Uebel
Luis Uebel

Reputation: 44

Time to make a speech recognition will depend of your application.

You will need:

  1. Define what are the words that you want to recognize;
  2. Write a phonetic dictionary for these words;
  3. Record words with several native speakers;
  4. Validate recorded data;
  5. Prepare data to train acoustic models;
  6. Produce grammar or language model (in this case it is necessary to record phonetic balanced words / sentences);
  7. Train acoustic models;
  8. Test your system;
  9. Make adjustments and tuning for grammar and acoustic models (speaker adaptation);
  10. Learn how to make all 9 topics above. :)

Item 10 is the most time consuming task!!!

Answer: Yes, it is possible to make in 3 months for a commercial application.

Sphinx is one possibility, HTK is an excellent open source speech recognition system to train and test a complete system. Julius is an open source speech recognizer (engine) that uses acoustic and language models built with HTK.

Luis Uebel

ASR Labs - www.asrlabs.com.br

Upvotes: 2

Related Questions