Extending Sphinx to support a local language

Initially i was thinking of writing a speech recognition engine from scratch(with support of 50-100 words)to support my native language.

However after some research it has become clear that it is not possible to make a speech recognition engine even with very limited support in 1.5 years.

Now i was thinking of extending the sphinx engine to support my language. Is it possible to do in 1.5 years or even this is too much work for a final year project.

I am anxiously waiting for your experiences with regards to this matter.

Upvotes: 3

Answers (2)

vjaivox

Reputation: 41

Yes, it is certainly possible. I made a similar recognizer for Chatino for my senior thesis. (Chatino is an indigenous language from Oaxaca in southern Mexico). The recognizer includes both an isolated word recognizer and a continuous speech recognizer built using Sphinx4.

See http://www.jaimalayalam.com/papers/chatinoVoiceRecognition09.pdf for details.

Upvotes: 2

Luis Uebel

Reputation: 44

Time to make a speech recognition will depend of your application.

You will need:

Define what are the words that you want to recognize;
Write a phonetic dictionary for these words;
Record words with several native speakers;
Validate recorded data;
Prepare data to train acoustic models;
Produce grammar or language model (in this case it is necessary to record phonetic balanced words / sentences);
Train acoustic models;
Test your system;
Make adjustments and tuning for grammar and acoustic models (speaker adaptation);
Learn how to make all 9 topics above. :)

Item 10 is the most time consuming task!!!

Answer: Yes, it is possible to make in 3 months for a commercial application.

Sphinx is one possibility, HTK is an excellent open source speech recognition system to train and test a complete system. Julius is an open source speech recognizer (engine) that uses acoustic and language models built with HTK.

Luis Uebel

ASR Labs - www.asrlabs.com.br

Upvotes: 2

Extending Sphinx to support a local language

Answers (2)

Related Questions