Reputation: 2147
A. What I am trying to implement.
A web application that performs real-time speech recognition inside a web browser (like this).
B. Technologies I am currently thinking of using to achieve A.
C. Very basic workflow
D. Questions
Upvotes: 12
Views: 17577
Reputation: 6982
You should contact Andre Natal, who has shown demos similar to this at last fall's Firefox Summit, and is now on a Google Summer of Code project implementing offline speech recognition in Firefox/FxOS: http://cmusphinx.sourceforge.net/2014/04/speech-projects-on-gsoc-2014/
Upvotes: 6
Reputation: 25220
Would Node.js be suitable to achieve C?
Yes, though it is not a hard requirement. Some people run their servers with GStreamer; for an example, see
http://kaljurand.github.io/dictate.js/
Node should work fine too.
How could I pass received audio streams from my Node server to a speech recognition engine running separately from the server?
There are many ways to do node-to-node communication. One of them is http://socket.io; plain sockets are another. Which framework to pick depends on your fault-tolerance and scalability requirements.
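To illustrate the plain-sockets option, here is a minimal sketch using Node's built-in net module. The length-prefixed framing, the function names (encodeFrame, createFrameDecoder, startRecognizerServer, connectToRecognizer), and the port and host arguments are all assumptions made for this example, not part of any existing project. It only moves audio chunks between the two processes; feeding them to a recognizer is left to the onAudio callback.

```javascript
const net = require('net');

// Assumed wire format: 4-byte big-endian length, then the raw audio bytes.
// TCP is a byte stream, so framing is needed to recover chunk boundaries.
function encodeFrame(chunk) {
  const header = Buffer.alloc(4);
  header.writeUInt32BE(chunk.length, 0);
  return Buffer.concat([header, chunk]);
}

// Returns a function that accepts arbitrary slices of the stream and
// calls onFrame once for every complete frame it has accumulated.
function createFrameDecoder(onFrame) {
  let pending = Buffer.alloc(0);
  return function feed(data) {
    pending = Buffer.concat([pending, data]);
    while (pending.length >= 4) {
      const size = pending.readUInt32BE(0);
      if (pending.length < 4 + size) break; // wait for the rest of the frame
      onFrame(pending.subarray(4, 4 + size));
      pending = pending.subarray(4 + size);
    }
  };
}

// Recognition process: listens for the web server and hands each audio
// chunk to onAudio (hypothetical hook where the engine would be fed).
function startRecognizerServer(port, onAudio) {
  const server = net.createServer((socket) => {
    socket.on('data', createFrameDecoder(onAudio));
  });
  return server.listen(port);
}

// Web server process: returns a send(chunk) function that forwards audio
// received from the browser to the recognition process.
function connectToRecognizer(port, host) {
  const socket = net.connect(port, host);
  return (chunk) => socket.write(encodeFrame(chunk));
}
```

In this sketch the web server would call connectToRecognizer once at startup and then call the returned function for every audio chunk it receives from the browser. There is no reconnection or backpressure handling, which is where the fault-tolerance and scalability requirements mentioned above would start to matter.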
Could my speech recognition engine be running as another Node application (if I use Pocketsphinx)? So my Node server communicates to my Node speech recognition server.
Yes, sure. You can create a Node module that wraps the pocketsphinx API.
UPDATE: check this out; it should be close to what you need:
http://github.com/cmusphinx/node-pocketsphinx
Upvotes: 9