Sushil Kumar
Sushil Kumar

Reputation: 11

How to feed speech files into RNN/LSTM for speech recognition?

I am working on RNN/LSTMs. I have done a simple project with RNN in which i input text into RNNs. But i don't know how to input speech into RNNs and how to preprocess speeches for recurrent networks. I have read many articles from medium and other sites. But i am not able to use speech in the networks. You can share any project in which speech and RNN/LSTMs or anything that can help me.

Upvotes: 1

Views: 2132

Answers (1)

xashru
xashru

Reputation: 3590

You will need to convert raw audio signal into spectrogram or some other convenient format that is easier to process using RNN/LSTMS. This medium blog should be helpful. You can look at this github repo for implementation.

Upvotes: 1

Related Questions