Reputation: 109
I don't have so much idea about audio detection and I just started learning it today and came across webrtcvad(this felt poorly documented :'( ) and LibROSA. The task I have to do is that given an audio file, (which can be empty or it can have noise but no speech), I have to detect whether it contains any speech. Any idea how I can give it a start? Any help would be appreciated. Thanks in advance.
Upvotes: 2
Views: 6237
Reputation: 1020
The speechmetrics
package provides two measures of absolute speech quality, the MOSNet and the SRMR. You may pass your audio excerpt to those packages, check the returned qualities for silence/noise/speech and set thresholds accordingly.
Upvotes: 0
Reputation: 1275
Sounds like a generic question. There are several possible solutions though:
EDIT: Here are some libraries for processing audio
Upvotes: 1
Reputation: 193
Could this be done through frequency analysis? If so, depending on if you're using a mp3 or wav, these are the 2 options I'm aware of:
Upvotes: 0