koko

Reputation: 23

Binary classification of audio .wav files

Hey, I'm a total layman when it comes to audio processing, so my question will be very basic. I have audio samples from two groups, X and Y, as .wav files, and I need to build a model that correctly classifies whether a sound belongs to X or Y. I found out how to load the data into a list, then converted it to a DataFrame with 2 columns (each row of the second column holds 8000 elements):

       0    1
0   2000    [0.1329449, 0.14544961, 0.19810106, 0.21718721...
1   2000    [-0.30273795, -0.6065889, -0.4967722, -0.47117...
2   2000    [-0.07037315, -0.6685449, -0.48479277, -0.4535...
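
For reference, this is roughly how I build it, assuming the first column holds the sample rate (the folder name samples is just a placeholder):

    import os
    import pandas as pd
    import scipy.io.wavfile as sw

    rows = []
    for fname in os.listdir('samples'):                  # placeholder folder holding the .wav files
        rate, signal = sw.read(os.path.join('samples', fname))
        rows.append([rate, signal])                      # column 0: sample rate, column 1: the 8000 samples

    df = pd.DataFrame(rows)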

These are the features from the python_speech_features module that I have found useful so far:

    import pandas as pd
    import python_speech_features as psf
    import scipy.io.wavfile as sw

    rate, signal = sw.read(i)                                  # i is the path to one .wav file
    mfcc = psf.base.mfcc(signal, samplerate=rate)              # mel-frequency cepstral coefficients
    fbank, energy = psf.base.fbank(signal, samplerate=rate)    # mel filterbank energies
    logfbank = psf.base.logfbank(signal, samplerate=rate)      # log filterbank energies
    mfcc = psf.base.lifter(mfcc, L=22)                         # cepstral liftering of the MFCCs
    delta = psf.base.delta(mfcc, N=13)                         # delta (trajectory) features
    features = pd.DataFrame(mfcc)                              # frames x coefficients for this file
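That snippet runs once per file, so i is the path to a single .wav sample, e.g. from a loop like this (again with a placeholder folder name):

    import glob
    import python_speech_features as psf
    import scipy.io.wavfile as sw

    all_mfcc = []
    for i in glob.glob('samples/*.wav'):                 # placeholder folder with the .wav files
        rate, signal = sw.read(i)
        all_mfcc.append(psf.base.mfcc(signal, samplerate=rate))
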
  1. What other kinds of features should I extract from the audio files?
  2. What is worth visualizing here to unveil some patterns? E.g., can I visualize some feature that shows the difference between X and Y?
  3. What is the best way to do this classification: is a neural network better, or will traditional models suffice?

I will appreciate any kind of help. Additional resources for self-learning are highly welcome as well.

Upvotes: 1

Views: 1714

Answers (1)

pietz

Reputation: 2533

I've had great success in converting audio files to melspectrograms and using a basic CNN to classify the images. The following function requires the librosa library:

    import librosa as lr

    def audio_to_image(path, height=192, width=192):
        # load and resample the audio (librosa defaults to 22050 Hz)
        signal, sr = lr.load(path, res_type='kaiser_fast')
        # pick a hop length so the spectrogram comes out ~10% wider than needed
        hl = signal.shape[0] // (width * 1.1)
        # mel spectrogram with one mel band per pixel row
        spec = lr.feature.melspectrogram(y=signal, sr=sr, n_mels=height, hop_length=int(hl))
        # log-scale the power, roughly matching human loudness perception
        img = lr.power_to_db(spec) ** 2
        # center-crop to the target width, trimming ~5% of frames from each side
        start = (img.shape[1] - width) // 2
        return img[:, start:start + width]
  1. Load the audio file
  2. Choose the hop length so the spectrogram comes out about 10% wider than the target width
  3. Create a melspectrogram from the audio signal
  4. Log-scale the amplitude, similar to human hearing
  5. Cut away 5% from the beginning and the end to handle silence
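
To build a training set, you can run the function over all files of both groups and stack the results. A minimal sketch, where the folder names group_x and group_y are placeholders for your two classes:

    import glob
    import numpy as np

    X, y = [], []
    for label, folder in enumerate(['group_x', 'group_y']):   # placeholder folders for the two classes
        for path in glob.glob(folder + '/*.wav'):
            X.append(audio_to_image(path))
            y.append(label)

    X = np.array(X)[..., np.newaxis]   # shape: (n_samples, 192, 192, 1), one channel
    y = np.array(y)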

A single spectrogram will look something like this:

[image: melspectrogram]

While there is little human intuition behind these images, CNNs can classify them fairly well. Play a little with different resolutions and settings.
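
For the network itself, something small is enough to get started. A minimal sketch, assuming Keras and the X/y arrays from the snippet above; the layer sizes are just starting values to tune:

    from tensorflow.keras import layers, models

    model = models.Sequential([
        layers.Input(shape=(192, 192, 1)),
        layers.Conv2D(16, 3, activation='relu'),
        layers.MaxPooling2D(),
        layers.Conv2D(32, 3, activation='relu'),
        layers.MaxPooling2D(),
        layers.Conv2D(64, 3, activation='relu'),
        layers.MaxPooling2D(),
        layers.Flatten(),
        layers.Dense(64, activation='relu'),
        layers.Dense(1, activation='sigmoid'),   # binary output: group X vs. group Y
    ])
    model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])
    model.fit(X, y, epochs=10, validation_split=0.2)

Let me know how this works for you.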

EDIT: Here is the full code of my own project, which classifies audio samples of speech by their spoken language.

Upvotes: 2
