How to get log mel spectrogram of specific shape using librosa

Question

I have some audio files which I want to convert to log mel spectogram. I need the log mel spectogram to be in the shape of (512,512). I changed the n_mels to 512, to get the first dimension 512 but I am unable to change the second dimension to 512 for all audios. I tried experimenting with hop_length values by trial and error, in some audio files it work and in the others it doesn't.How do we get log mel spectrogram of specific shape using librosa?

path = "path/to/my/file"
scale, sr = librosa.load(path)
mel_spectrogram = librosa.feature.melspectrogram(scale, sr, n_fft=2048, hop_length=512, n_mels=512, fmax=8000)
log_mel_spectrogram = librosa.power_to_db(mel_spectrogram)
librosa.display.specshow(log_mel_spectrogram, x_axis="time", y_axis="mel", sr=sr) ```

How to get log mel spectrogram of specific shape using librosa

Answers (1)

Related Questions