Amin
Amin

Reputation: 31

How can i pad wav file to specific length?

I am using wave files for making deep learning model they are in different length , so i want to pad all of them to 16 sec length using python

Upvotes: 1

Views: 6261

Answers (3)

Maria
Maria

Reputation: 309

You can use Librosa. The Librosa.util.fix_length function adds silent patch to audio file by appending zeros to the end the numpy array containing the audio data:

from librosa import load
from librosa.util import fix_length


file_path = 'dir/audio.wav'

sf = 44100 # sampling frequency of wav file
required_audio_size = 5 # audio of size 2 second needs to be padded to 5 seconds
audio, sf = load(file_path, sr=sf, mono=True) # mono=True converts stereo audio to mono
padded_audio = fix_length(audio, size=5*sf) # array size is required_audio_size*sampling frequency


print('Array length before padding', np.shape(audio))
print('Audio length before padding in seconds', (np.shape(audio)[0]/fs))
print('Array length after padding', np.shape(padded_audio))
print('Audio length after padding in seconds', (np.shape(padded_audio)[0]/fs))

Output:

Array length before padding (88200,)
Audio length before padding in seconds 2.0
Array length after padding (220500,)
Audio length after padding in seconds 5.0

Although after looking through a number of similar questions, it seems like pydub.AudioSegment is the go to solution.

Upvotes: 0

J Agustin Barrachina
J Agustin Barrachina

Reputation: 4090

If I understood correctly, the question wants to fix all lengths to a given length. Therefore, the solution will be slightly different:

from pydub import AudioSegment

pad_ms = 1000  # Add here the fix length you want (in milliseconds)
audio = AudioSegment.from_wav('you-wav-file.wav')
assert pad_ms > len(audio), "Audio was longer that 1 second. Path: " + str(full_path)
silence = AudioSegment.silent(duration=pad_ms-len(audio)+1)


padded = audio + silence  # Adding silence after the audio
padded.export('padded-file.wav', format='wav')

This answer differs from this one in the sense that this one creates all audios from the same length where the other adds the same size of silence at the end.

Upvotes: 3

Vikrant Sharma
Vikrant Sharma

Reputation: 419

Using pydub:

from pydub import AudioSegment

pad_ms = 1000  # milliseconds of silence needed
silence = AudioSegment.silent(duration=pad_ms)
audio = AudioSegment.from_wav('you-wav-file.wav')

padded = audio + silence  # Adding silence after the audio
padded.export('padded-file.wav', format='wav')

AudioSegment objects are immutable

Upvotes: 1

Related Questions