Reputation: 1
Does anyone know why the WER does not decay? I'm trying to finetune the OpenAI Whisper medium model for low resource language.
I'm using the following parameters:
per_device_train_batch_size="32"
per_device_eval_batch_size="16"
learning_rate="1e-5"
Upvotes: 0
Views: 137