Reputation: 1
I was wondering if anyone knows of a python tool that finds phonemes from a text, as well as their duration.
In short, I want a forced alignment tool like aeneas, but I want the phonemes and their duration.
Thank you!
Upvotes: -4
Views: 310
Reputation: 31
You didn't specify what kind of data you have, but I assume it's audio files with their corresponding orthographic transcriptions.
In that case, the Montreal Forced Aligner might be suitable (there is link to the executable on that page). It is based on Kaldi, so for more robust and comprehensive solution, the kaldi-dnn-ali-gop repo provides more powerful options.
Upvotes: 1