Reputation: 1
How would I easily convert English audio files to IPA (phonetic alphabet) with time stamps on Windows?
Everything I find is way out of date. Even similar questions here on stack are out of date. Most stuff doesn't even work anymore, like Python's sphinx.
Or worst of all requires dependencies which would be out of reach for most people or otherwise be for the entirely wrong operating system.
I want to get from my audio files, typically wavs, the phonemes associated with each sound and a time stamp for those spoken phonemes. I am doing this to make it easier for me to model and rig 3d actors.
An example would be the word "Hospital" becoming
phonetic_dict = {
"0.000": "/h/",
"0.001": "/ɒ/",
"0.002": "/s/",
"0.003": "/p/",
"0.004": "/ɪ/",
"0.005": "/t/",
"0.006": "/ə/",
"0.007": "/l/"
}
Of course this is just an example and actual IPA would typically have these phonemes less separated as individual letters, as the sounds matter, the text does not.
I'd prefer something with a GUI that's relatively simple, easy to use and somewhat streamline.
I have looked into Gentle, CMUSphinx, some other stuff on github, even Simon... etc. most of this is either simply outdated, no longer supported, or straight up no longer functions in any capacity or has been yeeted off the face of the earth.
Upvotes: 0
Views: 479