Microsoft Web-Chat + SpeechSynthesizer (for text highlight or visemes)

I'm trying to add features to a bot that uses Web Chat with speech and real-time text highlighting (or visemes).

On one hand, I have this working sample, which can select a voice, synthesize text, and exposes several events (synthesisStarted, wordBoundary, visemeReceived, etc.): https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/js/browser/synthesis.html

On the other hand, I have this second working sample: https://microsoft.github.io/BotFramework-WebChat/03.speech/b.cognitive-speech-services-js/

The problem I encounter is that the second sample, even though it also relies on Cognitive Speech Services, seems to use the speech capabilities more on the "server side" than on the "client side". For example, to change the voice I had to modify the server-side code to generate SSML with a voice element; I haven't found a way to do it on the client side!

Put differently: with Web Chat, is there a way to manage speech synthesis from the client side with the same SpeechSDK.SpeechSynthesizer class and its existing JavaScript events, like wordBoundary or the new visemeReceived?
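For reference, a minimal sketch of the client-side pattern from the first sample, with the events in question. The key/region values and voice name are placeholders, and the SpeechSDK global comes from the browser bundle:

```javascript
// Sketch of the standalone (client-side) Speech SDK pattern, assuming the
// browser bundle exposes the SpeechSDK global; key/region are placeholders.

// Speech SDK events report offsets in ticks (100 ns); convert to milliseconds.
function ticksToMs(ticks) {
  return ticks / 10000;
}

if (typeof SpeechSDK !== 'undefined') {
  const speechConfig = SpeechSDK.SpeechConfig.fromSubscription('YOUR_KEY', 'YOUR_REGION');
  speechConfig.speechSynthesisVoiceName = 'en-US-JennyNeural';

  const synthesizer = new SpeechSDK.SpeechSynthesizer(speechConfig);

  // Fired for each word boundary during synthesis.
  synthesizer.wordBoundary = (s, e) => {
    console.log(`wordBoundary: "${e.text}" at ${ticksToMs(e.audioOffset)} ms`);
  };

  // Fired for each viseme (lip-sync) event.
  synthesizer.visemeReceived = (s, e) => {
    console.log(`viseme ${e.visemeId} at ${ticksToMs(e.audioOffset)} ms`);
  };

  synthesizer.speakTextAsync('Hello world', () => synthesizer.close());
}
```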


Addition: I can also confirm that if you modify the 03.speech/e.select-voice sample and remove/comment the lines

  selectVoice: (voices, activity) =>
    activity.locale === 'zh-HK'
      ? voices.find(({ name }) => /TracyRUS/iu.test(name))
      : voices.find(({ name }) => /JessaNeural/iu.test(name)) ||
        voices.find(({ name }) => /Jessa/iu.test(name)),
 

the sample still works and you hear English and Japanese, because the voices are specified in the SSML received from the bot!
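For context, the voice travels in the SSML the bot puts in the activity's speak property. A hypothetical sketch of how such a speak string can be built (the helper name and voice are illustrative, not from the sample):

```javascript
// Illustrative helper: build an SSML "speak" string with an explicit voice,
// as a bot can send in an activity's speak property for Web Chat to play.
function buildSpeak(text, voiceName, lang) {
  return (
    `<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="${lang}">` +
    `<voice name="${voiceName}">${text}</voice>` +
    `</speak>`
  );
}

console.log(buildSpeak('Hello!', 'en-US-JessaNeural', 'en-US'));
```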

Upvotes: 0

Views: 517

Answers (1)

Steven Kanberg

Reputation: 6393

Cognitive Services Speech is baked into Web Chat as one of its bundled components, so it's not so much running server-side as it is embedded in the CDN package. The reason you are able to pass SSML-marked-up activities from the bot is that the Web Chat implementation is configured to read any markup presented in an activity.

That being said, you don't have to specify the voice from within the bot. As the 03.speech/e.select-voice Web Chat sample shows, you can do so from within Web Chat's renderer.
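For reference, selectVoice is a plain function that receives the available voices and the current activity; a sketch mirroring the sample's logic (the mock call at the end is illustrative):

```javascript
// selectVoice as in the 03.speech/e.select-voice sample: choose a voice
// based on the activity's locale. It is passed as a prop to renderWebChat.
const selectVoice = (voices, activity) =>
  activity.locale === 'zh-HK'
    ? voices.find(({ name }) => /TracyRUS/iu.test(name))
    : voices.find(({ name }) => /JessaNeural/iu.test(name)) ||
      voices.find(({ name }) => /Jessa/iu.test(name));

// Illustrative use with mock voice objects:
const voices = [{ name: 'zh-HK-TracyRUS' }, { name: 'en-US-JessaNeural' }];
console.log(selectVoice(voices, { locale: 'zh-HK' }).name); // 'zh-HK-TracyRUS'
console.log(selectVoice(voices, { locale: 'en-US' }).name); // 'en-US-JessaNeural'
```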

As far as wordBoundary is concerned, it does not appear to be supported at this time. Looking in the SDK, in createCognitiveServicesSpeechServicesPonyfillFactory.js, there is no reference to wordBoundary, nor anywhere else in the project.

If this is a feature you would like to see in Web Chat, I would suggest you open a feature request in the repo for inclusion in a future release.

Upvotes: 1
