Mixing languages in the same SSML

Question

If I send this small piece of SSML to the speech processor I get two voices


  
    
        Hola 
        Hello
        ¿Cómo estas?.

A man in Spanish and a woman in English. Is this a limitation of the Project Oxford Text to Speech engine? In other words, I would expect the same voice to speak several languages, but it looks like this is not the case.

cthrash · Accepted Answer

To quote the SSML spec,

Specifying xml:lang does not imply a change in voice, though this may indeed occur. When a given voice is unable to speak content in the indicated language, a new voice may be selected by the processor.

While the current fallback behavior leaves something to be desired, the recommendation is to create multiple voice nodes and pick a voice explicitly for each language, rather than relying on xml:lang alone to trigger a voice switch.
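
As a rough illustration of the "multiple voice nodes" approach, here is a minimal sketch that builds such a document with Python's standard `xml.etree.ElementTree`. The helper name `build_ssml` and the voice names are placeholders I made up for the example, not real service identifiers; substitute the voice names your speech service actually lists. The service may also require extra attributes on `speak` (for example an SSML namespace declaration), which this sketch leaves out.

```python
import xml.etree.ElementTree as ET

# Namespace URI that the reserved "xml:" prefix is bound to.
XML_NS = "http://www.w3.org/XML/1998/namespace"


def build_ssml(segments, default_lang="en-US"):
    """Return an SSML string with one <voice> element per segment.

    segments: iterable of (lang, voice_name, text) tuples.
    build_ssml is a hypothetical helper, not part of any SDK.
    """
    speak = ET.Element("speak", {"version": "1.0"})
    speak.set(f"{{{XML_NS}}}lang", default_lang)
    for lang, voice_name, text in segments:
        # Name the voice explicitly instead of relying on the
        # processor's fallback when xml:lang changes.
        voice = ET.SubElement(speak, "voice", {"name": voice_name})
        voice.set(f"{{{XML_NS}}}lang", lang)
        voice.text = text
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    # Voice names below are placeholders (assumptions), not real voices.
    print(build_ssml([
        ("es-ES", "PLACEHOLDER_SPANISH_VOICE", "Hola"),
        ("en-US", "PLACEHOLDER_ENGLISH_VOICE", "Hello"),
        ("es-ES", "PLACEHOLDER_SPANISH_VOICE", "¿Cómo estás?"),
    ]))
```

Since every segment names its voice, you decide which voice speaks each language, instead of the processor choosing one for you when the current voice cannot handle the requested xml:lang.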
