Leonardo Kenji Shikida
Leonardo Kenji Shikida

Reputation: 751

Using different intonations with Watson text to speech

I am developing a PoC using Watson text to speech and Watson conversation.

Sometimes, the chatbot needs to ask a question, so I'd like text to speech to synthesize the voice using an interrogation intonation.

Is it possible to be done?

Upvotes: 3

Views: 1014

Answers (3)

Radek Kazbunda
Radek Kazbunda

Reputation: 11

Unfortunately Concatenative TTS may have problems to create correct intonation in questions. If you think it happens consistently or too often please open a bug. If you have a specific question which gets incorrect intonation try to rephrase it a little bit if possible. A useful trick for this voice could be to use double question mark '??'

Upvotes: 1

Simon O'Doherty
Simon O'Doherty

Reputation: 9359

Watson Text to Speech supports SSML, and has expressive SSML tags.

The one you want to use is Uncertainty. As it is defined as "conveys an uncertain, interrogative message".

Example:

<express-as type="Uncertainty">
  Could she still be in the office? She told me that she might leave early.
</express-as>

More details on it's usage is here:

https://console.bluemix.net/docs/services/text-to-speech/SSML-expressive.html#the-express-as-element

Upvotes: 3

data_henrik
data_henrik

Reputation: 17118

Yes, you can certainly use text-to-speech (TTS) for output and speech-to-text (STT) for input. You would need to use a middleware or app layer to drive the conversation and route the input/output to the other services (see "how to use" in the docs).

I have used the following TJBot recipe as a simple and good started for some projects: https://github.com/damiancummins/tell_the_time

Upvotes: 1

Related Questions