Google speech API v1beta1 (syncrecognize and asyncrecognize API call)

Question

I am a Java developer and I have couple of questions related to Google speech API V1Beta1.

Question1 (Syncrecognize case):

I tried to upload (through GCS) small size (less than one min running file) audio file to google speech api it is working But the confidence output level is 0.32497215 only. That is my result is not exactly same to my audio input.

How to increase the confidence level output?

Question 2 (Asyncrecognize case):

I tried big size audio file (more than one min running file). This case I used the API call:

https://speech.googleapis.com/v1beta1/speech:asyncrecognize?key=XXXXXXXXXXXXXXXXXXXX

and Payload:

"{"config":{"encoding":"LINEAR16","sample_rate": 16000},"audio":{"uri":"gs://" + bucketName +"/"+ objectName + ""}}"

Here I got the output json like

{"name": "57...........................95"}.

After getting this output I make new API call (Operation interface) with this name value.

https://speech.googleapis.com/v1beta1/operations/57.................................95?key=XXXXXXXXXXXXXXXXX

I got the output

{
 "name": "57....................................95",
 "done": true,
 "response": {
   "@type": "type.googleapis.com/google.cloud.speech.v1beta1.AsyncRecognizeResponse"
 }
}

How to proceed the work with this value? I need to get audio speech text.

Please help me to fix this issues. Thanks in advance.

Google speech API v1beta1 (syncrecognize and asyncrecognize API call)

Question1 (Syncrecognize case):

Question 2 (Asyncrecognize case):

Answers (1)

Related Questions