Reputation: 23
I need to create a simple JavaScript function that captures voice input and returns the recognized text together with a confidence percentage, using the Azure Speech SDK.
My biggest problem is that I am new to coding and this is the most difficult issue I have faced, so please be kind to this humble student.
I am building a language-learning web app that uses voice input. I was able to get the Google services working the way I wanted, but unfortunately those services don't work in China, where my market is. I am also using the Phaser 3 API to build this app.
I was able to get the sample code from the Azure Speech SDK speech-to-text JavaScript repository on GitHub to work, but when I try to build my own function from that code I get: Uncaught TypeError: Cannot read property 'SpeechConfig' of undefined
I also do not know how to add a confidence level to the speech result.
recordButton.on('pointerdown', function () {
    var SDK = window.SpeechSDK;
    try {
        AudioContext = window.AudioContext      // our preferred impl
            || window.webkitAudioContext        // fallback, mostly for Safari
            || false;                           // could not find.
        if (AudioContext) {
            soundContext = new AudioContext();
            console.log("AudioContext", AudioContext);
        } else {
            alert("Audio context not supported");
        }
    }
    catch (e) {
        console.log("no sound context found, no audio output. " + e);
    }
    console.log("SpeechSDK initialized", SDK);
    speechConfig = SpeechSDK.SpeechConfig.fromSubscription(subscriptionKey, serviceRegion);
    speechConfig.speechRecognitionLanguage = "en-US";
    console.log("speechConfig", SpeechConfig);
    audioConfig = SpeechSDK.AudioConfig.fromDefaultMicrophoneInput();
    recognizer = new SpeechSDK.SpeechRecognizer(speechConfig, audioConfig);
    recognizer.recognizeOnceAsync(
        function (result) {
            console.log("result", result);
            recognizer.close();
            recognizer = undefined;
        },
        function (err) {
            console.log(err);
            recognizer.close();
            recognizer = undefined;
        });
}, this);
I need to capture the speech input, then show the words/phrases/sentences the students have said and score them based on the confidence level.
Upvotes: 0
Views: 1853
Reputation: 12153
If you want to get the confidence score for the text returned by the Speech-to-Text SDK, try the code below:
<html>
<head>
    <title>Speech SDK JavaScript Quickstart</title>
</head>
<script src="microsoft.cognitiveservices.speech.sdk.bundle.js"></script>
<body>
    <div id="warning">
        <h1 style="font-weight:500;">Speech Recognition Speech SDK not found (microsoft.cognitiveservices.speech.sdk.bundle.js missing).</h1>
    </div>
    <div id="content" style="display:none">
        <table width="100%">
            <tr>
                <td></td>
                <td><h1 style="font-weight:500;">Microsoft Cognitive Services Speech SDK JavaScript Quickstart</h1></td>
            </tr>
            <tr>
                <td align="right"><a href="https://learn.microsoft.com/azure/cognitive-services/speech-service/get-started" target="_blank">Subscription</a>:</td>
                <td><input id="subscriptionKey" type="text" size="40" value="subscription"></td>
            </tr>
            <tr>
                <td align="right">Region</td>
                <td><input id="serviceRegion" type="text" size="40" value="YourServiceRegion"></td>
            </tr>
            <tr>
                <td></td>
                <td><button id="startRecognizeOnceAsyncButton">Start recognition</button></td>
            </tr>
            <tr>
                <td align="right" valign="top">Results</td>
                <td><textarea id="phraseDiv" style="display: inline-block;width:500px;height:200px"></textarea></td>
            </tr>
        </table>
    </div>
</body>
<!-- Speech SDK USAGE -->
<script>
    // status fields and start button in UI
    var phraseDiv;
    var startRecognizeOnceAsyncButton;

    // subscription key and region for speech services.
    var subscriptionKey, serviceRegion;
    var authorizationToken;
    var SpeechSDK;
    var recognizer;

    document.addEventListener("DOMContentLoaded", function () {
        startRecognizeOnceAsyncButton = document.getElementById("startRecognizeOnceAsyncButton");
        subscriptionKey = document.getElementById("subscriptionKey");
        serviceRegion = document.getElementById("serviceRegion");
        phraseDiv = document.getElementById("phraseDiv");

        startRecognizeOnceAsyncButton.addEventListener("click", function () {
            startRecognizeOnceAsyncButton.disabled = true;
            phraseDiv.innerHTML = "";

            // if we got an authorization token, use the token. Otherwise use the provided subscription key
            var speechConfig;
            if (authorizationToken) {
                speechConfig = SpeechSDK.SpeechConfig.fromAuthorizationToken(authorizationToken, serviceRegion.value);
            } else {
                if (subscriptionKey.value === "" || subscriptionKey.value === "subscription") {
                    alert("Please enter your Microsoft Cognitive Services Speech subscription key!");
                    return;
                }
                speechConfig = SpeechSDK.SpeechConfig.fromSubscription(subscriptionKey.value, serviceRegion.value);
            }

            speechConfig.speechRecognitionLanguage = "en-US";
            speechConfig.outputFormat = 1; // OutputFormat.Detailed: the result then includes the NBest list with confidence scores

            var audioConfig = SpeechSDK.AudioConfig.fromDefaultMicrophoneInput();
            recognizer = new SpeechSDK.SpeechRecognizer(speechConfig, audioConfig);

            recognizer.recognizeOnceAsync(
                function (result) {
                    startRecognizeOnceAsyncButton.disabled = false;
                    phraseDiv.innerHTML += "Recognize Result:" + result.text +
                        "\nConfidence Score:" + JSON.parse(result.json).NBest[0].Confidence;
                    window.console.log(result);
                    recognizer.close();
                    recognizer = undefined;
                },
                function (err) {
                    startRecognizeOnceAsyncButton.disabled = false;
                    phraseDiv.innerHTML += err;
                    window.console.log(err);
                    recognizer.close();
                    recognizer = undefined;
                });
        });

        if (!!window.SpeechSDK) {
            SpeechSDK = window.SpeechSDK;
            startRecognizeOnceAsyncButton.disabled = false;
            document.getElementById('content').style.display = 'block';
            document.getElementById('warning').style.display = 'none';

            // in case we have a function for getting an authorization token, call it.
            if (typeof RequestAuthorizationToken === "function") {
                RequestAuthorizationToken();
            }
        }
    });
</script>
</html>
Run the page the same way the official doc indicates. In short, when you use the SDK you should set speechConfig.outputFormat = 1
so that you get the detailed output format from the Speech service, which includes the confidence score.
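For reference, here is a minimal sketch of pulling the text and confidence out of the detailed result. The result.json property and NBest[0].Confidence come straight from the sample above; the helper name getTextWithConfidence is just illustrative:

    // Illustrative helper: extract text + confidence from a detailed recognition result.
    // Assumes speechConfig.outputFormat = 1 (Detailed) was set before recognition.
    function getTextWithConfidence(result) {
        var detailed = JSON.parse(result.json);   // raw detailed response from the service
        var best = detailed.NBest[0];             // top recognition hypothesis
        return {
            text: result.text,                    // recognized text
            confidence: best.Confidence           // value between 0 and 1
        };
    }

    // Example use inside the success callback:
    // var scored = getTextWithConfidence(result);
    // phraseDiv.innerHTML += scored.text + " (" + Math.round(scored.confidence * 100) + "%)";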
In your code, it seems the undefined error comes from trying to print SpeechConfig,
while the variable is actually defined as speechConfig.
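So the immediate fix in your snippet is to log the variable you actually declared:

    console.log("speechConfig", speechConfig);   // lower-case "s": the variable, not an SDK class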
...
Anyway, to demonstrate getting the confidence score successfully, my code is based on the official demo. Hope it helps.
For your code, try the HTML below:
<html>
<body>
    <button id='recordButton' onclick='test()'>test</button>
</body>
<script src="microsoft.cognitiveservices.speech.sdk.bundle.js"></script>
<script>
    function test() {
        var SDK = window.SpeechSDK;
        try {
            AudioContext = window.AudioContext      // our preferred impl
                || window.webkitAudioContext        // fallback, mostly for Safari
                || false;                           // could not find.
            if (AudioContext) {
                soundContext = new AudioContext();
                console.log("AudioContext", AudioContext);
            } else {
                alert("Audio context not supported");
            }
        }
        catch (e) {
            console.log("no sound context found, no audio output. " + e);
        }
        console.log("SpeechSDK initialized", SDK);

        var speechConfig = SpeechSDK.SpeechConfig.fromSubscription("<your subscription key>", "<your service region>");
        speechConfig.speechRecognitionLanguage = "en-US";
        console.log("speechConfig", speechConfig);

        audioConfig = SpeechSDK.AudioConfig.fromDefaultMicrophoneInput();
        recognizer = new SpeechSDK.SpeechRecognizer(speechConfig, audioConfig);

        recognizer.recognizeOnceAsync(
            function (result) {
                console.log("result", result);
                recognizer.close();
                recognizer = undefined;
            },
            function (err) {
                console.log(err);
                recognizer.close();
                recognizer = undefined;
            });
    }
</script>
</html>
Result: the recognition result is logged to the browser console.
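Note that this second page does not set speechConfig.outputFormat = 1, so the result will not include an NBest list. If you also want the confidence score here, set that property before creating the recognizer and parse result.json in the success callback, exactly as in the first sample. A sketch of the changed parts, reusing the names from the code above:

    var speechConfig = SpeechSDK.SpeechConfig.fromSubscription("<your subscription key>", "<your service region>");
    speechConfig.speechRecognitionLanguage = "en-US";
    speechConfig.outputFormat = 1;   // request the Detailed format so NBest/Confidence is returned

    recognizer.recognizeOnceAsync(
        function (result) {
            var best = JSON.parse(result.json).NBest[0];
            console.log("recognized:", result.text, "confidence:", best.Confidence);
            recognizer.close();
            recognizer = undefined;
        },
        function (err) {
            console.log(err);
            recognizer.close();
            recognizer = undefined;
        });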
If my answer is helpful, click the check mark beside the answer to toggle it from greyed out to filled in and accept it. Thanks!
Upvotes: 1