Reputation: 71
I am having difficulty encoding a FLAC audio file to base64 when posting a json string to the google speech api. I noticed a few \n's in the reply from google, not sure if the base64 isn't good enough going in or perhaps I don't fully grasp how to build such strings and make them json-y enough to satisfy google. I'm leaning towards utf-8 encoding but my last attempt at further processing the information to be sent left me with mountains of other error messages from perl. Any pointers at all would be a great help! (even if the pointer is to give up on doing this on my own and pay the support fee to google)
Error message:
{
"error": {
"code": 400,
"message": "Invalid value at 'audio.content' (TYPE_BYTES), Base64 decoding failed for "ZkxhQwAAACIQABAAAAlJABQpAfQA8AABS+DDBqlWu7Ba27gz/koR4+04AwAAJAAAAAAAAAAAAAAA\nAAAAAAAQAAAAAAAAATAAAAAAAAABQ5kQAAQAACggAAAAcmVmZXJlbmNlIGxpYkZMQUMgMS4zLjAg\nMjAxMzA1MjYAAAAAgQA...
My code is:
#!/usr/bin/env perl
# Render speech to text using the google cloud speech engine.
#
# Kruft Industries Sept. 2016
#
#
# Intended to replace work by the following(not sure where this is hosted): GNU General Public License Version 2 Copyright (C) 2011 - 2012, Lefteris Zafiris
# <[email protected]>
#
#
# The script takes as input flac files at 8kHz and returns the following values: status : Return status. 0 means success, non zero values indicating different
# errors.
#
# Outputs a voice transcription that satisfies the input of sendmailmp3 for freepbx authored by the above Zafiris I am by no means an expert with the perl
# language, Please forgive any blaring ugliness :)
use utf8;
use MIME::Base64;
use strict;
use warnings;
use LWP::UserAgent;
if (!@ARGV || $ARGV[0] eq '-h' || $ARGV[0] eq '--help') {
print "Speech recognition using google cloud speech api.\n\n";
print "Usage: $0 [FILES]\n\n";
exit;
}
my $url = "https://speech.googleapis.com/v1beta1/speech:syncrecognize?key=API KEY HERE";
my @file_list = @ARGV; foreach my $file
(@file_list) {
print "Opening $file\n";
open(my $fh, "<", "$file") or die "Cant read file: $!";
my $audio = do { local $/; <$fh> };
close($fh);
my $flac = encode_base64url($audio);
my $json = '{"config":{"encoding":"FLAC","sample_rate":8000,"language_code":"en-US"},"audio":{"content":"' . $flac . '"}}';
my $req = HTTP::Request->new( 'POST', $url );
$req->header( 'Content-Type' => 'application/json' );
$req->content( $json );
my $lwp = LWP::UserAgent->new;
my $response = $lwp->request($req);
print $response->as_string; #debug output google's reply headers and message
last if (!$response->is_success);
print $response->content; #debug output the full transcript
my $doodle = $response->content;
$doodle =~ s/.*\"transcript\"://g;
$doodle =~ s/}\],.*//g;
$doodle =~ s/^{\"result\":\[\]}/{\"result\":/g;
$doodle =~ s/\R//g;
$doodle =~ s/\*/_/g;
print $doodle;
}
sub encode_base64url{
my($data) = @_;
return 0 unless $data;
$data = encode_base64($data);
$data =~ tr#\-_#+/#;
return($data);
}
exit;
Upvotes: 2
Views: 1580
Reputation: 71
Here is the corrected script. This should help out quite a few, I might update the title and description to prevent duplicate questions! Thank you for pointing me in the right direction, Christopher Oicles!
#!/usr/bin/env perl
# Render speech to text using the google cloud speech engine.
#
# Kruft Industries Sept. 2016
#
#
# Intended to replace work by the following(not sure where this is hosted): GNU General Public License Version 2 Copyright (C) 2011 - 2012, Lefteris Zafiris
# <[email protected]>
#
#
# The script takes as input flac files at 8kHz and returns the following values: status : Return status. 0 means success, non zero values indicating different
# errors.
#
# Outputs a voice transcription that satisfies the input of sendmailmp3 for freepbx authored by the above Zafiris I am by no means an expert with the perl
# language, Please forgive any blaring ugliness :)
use utf8;
use MIME::Base64;
use strict;
use warnings;
use LWP::UserAgent;
if (!@ARGV || $ARGV[0] eq '-h' || $ARGV[0] eq '--help') {
print "Speech recognition using google cloud speech api.\n\n";
print "Usage: $0 [FILES]\n\n";
exit;
}
my $url = "https://speech.googleapis.com/v1beta1/speech:syncrecognize?key=API KEY GOES HERE";
my @file_list = @ARGV; foreach my $file
(@file_list) {
print "Opening $file\n";
open(my $fh, "<", "$file") or die "Cant read file: $!";
my $audio = do { local $/; <$fh> };
close($fh);
my $flac = encode_base64url($audio);
my $json = '{"config":{"encoding":"FLAC","sample_rate":8000,"language_code":"en-US"},"audio":{"content":"' . $flac . '"}}';
my $req = HTTP::Request->new( 'POST', $url );
$req->header( 'Content-Type' => 'application/json' );
$req->content( $json );
my $lwp = LWP::UserAgent->new;
my $response = $lwp->request($req);
#print $response->as_string; #debug output google's reply headers and message
last if (!$response->is_success);
#print $response->content; #debug output the full transcript
my $doodle = $response->content;
$doodle =~ s/.*\"transcript\"://g;
$doodle =~ s/}\],.*//g;
$doodle =~ s/^{\"result\":\[\]}/{\"result\":/g;
$doodle =~ s/\R//g;
$doodle =~ s/\*/_/g;
print $doodle;
}
sub encode_base64url{
my($data) = @_;
return 0 unless $data;
$data = encode_base64($data);
$data =~ s/\+/-/g;
$data =~ s/\//_/g;
$data =~ s/\=//g;
$data =~ s/\n//g;
return($data);
}
exit;
Upvotes: 2