Speech API Operation Class doesn't return full results #444

adziuk · 2017-04-11T21:18:18Z

In google-cloud-php/src/Speech/Operation.php

public function results(array $options = [])
{
    $info = $this->info($options);
    return isset($info['response']['results'])
        ? $info['response']['results'][0]['alternatives']
        : [];
}

This function assumes that the results are fixed to 1 alternative, putting max_alternatives > 1, however, can result in more alternatives being returned. There are other optional settings to the RecognizeRequest that can change the contents of the results to include information other than just the alternatives, so I'm not sure how users are supposed to access that data.

The text was updated successfully, but these errors were encountered:

adziuk · 2017-04-11T23:41:06Z

Update: After reading more of this, with sufficiently long audio, this will simply return the wrong result. With long audio, there are multiple results, corresponding to sequential segments of audio, this will return all of the alternatives from the first segment, and nothing for subsequent segments.

adziuk · 2017-04-11T23:44:19Z

link to relevant code

dwsupplee · 2017-04-12T16:45:57Z

Thanks for the report @adziuk, I'll get this fixed today.

Do you know under what conditions a response is broken into multiple result sets? I was originally under the impression the multiple result sets were for streaming calls.

danaharon · 2017-04-12T19:38:04Z

When you set maxAlternatives (from https://cloud.google.com/speech-whitelist/docs/reference/rest/v1/RecognitionConfig) to a value greater than 1 then the API returns more than one alternative, regardless of whether you use recognize or LongRunningRecognize. The confidence scores for the results beyond first one are usually missing.

adziuk · 2017-04-12T21:03:44Z

Things are generally broken into multiple result sets with longer audio, Audio around 60 seconds long looks like it's generally broken into multiple "results", for example, from the attached file (LINEAR16, sample rate = 44100)
eninv_45.wav.zip

Sync Recognize response: results {
 alternatives {
   transcript: "Pediatrics is my number one career choice. In many ways, it also reflects my second, third, and fourth career choices. Educated teach and Lead young people toward success."
   confidence: 0.9123565
 }
}
results {
 alternatives {
   transcript: " Legislators draft policies that improve processes for their constituents."
   confidence: 0.93561065
 }
}
results {
 alternatives {
   transcript: " Professional golfers commit themselves to extensive study and practice to master the skills of their profession."
   confidence: 0.96668166
 }
}
results {
 alternatives {
   transcript: " as a pediatrician, I see myself incorporating all three"
   confidence: 0.9412657
 }
}

adziuk changed the title ~~Speech API Operation Class doesn't support multiple alternatives~~ Speech API Operation Class doesn't return full results Apr 11, 2017

jdpedrie added the api: speech Issues related to the Speech-to-Text API. label Apr 12, 2017

dwsupplee mentioned this issue Apr 13, 2017

Return all results and do not detect encoding/sample rate in the client #449

Merged

dwsupplee closed this as completed in #449 Apr 14, 2017

yoshi-automation added 🚨 This issue needs some love. triage me I really want to be triaged. labels Apr 7, 2020

JustinBeckwith assigned dwsupplee Feb 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speech API Operation Class doesn't return full results #444

Speech API Operation Class doesn't return full results #444

adziuk commented Apr 11, 2017

adziuk commented Apr 11, 2017

adziuk commented Apr 11, 2017

dwsupplee commented Apr 12, 2017

danaharon commented Apr 12, 2017

adziuk commented Apr 12, 2017 •

edited by jdpedrie

Loading

Speech API Operation Class doesn't return full results #444

Speech API Operation Class doesn't return full results #444

Comments

adziuk commented Apr 11, 2017

adziuk commented Apr 11, 2017

adziuk commented Apr 11, 2017

dwsupplee commented Apr 12, 2017

danaharon commented Apr 12, 2017

adziuk commented Apr 12, 2017 • edited by jdpedrie Loading

adziuk commented Apr 12, 2017 •

edited by jdpedrie

Loading