Modules

Nested message and enum types in CustomClass.

Nested message and enum types in PhraseSet.

Nested message and enum types in RecognitionAudio.

Nested message and enum types in RecognitionConfig.

Nested message and enum types in RecognitionMetadata.

Generated client implementations.

Nested message and enum types in StreamingRecognizeRequest.

Nested message and enum types in StreamingRecognizeResponse.

Nested message and enum types in TranscriptOutputConfig.

Structs

A set of words or phrases that represents a common concept likely to appear in your audio, for example a list of passenger ship names. CustomClass items can be substituted into placeholders that you set in PhraseSet phrases.

Describes the progress of a long-running LongRunningRecognize call. It is included in the metadata field of the Operation returned by the GetOperation call of the google::longrunning::Operations service.

The top-level message sent by the client for the LongRunningRecognize method.

The only message returned to the client by the LongRunningRecognize method. It contains the result as zero or more sequential SpeechRecognitionResult messages. It is included in the result.response field of the Operation returned by the GetOperation call of the google::longrunning::Operations service.

Provides “hints” to the speech recognizer to favor specific words and phrases in the results.

Contains audio data in the encoding specified in the RecognitionConfig. Either content or uri must be supplied. Supplying both or neither returns [google.rpc.Code.INVALID_ARGUMENT][google.rpc.Code.INVALID_ARGUMENT]. See content limits.

Provides information to the recognizer that specifies how to process the request.

Description of audio data to be recognized.

The top-level message sent by the client for the Recognize method.

The only message returned to the client by the Recognize method. It contains the result as zero or more sequential SpeechRecognitionResult messages.

Config to enable speaker diarization.

Speech adaptation configuration.

Provides “hints” to the speech recognizer to favor specific words and phrases in the results.

Alternative hypotheses (a.k.a. n-best list).

A speech recognition result corresponding to a portion of the audio.

Provides information to the recognizer that specifies how to process the request.

A streaming speech recognition result corresponding to a portion of the audio that is currently being processed.

The top-level message sent by the client for the StreamingRecognize method. Multiple StreamingRecognizeRequest messages are sent. The first message must contain a streaming_config message and must not contain audio_content. All subsequent messages must contain audio_content and must not contain a streaming_config message.

StreamingRecognizeResponse is the only message returned to the client by StreamingRecognize. A series of zero or more StreamingRecognizeResponse messages are streamed back to the client. If there is no recognizable audio, and single_utterance is set to false, then no messages are streamed back to the client.

Specifies an optional destination for the recognition results.

Word-specific information for recognized words.