CONTEXT-BASED SPEECH RECOGNITION
First Claim
Patent Images
1. A computer-implemented method comprising:
- receiving an audio signal encoding a portion of an utterance;
receiving context information associated with the utterance, wherein the context information is not derived from the audio signal or any other audio signal;
providing, as input to a neural network, data corresponding to the audio signal and the context information; and
generating a transcription for the utterance based on at least an output of the neural network.
2 Assignments
0 Petitions
Accused Products
Abstract
A processing system receives an audio signal encoding a portion of an utterance. The processing system receives context information associated with the utterance, wherein the context information is not derived from the audio signal or any other audio signal. The processing system provides, as input to a neural network, data corresponding to the audio signal and the context information, and generates a transcription for the utterance based on at least an output of the neural network.
-
Citations
20 Claims
-
1. A computer-implemented method comprising:
-
receiving an audio signal encoding a portion of an utterance; receiving context information associated with the utterance, wherein the context information is not derived from the audio signal or any other audio signal; providing, as input to a neural network, data corresponding to the audio signal and the context information; and generating a transcription for the utterance based on at least an output of the neural network. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving an audio signal encoding a portion of an utterance; receiving context information associated with the utterance, wherein the context information is not derived from the audio signal or any other audio signal; providing, as input to a neural network, data corresponding to the audio signal and the context information; and generating a transcription for the utterance based on at least an output of the neural network. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving an audio signal encoding a portion of an utterance; receiving context information associated with the utterance, wherein the context information is not derived from the audio signal or any other audio signal; providing, as input to a neural network, data corresponding to the audio signal and the context information; and generating a transcription for the utterance based on at least an output of the neural network. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification