Speech recognition power management
First Claim
1. A system comprising:
a network interface component;
an audio input component configured to receive an audio input; and
one or more processors configured to:
determine that an energy level of the audio input satisfies a threshold;
determine, in response to determining that the energy level satisfies the threshold, that the audio input likely comprises data representing an utterance;
determine, in response to determining that the audio input likely comprises data representing the utterance, that the audio input likely comprises data representing a wakeword indicative of device-directed speech; and
cause transmission of the audio input by the network interface component in response to determining that the audio input likely comprises data representing the wakeword;
wherein the network interface component is configured to:
transmit the audio input to a remote computing system;
receive speech recognition results from the remote computing system;
receive confirmation data from the remote computing system, wherein the confirmation data indicates that the audio input likely comprises data representing the wakeword;
transmit a subsequent audio input to the remote computing system based at least partly on receiving the confirmation data; and
receive subsequent speech recognition results from the remote computing system.
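The three determinations in claim 1 form a cascade: each check runs only if the previous one passed. The following is a minimal Python sketch of that gating, offered as an illustration rather than the patented implementation. The names rms_energy, should_transmit, looks_like_speech and looks_like_wakeword are hypothetical, root-mean-square is assumed as the energy measure, and "satisfies a threshold" is assumed to mean "meets or exceeds".

```python
import math
from typing import Callable, Sequence


def rms_energy(samples: Sequence[float]) -> float:
    """Root-mean-square energy of one audio frame (assumed energy measure)."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def should_transmit(
    samples: Sequence[float],
    energy_threshold: float,
    looks_like_speech: Callable[[Sequence[float]], bool],
    looks_like_wakeword: Callable[[Sequence[float]], bool],
) -> bool:
    """Run the three checks in order, stopping at the first one that fails."""
    # Stage 1: energy check. Nothing else runs on quiet frames.
    if rms_energy(samples) < energy_threshold:
        return False
    # Stage 2: does the frame likely contain an utterance?
    if not looks_like_speech(samples):
        return False
    # Stage 3: does the utterance likely contain the wakeword?
    return looks_like_wakeword(samples)
```

In this sketch the speech and wakeword detectors are passed in as callables, so a quiet frame returns before either detector is invoked. Ordering the checks from cheapest to most expensive is one plausible way to obtain the power savings the title refers to.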
Abstract
Power consumption for a computing device may be managed using one or more keywords. For example, if an audio input obtained by the computing device includes a keyword, a network interface module and/or an application processing module of the computing device may be activated. The audio input may then be transmitted via the network interface module to a remote computing device, such as a speech recognition server. Alternatively, the computing device may be provided with a speech recognition engine configured to process the audio input for on-device speech recognition.
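The activation behavior described in the abstract can be pictured as a small state change on the device. The sketch below is an assumption-laden illustration in Python: the Device class, its field names and the on_audio method are hypothetical, keyword detection is reduced to a boolean supplied by the caller, and both modules are activated together although the abstract allows either or both.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Device:
    """Hypothetical device whose power-hungry modules start inactive."""
    network_active: bool = False
    app_processor_active: bool = False
    sent: List[bytes] = field(default_factory=list)

    def on_audio(self, audio: bytes, keyword_detected: bool) -> None:
        # Without a keyword the modules are left in their current state.
        if not keyword_detected:
            return
        # Keyword found: activate the modules, then transmit the audio.
        self.network_active = True
        self.app_processor_active = True
        self.sent.append(audio)
```

For example, calling on_audio(b"noise", False) on a fresh Device leaves both flags False and sends nothing, while on_audio(b"hello", True) activates both modules and records the audio as transmitted.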
20 Claims
7. A computer-implemented method comprising:
under control of a computing system configured to execute specific computer-executable instructions,
receiving an audio input;
determining that an energy level of the audio input satisfies a threshold;
in response to determining that the energy level satisfies the threshold, determining that the audio input likely comprises data representing an utterance;
in response to determining that the audio input likely comprises data representing the utterance, determining that the audio input likely comprises data representing a wakeword indicative of device-directed speech;
in response to determining that the audio input likely comprises data representing the wakeword, transmitting the audio input to a remote computing system;
receiving speech recognition results from the remote computing system;
receiving confirmation data from the remote computing system, wherein the confirmation data indicates that the audio input likely comprises data representing the wakeword;
transmitting a subsequent audio input to the remote computing system based at least partly on receiving the confirmation data; and
receiving subsequent speech recognition results from the remote computing system.
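The last four steps of the method describe a confirmation handshake: the remote system confirms the wakeword, and only then does the device send further audio. A minimal Python sketch of that client-side flow follows. RemoteSystem, recognize and run_session are hypothetical names, and bundling the recognition result with the confirmation flag in a single response is an assumption made for brevity; the claim does not say how the two are delivered.

```python
from typing import Iterable, List, Protocol, Tuple


class RemoteSystem(Protocol):
    """Hypothetical remote interface: returns (results, wakeword_confirmed)."""
    def recognize(self, audio: bytes) -> Tuple[str, bool]: ...


def run_session(
    remote: RemoteSystem,
    first_audio: bytes,
    subsequent_audio: Iterable[bytes],
) -> List[str]:
    """Send the first audio input; send the rest only after confirmation."""
    results: List[str] = []
    text, confirmed = remote.recognize(first_audio)
    results.append(text)
    if not confirmed:
        # No confirmation received: nothing further is transmitted.
        return results
    for audio in subsequent_audio:
        text, _ = remote.recognize(audio)
        results.append(text)
    return results
```

Under these assumptions, an unconfirmed wakeword costs the device exactly one transmission, since the loop over subsequent audio is never entered.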
16. A system comprising:
an input component configured to receive audio input; and
one or more processors configured to at least:
determine a value representing at least one of an energy level of the audio input or a first likelihood that the audio input comprises data representing an utterance;
determine, based at least partly on the value, a second likelihood that the utterance comprises data representing a wakeword;
determine, based at least partly on the second likelihood, to cause transmission of the audio input to a computing system;
cause transmission of the audio input to the computing system;
receive speech recognition results generated by the computing system;
receive confirmation data from the computing system, wherein the confirmation data indicates that the audio input likely comprises data representing the wakeword;
transmit a subsequent audio input to the computing system based at least partly on receiving the confirmation data;
receive subsequent speech recognition results generated by the computing system.
Specification