Speech recognition power management
First Claim
1. A system comprising:
an audio input module;
an audio detection module in communication with the audio input module;
a speech detection module in communication with the audio detection module;
a wakeword recognition module in communication with the speech detection module; and
a network interface module in communication with the wakeword recognition module, wherein:
the audio detection module is configured to:
receive audio input from the audio input module;
determine a volume of at least a portion of the audio input;
cause the audio input module to increase a sampling rate of the audio input based at least in part on the volume exceeding a threshold; and
cause activation of the speech detection module based at least in part on the volume exceeding the threshold;
the speech detection module is configured to determine a first score indicating a likelihood that the audio input comprises speech and cause activation of the wakeword recognition module based at least in part on the score; and
the wakeword recognition module is configured to:
determine a second score indicating a likelihood that the audio input comprises a wakeword; and
cause activation of a network interface module based on the second score by providing power to the network interface module; and
the network interface module is configured to transmit at least a portion of the obtained audio input to a remote computing device.
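The staged activation recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the class name `StagedWakePipeline`, the use of RMS amplitude as "volume", the threshold values, the 8 kHz and 16 kHz sampling rates, and the injected scorer callables are all assumptions made for illustration.

```python
import math
from typing import Callable, List, Sequence

Scorer = Callable[[Sequence[float]], float]


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square amplitude, an assumed stand-in for 'volume'."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


class StagedWakePipeline:
    """Hypothetical model of the cascade: each stage wakes the next one."""

    def __init__(
        self,
        speech_scorer: Scorer,      # hypothetical: returns the first score
        wakeword_scorer: Scorer,    # hypothetical: returns the second score
        volume_threshold: float = 0.1,
        speech_threshold: float = 0.5,
        wakeword_threshold: float = 0.8,
        low_rate_hz: int = 8000,
        high_rate_hz: int = 16000,
    ) -> None:
        self.speech_scorer = speech_scorer
        self.wakeword_scorer = wakeword_scorer
        self.volume_threshold = volume_threshold
        self.speech_threshold = speech_threshold
        self.wakeword_threshold = wakeword_threshold
        self.high_rate_hz = high_rate_hz
        self.sampling_rate_hz = low_rate_hz   # start at the low rate
        self.network_powered = False          # network interface starts unpowered
        self.transmitted: List[List[float]] = []

    def process(self, audio: Sequence[float]) -> str:
        # Audio detection: compare volume against a threshold.
        if rms(audio) <= self.volume_threshold:
            return "idle"
        # Volume exceeded the threshold: raise the sampling rate and
        # hand the audio to the speech detection stage.
        self.sampling_rate_hz = self.high_rate_hz
        if self.speech_scorer(audio) < self.speech_threshold:
            return "no_speech"
        # Speech is likely: run the wakeword recognition stage.
        if self.wakeword_scorer(audio) < self.wakeword_threshold:
            return "no_wakeword"
        # Wakeword is likely: power the network interface and transmit.
        self.network_powered = True
        self.transmitted.append(list(audio))
        return "transmitted"
```

In this sketch the more expensive stages run only after a cheaper stage has passed, so quiet input never reaches the scorers and the network interface stays unpowered until the wakeword score clears its threshold.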
Abstract
Power consumption for a computing device may be managed by one or more keywords. For example, if an audio input obtained by the computing device includes a keyword, a network interface module and/or an application processing module of the computing device may be activated. The audio input may then be transmitted via the network interface module to a remote computing device, such as a speech recognition server. Alternately, the computing device may be provided with a speech recognition engine configured to process the audio input for on-device speech recognition.
34 Claims
5. A computer-implemented method of operating a first computing device, the method comprising:
receiving an audio input;
determining one or more values from the audio input, wherein the one or more values comprise at least one of:
a first value indicating an energy level of the audio input; or
a second value indicating a likelihood that the audio input comprises speech;
increasing a sampling rate of the audio input, from a first lower sampling rate to a second higher sampling rate, based at least in part on the one or more values;
activating a first module of the first computing device based at least in part on the one or more values;
performing an operation, by the first module, wherein the operation comprises at least one of:
determining that the audio input comprises a wakeword and causing activation of a network interface module in response to determining that the audio input comprises a wakeword, wherein causing activation of the network interface module comprises providing power to the network interface module;
performing speech recognition on at least a portion of the audio input to obtain speech recognition results; or
causing transmission of at least a portion of the audio input to a second computing device.
Dependent claims: 6-19.
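As a rough illustration of the "one or more values" in claim 5, the sketch below computes an energy level for a frame of samples and switches to the higher sampling rate when either value clears a floor. The decibel formulation, the function names `energy_dbfs` and `next_sampling_rate`, the floors, and the two sampling rates are assumptions; the claim itself does not specify how the values are computed or compared.

```python
import math
from typing import Optional, Sequence

LOW_RATE_HZ = 8_000    # assumed "first lower sampling rate"
HIGH_RATE_HZ = 16_000  # assumed "second higher sampling rate"


def energy_dbfs(samples: Sequence[float]) -> float:
    """Mean-square energy in dB relative to full scale (samples in [-1, 1])."""
    if not samples:
        return float("-inf")
    mean_square = sum(s * s for s in samples) / len(samples)
    if mean_square == 0.0:
        return float("-inf")  # digital silence
    return 10.0 * math.log10(mean_square)


def next_sampling_rate(
    current_hz: int,
    energy_db: Optional[float] = None,
    speech_likelihood: Optional[float] = None,
    energy_floor_db: float = -40.0,   # assumed threshold
    speech_floor: float = 0.5,        # assumed threshold
) -> int:
    """Return the higher rate if at least one supplied value clears its floor."""
    energetic = energy_db is not None and energy_db > energy_floor_db
    speech_like = speech_likelihood is not None and speech_likelihood > speech_floor
    return HIGH_RATE_HZ if (energetic or speech_like) else current_hz
```

Either argument may be omitted, mirroring the "at least one of" wording: a caller that only measures energy passes `energy_db`, and one that only has a speech score passes `speech_likelihood`.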
20. A device comprising:
a first processor configured to:
determine one or more values, wherein the one or more values comprise at least one of a first value indicating an energy level of an audio input or a second value indicating a likelihood that the audio input comprises speech; and
cause an increase in a sampling rate of the audio input, from a first lower sampling rate to a second higher sampling rate, based at least in part on the one or more values;
cause activation of a second processor based at least in part on the one or more values;
the second processor configured to perform an operation, wherein the operation comprises at least one of:
determining that the audio input comprises a wakeword and causing activation of a network interface module in response to determining that the audio input comprises a wakeword, wherein causing activation of the network interface module comprises providing power to the network interface module;
performing speech recognition on at least a portion of the audio input to obtain speech recognition results; or
causing transmission of at least a portion of the audio input to a second device.
Dependent claims: 21-25.
26. A system comprising:
an audio input module configured to obtain an audio input;
a first module in communication with the audio input module;
a second module in communication with the first module; and
a network interface module in communication with the first module;
wherein the first module is configured to:
determine one or more values based at least in part on the audio input, wherein the one or more values comprises at least one of:
a first value indicating an energy level of the audio input; or
a second value indicating a likelihood that the audio input comprises data representing speech;
cause the audio input module to increase a sampling rate of the audio input, from a first lower sampling rate to a second higher sampling rate, based at least in part on the one or more values;
cause activation of the network interface module based on the one or more values by providing power to the network interface module; and
cause activation of the second module based at least in part on the one or more values; and
wherein the second module is configured to:
determine that the audio input likely comprises data representing a wakeword; and
cause speech recognition to be performed on at least a portion of the audio input.
Dependent claims: 27-34.
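Claim 26 differs from claim 1 in that the first module powers the network interface module itself, based on the one or more values, rather than leaving that step to wakeword recognition. The sketch below models only that ordering. `PowerRail`, `first_module`, `second_module`, the component names, and the `recognize` callback are hypothetical and do not come from the patent.

```python
from typing import Callable, List, Optional, Set


class PowerRail:
    """Hypothetical record of which components have been given power."""

    def __init__(self) -> None:
        self._powered: Set[str] = set()
        self.log: List[str] = []  # order in which components were powered

    def power_on(self, name: str) -> None:
        if name not in self._powered:
            self._powered.add(name)
            self.log.append(name)

    def is_on(self, name: str) -> bool:
        return name in self._powered


def first_module(values_trigger: bool, rail: PowerRail) -> None:
    """If the measured values warrant it, power both downstream components."""
    if values_trigger:
        rail.power_on("network_interface")
        rail.power_on("second_module")


def second_module(
    rail: PowerRail,
    wakeword_likely: bool,
    recognize: Callable[[], str],
) -> Optional[str]:
    """Cause speech recognition only when active and a wakeword is likely."""
    if rail.is_on("second_module") and wakeword_likely:
        return recognize()
    return None
```

A short usage example under the same assumptions:

```python
rail = PowerRail()
first_module(values_trigger=True, rail=rail)
result = second_module(rail, wakeword_likely=True, recognize=lambda: "recognized")
```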
Specification