Hotword detection on multiple devices
First Claim
1. A computer-implemented method comprising:
- receiving, by a computing device that is configured to process voice commands that are preceded by a predefined hotword, first audio data of an utterance of a voice command that is preceded by the predefined hotword;
receiving, by the computing device, second audio data;
determining, by the computing device, that the second audio data includes a frequency pattern indicating that the computing device is to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword;
in response to determining that the second audio data includes the frequency pattern indicating that the computing device is to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, generating, by the computing device, a command for the computing device to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword; and
in response to the command to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, (i) placing the computing device into a sleep mode, (ii) bypassing, by the computing device, further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, (iii) bypassing, by the computing device, emitting third audio data that includes the frequency pattern, and (iv) bypassing, by the computing device, outputting a visual indication that the computing device is processing the first audio data.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio data that corresponds to an utterance. The actions further include determining a likelihood that the utterance includes a hotword. The actions further include determining a loudness score for the audio data. The actions further include based on the loudness score, determining an amount of delay time. The actions further include, after the amount of delay time has elapsed, transmitting a signal that indicates that the computing device will initiate speech recognition processing on the audio data.
48 Citations
20 Claims
-
1. A computer-implemented method comprising:
-
receiving, by a computing device that is configured to process voice commands that are preceded by a predefined hotword, first audio data of an utterance of a voice command that is preceded by the predefined hotword; receiving, by the computing device, second audio data; determining, by the computing device, that the second audio data includes a frequency pattern indicating that the computing device is to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword; in response to determining that the second audio data includes the frequency pattern indicating that the computing device is to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, generating, by the computing device, a command for the computing device to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword; and in response to the command to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, (i) placing the computing device into a sleep mode, (ii) bypassing, by the computing device, further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, (iii) bypassing, by the computing device, emitting third audio data that includes the frequency pattern, and (iv) bypassing, by the computing device, outputting a visual indication that the computing device is processing the first audio data. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
one or more computers; and one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving, by a computing device that is configured to process voice commands that are preceded by a predefined hotword, first audio data of an utterance of a voice command that is preceded by the predefined hotword; receiving, by the computing device, second audio data; determining, by the computing device, that the second audio data includes a frequency pattern indicating that the computing device is to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword; in response to determining that the second audio data includes the frequency pattern indicating that the computing device is to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, generating, by the computing device, a command for the computing device to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword; and in response to the command to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, (i) placing the computing device into a sleep mode, (ii) bypassing, by the computing device, further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, (iii) bypassing, by the computing device, emitting third audio data that includes the frequency pattern, and (iv) bypassing, by the computing device, outputting a visual indication that the computing device is processing the first audio data. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving, by a computing device that is configured to process voice commands that are preceded by a predefined hotword, first audio data of an utterance of a voice command that is preceded by the predefined hotword; receiving, by the computing device, second audio data; determining, by the computing device, that the second audio data includes a frequency pattern indicating that the computing device is to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword; in response to determining that the second audio data includes the frequency pattern indicating that the computing device is to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, generating, by the computing device, a command for the computing device to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword; and in response to the command to bypass further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, (i) placing the computing device into a sleep mode, (ii) bypassing, by the computing device, further processing of the first audio data of the utterance of the voice command that is preceded by the predefined hotword, (iii) bypassing, by the computing device, emitting third audio data that includes the frequency pattern, and (iv) bypassing, by the computing device, outputting a visual indication that the computing device is processing the first audio data. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification