Hotword detection on multiple devices
DCFirst Claim
1. A computer-implemented method comprising:
- receiving, by a computing device that is in a low power mode and that is configured to exit a low power mode upon detecting an utterance of a particular, predefined hotword using an on-device hotword detector, audio data that corresponds to an utterance of the particular, predefined hotword;
while the computing device remains in the low power mode, and in response to receiving the audio data that corresponds to the utterance of the particular, predefined hotword, transmitting, by the computing device and to another computing device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, an output of processing the audio data using the on-device hotword detector;
while the computing device remains in low power mode, receiving, by the computing device and from the other computing device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, an additional output of processing the audio data; and
after transmitting the output of processing the audio data using the on-device hotword detector and after receiving the additional output of processing the audio data from the other using device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, determining, by the computing device, to remain in the low power mode.
2 Assignments
Litigations
1 Petition
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a first computing device, audio data that corresponds to an utterance. The actions further include determining a first value corresponding to a likelihood that the utterance includes a hotword. The actions further include receiving a second value corresponding to a likelihood that the utterance includes the hotword, the second value being determined by a second computing device. The actions further include comparing the first value and the second value. The actions further include based on comparing the first value to the second value, initiating speech recognition processing on the audio data.
111 Citations
21 Claims
-
1. A computer-implemented method comprising:
-
receiving, by a computing device that is in a low power mode and that is configured to exit a low power mode upon detecting an utterance of a particular, predefined hotword using an on-device hotword detector, audio data that corresponds to an utterance of the particular, predefined hotword; while the computing device remains in the low power mode, and in response to receiving the audio data that corresponds to the utterance of the particular, predefined hotword, transmitting, by the computing device and to another computing device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, an output of processing the audio data using the on-device hotword detector; while the computing device remains in low power mode, receiving, by the computing device and from the other computing device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, an additional output of processing the audio data; and after transmitting the output of processing the audio data using the on-device hotword detector and after receiving the additional output of processing the audio data from the other using device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, determining, by the computing device, to remain in the low power mode. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving, by a computing device that is in a low power mode and that is configured to exit a low power mode upon detecting an utterance of a particular, predefined hotword using an on-device hotword detector, audio data that corresponds to an utterance of a particular, predefined hotword; while the computing device remains in the low power mode, and in response to receiving the audio data that corresponds to the utterance of the particular, predefined hotword, transmitting, by the computing device and to another computing device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, an output of processing the audio data using the on-device hotword detector; while the computing device remains in low power mode, receiving, by the computing device and from the other computing device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, an additional output of processing the audio data; and after transmitting the output of processing the audio data using the on-device hotword detector and after receiving the additional output of processing the audio data from the other using device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, determining, by the computing device, to remain in the low power mode. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
16. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving, by a computing device that is in a low power mode and that is configured to exit a low power mode upon detecting an utterance of a particular, predefined hotword using an on-device hotword detector, audio data that corresponds to an utterance of a particular, predefined hotword; while the computing device remains in the low power mode, and in response to receiving the audio data that corresponds to the utterance of the particular, predefined hotword, transmitting, by the computing device and to another computing device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, an output of processing the audio data using the on-device hotword detector; while the computing device remains in low power mode, receiving, by the computing device and from the other computing device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, an additional output of processing the audio data; and after transmitting the output of processing the audio data using the on-device hotword detector and after receiving the additional output of processing the audio data from the other using device that is configured to exit a low power mode upon detecting an utterance of the particular, predefined hotword, determining, by the computing device, to remain in the low power mode. - View Dependent Claims (17, 18, 19, 20, 21)
-
Specification