Hotword detection on multiple devices
First Claim
1. A computer-implemented method comprising:
receiving, by a first computing device that is configured to respond to a particular, predefined hotword and from a second computing device that is in a vicinity of the first computing device, data indicating that the second computing device is configured to respond to the particular, predefined hotword;
transmitting, to the second computing device and by the first computing device, data indicating that the first computing device is configured to respond to the particular, predefined hotword;
receiving, by the first computing device, audio data that corresponds to an utterance;
determining that the utterance likely includes a particular, predefined hotword;
in response to determining that the utterance likely includes the particular, predefined hotword, transmitting, to a server, (i) data indicating that the first computing device likely received the particular, predefined hotword, and (ii) data identifying the first computing device;
receiving, from the server, an instruction to suppress speech recognition processing on the audio data; and
in response to receiving the instruction to suppress speech recognition processing on the audio data, suppressing, by the first computing device, processing of the audio data using the automated speech recognizer.
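The device-side steps recited above can be sketched roughly as follows. This is a minimal illustration and not the patented implementation: the `Device` class, the message dictionaries, the numeric `hotword_score` with its 0.5 threshold, the `"ok computer"` hotword, and the `stub_server` callable are all hypothetical names introduced here. The claim itself only recites the suppress path; the "recognizing" outcome is included so the example has a contrasting branch.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Callable, Set


class Instruction(Enum):
    COMMENCE = "commence"
    SUPPRESS = "suppress"


@dataclass
class Device:
    device_id: str
    hotword: str
    # IDs of nearby devices known to respond to the same hotword.
    peers: Set[str] = field(default_factory=set)

    def announcement(self) -> dict:
        # Data indicating that this device responds to the hotword.
        return {"device_id": self.device_id, "hotword": self.hotword}

    def on_peer_announcement(self, message: dict) -> None:
        # Record a nearby device only if it shares this device's hotword.
        if message["hotword"] == self.hotword:
            self.peers.add(message["device_id"])

    def on_audio(
        self,
        hotword_score: float,
        ask_server: Callable[[dict], Instruction],
        threshold: float = 0.5,  # assumed cutoff for "likely includes"
    ) -> str:
        # hotword_score stands in for the output of a real hotword detector.
        if hotword_score < threshold:
            return "no_hotword"
        # Report (i) the likely hotword and (ii) the device identity.
        report = {"hotword_detected": True, "device_id": self.device_id}
        if ask_server(report) is Instruction.SUPPRESS:
            # Skip the automated speech recognizer for this audio.
            return "suppressed"
        return "recognizing"


def stub_server(report: dict) -> Instruction:
    # Hypothetical server that always suppresses the phone.
    if report["device_id"] == "phone":
        return Instruction.SUPPRESS
    return Instruction.COMMENCE


phone = Device("phone", "ok computer")
tablet = Device("tablet", "ok computer")
phone.on_peer_announcement(tablet.announcement())
tablet.on_peer_announcement(phone.announcement())
```

With this setup, `phone.on_audio(0.9, stub_server)` takes the suppress branch while `tablet.on_audio(0.9, stub_server)` proceeds to recognition, so only one of the two nearby devices would respond to the same utterance.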
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving audio data that corresponds to an utterance. The actions further include determining that the utterance likely includes a particular, predefined hotword. The actions further include transmitting (i) data indicating that the computing device likely received the particular, predefined hotword, (ii) data identifying the computing device, and (iii) data identifying a group of nearby computing devices that includes the computing device. The actions further include receiving an instruction to commence speech recognition processing on the audio data. The actions further include, in response to receiving the instruction to commence speech recognition processing on the audio data, processing at least a portion of the audio data using an automated speech recognizer on the computing device.
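The abstract does not say how the receiving side decides which device in a group should commence recognition. As one possible illustration only, the sketch below assumes a first-report-wins policy: the first device in a group to report the hotword is told to commence, and every later report from that group is told to suppress. The `ArbitrationServer` class, its method names, and the policy itself are assumptions made for this example and are not taken from the source.

```python
class ArbitrationServer:
    """Hypothetical arbiter: one responding device per group per utterance."""

    def __init__(self):
        # Maps group ID -> ID of the device chosen for the current utterance.
        self._winner_by_group = {}

    def report_hotword(self, device_id: str, group_id: str) -> str:
        # Assumed policy: the first reporter in a group wins.
        winner = self._winner_by_group.setdefault(group_id, device_id)
        return "commence" if winner == device_id else "suppress"

    def end_utterance(self, group_id: str) -> None:
        # Clear the choice so the next utterance is arbitrated afresh.
        self._winner_by_group.pop(group_id, None)


server = ArbitrationServer()
first = server.report_hotword("tablet", "living-room")
second = server.report_hotword("phone", "living-room")
other_group = server.report_hotword("watch", "bedroom")
```

Here the tablet reports first and is told to commence, the phone in the same group is told to suppress, and the watch, being in a different group, is arbitrated independently. A real system might instead compare detection confidence or loudness across devices; the source text here does not specify.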
20 Claims
1. A computer-implemented method comprising:
receiving, by a first computing device that is configured to respond to a particular, predefined hotword and from a second computing device that is in a vicinity of the first computing device, data indicating that the second computing device is configured to respond to the particular, predefined hotword;
transmitting, to the second computing device and by the first computing device, data indicating that the first computing device is configured to respond to the particular, predefined hotword;
receiving, by the first computing device, audio data that corresponds to an utterance;
determining that the utterance likely includes a particular, predefined hotword;
in response to determining that the utterance likely includes the particular, predefined hotword, transmitting, to a server, (i) data indicating that the first computing device likely received the particular, predefined hotword, and (ii) data identifying the first computing device;
receiving, from the server, an instruction to suppress speech recognition processing on the audio data; and
in response to receiving the instruction to suppress speech recognition processing on the audio data, suppressing, by the first computing device, processing of the audio data using the automated speech recognizer.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising:
receiving, by a first computing device that is configured to respond to a particular, predefined hotword and from a second computing device that is in a vicinity of the first computing device, data indicating that the second computing device is configured to respond to the particular, predefined hotword;
transmitting, to the second computing device and by the first computing device, data indicating that the first computing device is configured to respond to the particular, predefined hotword;
receiving, by the first computing device, audio data that corresponds to an utterance;
determining that the utterance likely includes a particular, predefined hotword;
in response to determining that the utterance likely includes the particular, predefined hotword, transmitting, to a server, (i) data indicating that the first computing device likely received the particular, predefined hotword, and (ii) data identifying the first computing device;
receiving, from the server, an instruction to suppress speech recognition processing on the audio data; and
in response to receiving the instruction to suppress speech recognition processing on the audio data, suppressing, by the first computing device, processing of the audio data using the automated speech recognizer.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
19. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
receiving, by a first computing device that is configured to respond to a particular, predefined hotword and from a second computing device that is in a vicinity of the first computing device, data indicating that the second computing device is configured to respond to the particular, predefined hotword;
transmitting, to the second computing device and by the first computing device, data indicating that the first computing device is configured to respond to the particular, predefined hotword;
receiving, by the first computing device, audio data that corresponds to an utterance;
determining that the utterance likely includes a particular, predefined hotword;
in response to determining that the utterance likely includes the particular, predefined hotword, transmitting, to a server, (i) data indicating that the first computing device likely received the particular, predefined hotword, and (ii) data identifying the first computing device;
receiving, from the server, an instruction to suppress speech recognition processing on the audio data; and
in response to receiving the instruction to suppress speech recognition processing on the audio data, suppressing, by the first computing device, processing of the audio data using the automated speech recognizer.
Dependent claims: 20
Specification