Hotword detection on multiple devices
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio data that corresponds to an utterance. The actions further include determining a likelihood that the utterance includes a hotword. The actions further include determining a loudness score for the audio data. The actions further include determining, based on the loudness score, an amount of delay time. The actions further include, after the amount of delay time has elapsed, transmitting a signal that indicates that the computing device will initiate speech recognition processing on the audio data.
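The sequence of actions in the abstract can be illustrated with a minimal Python sketch. Everything concrete in it is an assumption made for illustration and is not taken from the patent: the names `Device`, `delay_for_loudness`, and `elect_responder`, the 0.8 hotword threshold, the 500 ms maximum delay, and in particular the inverse linear mapping from loudness to delay (the abstract says only that the delay is determined from the loudness score).

```python
from dataclasses import dataclass
from typing import List, Optional

MAX_DELAY_MS = 500.0  # hypothetical upper bound on the wait before signalling


def delay_for_loudness(loudness: float, max_delay_ms: float = MAX_DELAY_MS) -> float:
    """Map a loudness score in [0, 1] to a delay in milliseconds.

    Assumption: louder audio yields a shorter delay, so the device that
    heard the speaker best tends to signal first.
    """
    clamped = min(max(loudness, 0.0), 1.0)
    return (1.0 - clamped) * max_delay_ms


@dataclass
class Device:
    name: str
    hotword_likelihood: float  # likelihood that the utterance includes the hotword
    loudness: float            # loudness score of the received audio, in [0, 1]


def elect_responder(devices: List[Device], hotword_threshold: float = 0.8) -> Optional[Device]:
    """Return the device whose delay elapses first, or None if none heard the hotword."""
    candidates = [d for d in devices if d.hotword_likelihood >= hotword_threshold]
    if not candidates:
        return None
    # The device with the shortest delay transmits its signal first; the others
    # are assumed to receive that signal before their own delay elapses.
    return min(candidates, key=lambda d: delay_for_loudness(d.loudness))


devices = [
    Device("phone", hotword_likelihood=0.95, loudness=0.9),
    Device("tablet", hotword_likelihood=0.90, loudness=0.4),
    Device("watch", hotword_likelihood=0.30, loudness=1.0),
]
print(elect_responder(devices).name)  # prints "phone"
```

In this sketch the watch is excluded because its hotword likelihood falls below the assumed threshold, and the phone is chosen over the tablet because its higher loudness score gives it the shorter delay.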
15 Claims
1. A computer-implemented method comprising:
receiving, by a mobile computing device that is (i) configured to process voice commands that are preceded by a predefined hotword, and (ii) is in proximity to another mobile computing device that is also configured to process voice commands that are preceded by the same, predefined hotword, an audio input representing an utterance by the speaker of a voice command that is preceded by the predefined hotword;
while receiving the audio input representing the utterance by the speaker of the voice command that is preceded by the predefined hotword, performing, by the mobile computing device, an operation;
after receiving the audio input representing the utterance by the speaker of the voice command that is preceded by the predefined hotword, receiving an ultrasonic signal from the other mobile computing device;
in response to receiving the ultrasonic signal from the other mobile computing device, (i) placing the mobile device into a sleep mode, (ii) bypassing, by the mobile computing device, further processing of the voice command, (iii) bypassing, by the mobile computing device, emitting an ultrasonic signal, and (iv) bypassing, by the mobile computing device, outputting a visual indication that the mobile computing device is processing the voice command; and
while receiving the ultrasonic signal from the other mobile computing device and bypassing processing of the voice command, continuing, by the mobile computing device, to perform the operation without interruption.
Dependent claims: 2, 3, 10, 11, 15
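The response of the receiving device recited in claim 1 can be pictured as a small state transition. The Python sketch below is illustrative only and is not the patented implementation: `DeviceState`, its field names, and `on_ultrasonic_signal` are hypothetical, and the ongoing operation is modelled as a single boolean flag.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeviceState:
    operation_running: bool = True         # operation already in progress (hypothetical flag)
    asleep: bool = False
    voice_processing_enabled: bool = True  # device heard the hotword and could process the command
    may_emit_ultrasound: bool = True       # device could announce that it will respond
    may_show_indicator: bool = True        # device could show a "processing" indication


def on_ultrasonic_signal(state: DeviceState) -> DeviceState:
    """Return the state after the other device's ultrasonic signal is received."""
    return DeviceState(
        operation_running=state.operation_running,  # the operation continues uninterrupted
        asleep=True,                                # (i) enter sleep mode
        voice_processing_enabled=False,             # (ii) bypass further processing of the command
        may_emit_ultrasound=False,                  # (iii) bypass emitting an ultrasonic signal
        may_show_indicator=False,                   # (iv) bypass the visual indication
    )


before = DeviceState()
after = on_ultrasonic_signal(before)
print(after.asleep, after.operation_running)  # prints "True True"
```

Returning a new immutable state rather than mutating the old one is a design choice of this sketch; it makes it easy to check that only the four recited responses change while the ongoing operation is left untouched.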
4. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising:
receiving, by a mobile computing device that is (i) configured to process voice commands that are preceded by a predefined hotword, and (ii) is in proximity to another mobile computing device that is also configured to process voice commands that are preceded by the same, predefined hotword, an audio input representing an utterance by the speaker of a voice command that is preceded by the predefined hotword;
while receiving the audio input representing the utterance by the speaker of the voice command that is preceded by the predefined hotword, performing, by the mobile computing device, an operation;
after receiving the audio input representing the utterance by the speaker of the voice command that is preceded by the predefined hotword, receiving an ultrasonic signal from the other mobile computing device;
in response to receiving the ultrasonic signal from the other mobile computing device, (i) placing the mobile device into a sleep mode, (ii) bypassing, by the mobile computing device, further processing of the voice command, (iii) bypassing, by the mobile computing device, emitting an ultrasonic signal, and (iv) bypassing, by the mobile computing device, outputting a visual indication that the mobile computing device is processing the voice command; and
while receiving the ultrasonic signal from the other mobile computing device and bypassing processing of the voice command, continuing, by the mobile computing device, to perform the operation without interruption.
Dependent claims: 5, 6, 12, 13
7. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
receiving, by a mobile computing device that is (i) configured to process voice commands that are preceded by a predefined hotword, and (ii) is in proximity to another mobile computing device that is also configured to process voice commands that are preceded by the same, predefined hotword, an audio input representing an utterance by the speaker of a voice command that is preceded by the predefined hotword;
while receiving the audio input representing the utterance by the speaker of the voice command that is preceded by the predefined hotword, performing, by the mobile computing device, an operation;
after receiving the audio input representing the utterance by the speaker of the voice command that is preceded by the predefined hotword, receiving an ultrasonic signal from the other mobile computing device;
in response to receiving the ultrasonic signal from the other mobile computing device, (i) placing the mobile device into a sleep mode, (ii) bypassing, by the mobile computing device, further processing of the voice command, (iii) bypassing, by the mobile computing device, emitting an ultrasonic signal, and (iv) bypassing, by the mobile computing device, outputting a visual indication that the mobile computing device is processing the voice command; and
while receiving the ultrasonic signal from the other mobile computing device and bypassing processing of the voice command, continuing, by the mobile computing device, to perform the operation without interruption.
Dependent claims: 8, 9, 14
Specification