Recorded media hotword trigger suppression
First Claim
1. A computer-implemented method comprising:
- receiving, by a microphone of a computing device that includes (i) the microphone, (ii) a hotword identifier, (iii) an audio watermark identifier, and (iv) an automated speech recognizer, audio corresponding to playback of an item of media content, wherein the automated speech recognizer of the computing device is configured to perform speech recognition on received audio that follows a predefined hotword;
determining, by the hotword identifier of the computing device, that the audio includes an utterance of the predefined hotword;
determining, by the audio watermark identifier of the computing device, that the audio includes an audio watermark;
analyzing, by the audio watermark identifier of the computing device, the audio watermark; and
based on analyzing the audio watermark, bypassing, by the automated speech recognizer of the computing device, performing speech recognition on a portion of the audio following the predefined hotword.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for suppressing hotword triggers when detecting a hotword in recorded media are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio corresponding to playback of an item of media content. The actions further include determining, by the computing device, that the audio includes an utterance of a predefined hotword and that the audio includes an audio watermark. The actions further include analyzing, by the computing device, the audio watermark. The actions further include based on analyzing the audio watermark, determining, by the computing device, whether to perform speech recognition on a portion of the audio following the predefined hotword.
105 Citations
17 Claims
-
1. A computer-implemented method comprising:
-
receiving, by a microphone of a computing device that includes (i) the microphone, (ii) a hotword identifier, (iii) an audio watermark identifier, and (iv) an automated speech recognizer, audio corresponding to playback of an item of media content, wherein the automated speech recognizer of the computing device is configured to perform speech recognition on received audio that follows a predefined hotword; determining, by the hotword identifier of the computing device, that the audio includes an utterance of the predefined hotword; determining, by the audio watermark identifier of the computing device, that the audio includes an audio watermark; analyzing, by the audio watermark identifier of the computing device, the audio watermark; and based on analyzing the audio watermark, bypassing, by the automated speech recognizer of the computing device, performing speech recognition on a portion of the audio following the predefined hotword. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
one or more computers; and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving, by a microphone of a computing device that includes (i) the microphone, (ii) a hotword identifier, (iii) an audio watermark identifier, and (iv) an automated speech recognizer, audio corresponding to playback of an item of media content, wherein the automated speech recognizer of the computing device is configured to perform speech recognition on received audio that follows a predefined hotword; determining, by the hotword identifier of the computing device, that the audio includes an utterance of the predefined hotword; determining, by the audio watermark identifier of the computing device, that the audio includes an audio watermark; analyzing, by the audio watermark identifier of the computing device, the audio watermark; and based on analyzing the audio watermark, bypassing, by the automated speech recognizer of the computing device, performing speech recognition on a portion of the audio following the predefined hotword. - View Dependent Claims (16)
-
-
17. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving, by a microphone of a computing device that includes (i) the microphone, (ii) a hotword identifier, (iii) an audio watermark identifier, and (iv) an automated speech recognizer, audio corresponding to playback of an item of media content, wherein the automated speech recognizer of the computing device is configured to perform speech recognition on received audio that follows a predefined hotword; determining, by the hotword identifier of the computing device, that the audio includes an utterance of the predefined hotword; determining, by the audio watermark identifier of the computing device, that the audio includes an audio watermark; analyzing, by the audio watermark identifier of the computing device, the audio watermark; and based on analyzing the audio watermark, bypassing, by the automated speech recognizer of the computing device, performing speech recognition on a portion of the audio following the predefined hotword.
-
Specification