Key phrase detection with audio watermarking
First Claim
1. A playback device comprising a speaker and one or more storage devices on which are stored instructions that are operable, when executed by the playback device, to cause the playback device to perform operations comprising:
- receiving an audio data stream;
determining, before the audio data stream is output by the playback device, whether a portion of the audio data stream encodes a particular key phrase by analyzing the portion using an automated speech recognizer;
in response to determining that the portion of the audio data stream encodes the particular key phrase, modifying the audio data stream to include an audio watermark, where the audio watermark includes data specifying that a key phrase is encoded in the portion of the audio data stream; and
outputting the modified audio data stream through the speaker of the playback device.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using audio watermarks with key phrases. One of the methods includes receiving, by a playback device, an audio data stream; determining, before the audio data stream is output by the playback device, whether a portion of the audio data stream encodes a particular key phrase by analyzing the portion using an automated speech recognizer; in response to determining that the portion of the audio data stream encodes the particular key phrase, modifying the audio data stream to include an audio watermark; and providing the modified audio data stream for output.
-
Citations
19 Claims
-
1. A playback device comprising a speaker and one or more storage devices on which are stored instructions that are operable, when executed by the playback device, to cause the playback device to perform operations comprising:
-
receiving an audio data stream; determining, before the audio data stream is output by the playback device, whether a portion of the audio data stream encodes a particular key phrase by analyzing the portion using an automated speech recognizer; in response to determining that the portion of the audio data stream encodes the particular key phrase, modifying the audio data stream to include an audio watermark, where the audio watermark includes data specifying that a key phrase is encoded in the portion of the audio data stream; and outputting the modified audio data stream through the speaker of the playback device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A non-transitory computer storage medium encoded with instructions that, when executed by one or more computers, cause the one or more computers to perform operations comprising:
-
receiving an audio data stream; determining, before the audio data stream is output by the one or more computers, whether a portion of the audio data stream encodes a particular key phrase by analyzing the portion using an automated speech recognizer; in response to determining that the portion of the audio data stream does not encode the particular key phrase, determining to skip modifying the audio data stream to include an audio watermark based on the portion of the audio data stream that does not encode the particular key phrase, where the audio watermark includes data specifying that a key phrase is encoded in the portion of the audio data stream; and after determining to skip modifying the audio data stream to include the audio watermark based on the portion of the audio data stream that does not encode the particular key phrase, outputting the audio data stream through a speaker of a playback device. - View Dependent Claims (15, 16)
-
-
17. A computer-implemented method comprising:
-
receiving, by a playback device, an audio data stream; determining, before the audio data stream is output by the playback device, whether a portion of the audio data stream encodes a particular key phrase by analyzing the portion using an automated speech recognizer; in response to determining that the portion of the audio data stream encodes the particular key phrase, modifying the audio data stream to include an audio watermark, where the audio watermark includes data specifying that a key phrase is encoded in the portion of the audio data stream; and outputting, by the playback device, the modified audio data stream through a speaker of the playback device. - View Dependent Claims (18, 19)
-
Specification