Multi-layer keyword detection for distributed voice communications
First Claim
1. A first device, comprising:
- at least one processor; and
at least one memory including instructions that, when executed by the at least one processor, cause the first device to;
receive, from a second device, first audio data originating from a third device;
determine, using a first detector, that the first audio data includes a first representation of a keyword;
disable a second detector after determining the first audio data includes the first representation, the second detector configured to determine second audio data includes a second representation of the keyword; and
after disabling the second detector, cause audio, corresponding to the first audio data, to be output.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for temporarily disabling keyword detection to avoid detection of machine-generated keywords. A local device may operate two keyword detectors. The first keyword detector operates on input audio data received by a microphone to capture keywords uttered by a user. In these instances, the keyword may be detected by the first detector and the audio data may be transmitted to a remote device for processing. The remote device may generate output audio data to be sent to the local device. The local device may process the output audio data to determine that it also includes the keyword. The device may then disable the first keyword detector while the output audio data is played back by an audio speaker of the local device. Thus the local device may avoid detection of a keyword originating from the output audio. The first keyword detector may be reactivated after a time interval during which the keyword might be detectable in the output audio.
106 Citations
18 Claims
-
1. A first device, comprising:
-
at least one processor; and at least one memory including instructions that, when executed by the at least one processor, cause the first device to; receive, from a second device, first audio data originating from a third device; determine, using a first detector, that the first audio data includes a first representation of a keyword; disable a second detector after determining the first audio data includes the first representation, the second detector configured to determine second audio data includes a second representation of the keyword; and after disabling the second detector, cause audio, corresponding to the first audio data, to be output. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method performed by a first device, comprising:
-
receiving, from a second device, first audio data originating from a third device; determining, using a first detector, that the first audio data includes a first representation of a keyword; disabling a second detector after determining the first audio data includes the first representation, the second detector configured to determine second audio data includes a second representation of the keyword; and after disabling the second detector, causing audio, corresponding to the first audio data, to be output. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification