Detecting self-generated wake expressions
First Claim
1. An audio device comprising:
one or more processors;
a microphone;
an audio speaker configured to produce output audio; and
memory storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising:
receiving, from the microphone, an audio signal generated by the microphone to represent input audio received at the microphone;
determining a confidence level that the audio signal includes a predefined expression;
determining that the confidence level is greater than a predetermined threshold;
generating a parameter that indicates at least one of:
whether the output audio is currently being produced by the audio speaker, whether the output audio contains speech, whether the output audio contains the predefined expression, loudness of the output audio, loudness of the input audio, or an echo characteristic of the audio signal; and
determining, based at least in part on the parameter and the confidence level being greater than the predetermined threshold, that an occurrence of the predefined expression in the input audio is a result of the output audio produced by the audio speaker.
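The acts recited in claim 1 can be read as a two-stage decision: a confidence gate, followed by a parameter-based attribution. The following is a minimal sketch of that reading, not the patented implementation. The names `Parameter`, `is_self_generated`, and `echo_limit`, the 0.0 to 1.0 echo scale, and the rule used to combine the indications are all assumptions made for illustration; the claim only requires that the parameter indicate at least one of the listed items.

```python
from dataclasses import dataclass


@dataclass
class Parameter:
    """Hypothetical container for a subset of the indications listed in the claim."""
    output_active: bool          # output audio is currently being produced
    output_has_speech: bool      # output audio contains speech
    output_has_expression: bool  # output audio contains the predefined expression
    echo_strength: float         # assumed echo characteristic, 0.0 (none) to 1.0 (strong)


def is_self_generated(confidence: float, threshold: float, p: Parameter,
                      echo_limit: float = 0.5) -> bool:
    """Return True if a detected expression is attributed to the speaker output.

    The combination rule below is an illustrative assumption.
    """
    if confidence <= threshold:
        return False  # confidence gate not passed: no detection to attribute
    if not p.output_active:
        return False  # speaker is silent, so the output cannot be the source
    if p.output_has_expression:
        return True   # the device is known to be playing the expression
    # Otherwise fall back to acoustic evidence: speech in the output plus strong echo.
    return p.output_has_speech and p.echo_strength > echo_limit


playing = Parameter(output_active=True, output_has_speech=True,
                    output_has_expression=True, echo_strength=0.9)
silent = Parameter(output_active=False, output_has_speech=False,
                   output_has_expression=False, echo_strength=0.0)
print(is_self_generated(0.9, 0.7, playing))  # device is playing the expression
print(is_self_generated(0.9, 0.7, silent))   # speaker idle: attributed elsewhere
```

In this sketch a `False` result only means the occurrence is not attributed to the speaker; what the device does next is outside the quoted claim.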
2 Assignments
0 Petitions
Abstract
A speech-based audio device may be configured to detect a user-uttered wake expression. For example, the audio device may generate a parameter indicating whether output audio is currently being produced by an audio speaker, whether the output audio contains speech, whether the output audio contains a predefined expression, loudness of the output audio, loudness of input audio, and/or an echo characteristic. Based on the parameter, the audio device may determine whether an occurrence of the predefined expression in the input audio is a result of an utterance of the predefined expression by a user.
60 Citations
20 Claims
1. An audio device comprising:
one or more processors;
a microphone;
an audio speaker configured to produce output audio; and
memory storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising:
receiving, from the microphone, an audio signal generated by the microphone to represent input audio received at the microphone;
determining a confidence level that the audio signal includes a predefined expression;
determining that the confidence level is greater than a predetermined threshold;
generating a parameter that indicates at least one of:
whether the output audio is currently being produced by the audio speaker, whether the output audio contains speech, whether the output audio contains the predefined expression, loudness of the output audio, loudness of the input audio, or an echo characteristic of the audio signal; and
determining, based at least in part on the parameter and the confidence level being greater than the predetermined threshold, that an occurrence of the predefined expression in the input audio is a result of the output audio produced by the audio speaker.
Dependent claims: 2, 3, 4, 5
6. An audio device comprising:
one or more processors;
a microphone;
an audio speaker configured to produce output audio; and
memory storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising:
receiving, from the microphone, an audio signal generated by the microphone to represent an input audio received at the microphone;
generating a parameter that indicates at least one of:
whether the output audio is currently being produced by the audio speaker, whether the output audio contains speech, whether the output audio contains a predefined expression, loudness of the output audio, loudness of the input audio, or an echo characteristic of the audio signal; and
determining, based at least in part on the parameter, that an occurrence of the predefined expression in the input audio is a result of the predefined expression occurring in the output audio from the audio speaker.
Dependent claims: 7, 8, 9, 10, 11, 12
13. A method comprising:
receiving, by an audio device, an audio signal generated by a microphone and representing input audio received at the microphone;
generating, by the audio device, a parameter that indicates at least one of:
whether output audio is currently being produced by an audio speaker, whether the output audio contains speech, whether the output audio contains a predefined expression, loudness of the output audio, loudness of the input audio, or an echo characteristic of the audio signal; and
evaluating, by the audio device, the parameter to distinguish between utterance of the predefined expression by a user and production of the predefined expression by the audio speaker.
Dependent claims: 14, 15, 16, 17, 18, 19, 20
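Claim 13 lists "an echo characteristic of the audio signal" as one possible parameter but, in the text quoted here, does not say how it is computed. As one hedged illustration, the sketch below uses the peak normalized cross-correlation between the signal sent to the speaker and the microphone signal; this is a technique swapped in for illustration, not one taken from the patent. The names `echo_strength` and `classify_source`, and the `max_lag` and `limit` defaults, are hypothetical.

```python
import math
from typing import Sequence


def echo_strength(reference: Sequence[float], mic: Sequence[float], max_lag: int) -> float:
    """Peak absolute normalized cross-correlation of mic against the delayed reference.

    reference: samples sent to the speaker; mic: samples captured by the microphone.
    Returns a value in [0.0, 1.0]; higher means mic looks more like a delayed copy.
    """
    best = 0.0
    for lag in range(max_lag + 1):
        n = min(len(reference), len(mic) - lag)
        if n <= 0:
            break
        ref = reference[:n]
        seg = mic[lag:lag + n]
        dot = sum(r * m for r, m in zip(ref, seg))
        norm = math.sqrt(sum(r * r for r in ref) * sum(m * m for m in seg))
        if norm > 0.0:
            best = max(best, abs(dot) / norm)
    return best


def classify_source(reference: Sequence[float], mic: Sequence[float],
                    max_lag: int = 8, limit: float = 0.6) -> str:
    """Label the input as 'speaker' (echo of the output) or 'user' (assumed rule)."""
    return "speaker" if echo_strength(reference, mic, max_lag) > limit else "user"


reference = [0.0, 1.0, 0.0, -1.0] * 4
echoed = [0.0, 0.0] + [0.5 * x for x in reference]  # delayed, attenuated copy
unrelated = [1.0] * 16                              # nothing like the reference
print(classify_source(reference, echoed))
print(classify_source(reference, unrelated))
```

A real device would likely combine such a measure with the other listed indications rather than rely on a single fixed limit.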
Specification