DYNAMIC THRESHOLDS FOR ALWAYS LISTENING SPEECH TRIGGER
First Claim
1. A method for dynamically adjusting a speech trigger threshold, the method comprising:
at an electronic device having a processor and memory:
receiving audio input via a microphone;
sampling the received audio input;
determining a confidence level that the sampled audio input comprises a portion of a spoken trigger; and
in response to the confidence level exceeding a threshold, triggering a virtual assistant to receive a user command from the audio input;
wherein the threshold is dynamically adjusted in response to a perceived event.
Abstract
Systems and processes are disclosed for dynamically adjusting a speech trigger threshold, which can be used in triggering a virtual assistant. Audio input can be received via a microphone. The received audio input can be sampled, and a confidence level can be determined of whether the sampled audio input includes a portion of a spoken trigger. In response to the confidence level exceeding a threshold, a virtual assistant can be triggered to receive a user command from the audio input. The threshold can be dynamically adjusted in response to perceived events (e.g., events indicating a user may be more or less likely to initiate speech interactions, events indicating a trigger may be difficult to detect, events indicating a trigger was missed, etc.), thereby minimizing both missed triggers and false positive triggering events.
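The threshold adjustment described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the class `DynamicThreshold`, the event names in `EVENT_DELTAS`, and every numeric value below are hypothetical, chosen only to show a threshold that moves down when a trigger seems more likely or harder to detect and moves up when a trigger seems less likely.

```python
from dataclasses import dataclass, field


@dataclass
class DynamicThreshold:
    """Confidence threshold that shifts in response to perceived events.

    All default values are illustrative assumptions, not from the source.
    """
    base: float = 0.70     # threshold when no event has been perceived
    floor: float = 0.40    # lower bound, limits false positive triggers
    ceiling: float = 0.95  # upper bound, limits missed triggers
    value: float = field(init=False)

    def __post_init__(self) -> None:
        self.value = self.base

    def adjust(self, delta: float) -> float:
        """Shift the threshold by delta, clamped to [floor, ceiling]."""
        self.value = min(self.ceiling, max(self.floor, self.value + delta))
        return self.value

    def reset(self) -> float:
        """Return to the base threshold."""
        self.value = self.base
        return self.value


# Hypothetical mapping from perceived events to threshold shifts.
# Negative values make triggering easier; positive values make it harder.
EVENT_DELTAS = {
    "user_likely_to_speak": -0.15,
    "user_unlikely_to_speak": +0.15,
    "trigger_hard_to_detect": -0.10,
    "trigger_missed": -0.10,
}


def on_event(threshold: DynamicThreshold, event: str) -> float:
    """Apply the shift for a perceived event; unknown events change nothing."""
    return threshold.adjust(EVENT_DELTAS.get(event, 0.0))


def should_trigger(confidence: float, threshold: DynamicThreshold) -> bool:
    """Trigger the assistant only when confidence exceeds the threshold."""
    return confidence > threshold.value
```

In this sketch, a sampled input scored at 0.65 would not trigger the assistant at the base threshold, but would after a `"user_likely_to_speak"` event lowers the threshold. Clamping keeps repeated events from driving the threshold to a point where everything or nothing triggers.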
348 Citations
25 Claims
1. A method for dynamically adjusting a speech trigger threshold, the method comprising:
at an electronic device having a processor and memory:
receiving audio input via a microphone;
sampling the received audio input;
determining a confidence level that the sampled audio input comprises a portion of a spoken trigger; and
in response to the confidence level exceeding a threshold, triggering a virtual assistant to receive a user command from the audio input;
wherein the threshold is dynamically adjusted in response to a perceived event.
Dependent claims: 2-23.
24. A non-transitory computer-readable storage medium comprising computer-executable instructions for:
receiving audio input via a microphone;
sampling the received audio input;
determining a confidence level that the sampled audio input comprises a portion of a spoken trigger; and
in response to the confidence level exceeding a threshold, triggering a virtual assistant to receive a user command from the audio input;
wherein the threshold is dynamically adjusted in response to a perceived event.
25. A system comprising:
one or more processors;
memory; and
one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for:
receiving audio input via a microphone;
sampling the received audio input;
determining a confidence level that the sampled audio input comprises a portion of a spoken trigger; and
in response to the confidence level exceeding a threshold, triggering a virtual assistant to receive a user command from the audio input;
wherein the threshold is dynamically adjusted in response to a perceived event.
Specification