Pre-wakeword speech processing
Abstract
A system for capturing and processing portions of a spoken utterance command that may occur before a wakeword. The system buffers incoming audio and indicates locations in the audio where the utterance changes, for example when a long pause is detected. When the system detects a wakeword within a particular utterance, the system determines the most recent utterance change location prior to the wakeword and sends the audio from that location to the end of the command utterance to a server for further speech processing.
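The selection step described in the abstract can be illustrated with a minimal sketch. It assumes the buffered audio has already been reduced to one speech/non-speech flag per frame (for example by a voice activity detector) and that a wakeword detector has reported the frame where the wakeword begins. The names `find_pauses`, `select_portion`, and `min_pause_frames` are hypothetical and do not come from the patent; the threshold of three frames is an arbitrary illustration of "a long pause".

```python
from typing import List, Optional, Tuple


def find_pauses(is_speech: List[bool], min_pause_frames: int) -> List[Tuple[int, int]]:
    """Return (start, end_exclusive) for every run of non-speech frames
    that is at least min_pause_frames long."""
    pauses = []
    run_start = None
    for i, speech in enumerate(is_speech):
        if not speech:
            if run_start is None:
                run_start = i  # a non-speech run begins here
        else:
            if run_start is not None and i - run_start >= min_pause_frames:
                pauses.append((run_start, i))
            run_start = None
    # Close a run that extends to the end of the buffer.
    if run_start is not None and len(is_speech) - run_start >= min_pause_frames:
        pauses.append((run_start, len(is_speech)))
    return pauses


def select_portion(
    is_speech: List[bool], wakeword_index: int, min_pause_frames: int = 3
) -> Optional[Tuple[int, int]]:
    """Pick the frames from the most recent pause before the wakeword
    through the first pause after it. Returns None if either pause is missing."""
    pauses = find_pauses(is_speech, min_pause_frames)
    before = [p for p in pauses if p[1] <= wakeword_index]
    after = [p for p in pauses if p[0] > wakeword_index]
    if not before or not after:
        return None
    return before[-1][0], after[0][1]


# Hypothetical buffer: pause, pre-wakeword speech, wakeword (frames 7-8),
# more speech, pause, then unrelated speech.
frames = [False] * 3 + [True] * 4 + [True] * 2 + [True] * 2 + [False] * 3 + [True]
portion = select_portion(frames, wakeword_index=7)
```

Here `portion` spans the leading pause, the speech before the wakeword, the wakeword itself, and the trailing pause, which is the range that would be sent on for speech processing. Whether the selected range should include or exclude the pause frames themselves is a design choice this sketch does not settle.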
223 Citations
17 Claims
1. A computer-implemented method, comprising:
receiving audio;
storing, in non-transitory memory, audio data representing the audio;
determining a first location in the audio data that includes a first amount of non-speech audio data;
determining a wakeword at a second location in the audio data, the audio data including non-wakeword speech between the first location and the second location;
determining a third location in the audio data that includes a second amount of non-speech audio data, the third location being after the second location in the audio data; and
selecting, for speech processing, a portion of the audio data starting with the first location and ending with the third location, the portion of the audio data comprising at least the non-wakeword speech.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A computing device, comprising:
at least one processor; and
at least one memory including instructions that, when executed by the at least one processor, cause the computing device to:
receive audio;
store, in non-transitory memory, audio data representing at least some of the audio;
determine a first location in the audio data that includes a first amount of non-speech audio data;
determine a wakeword at a second location in the audio data, the audio data including non-wakeword speech between the first location and the second location;
determine a third location in the audio data that includes a second amount of non-speech audio data, the third location being after the second location in the audio data; and
determine, for speech processing, a portion of the audio data starting with the first location and ending with the third location, the portion of the audio data comprising at least the non-wakeword speech.
Dependent claims: 11, 12, 13, 14, 15, 16, 17
Specification