Keyword spotting with competitor models
First Claim
1. A system comprising:
an electronic data store configured to store a keyword model that models a keyword and a competitor model that models a competitor word; and
a user device in communication with the electronic data store, the user device configured to:
receive a voice signal corresponding to a first utterance of a user;
compute feature vectors using the voice signal;
obtain a first score using the feature vectors and the keyword model, wherein the first score indicates a likelihood that the voice signal comprises the keyword;
obtain a second score using the feature vectors and the competitor model, wherein the second score indicates a likelihood that the voice signal comprises the competitor word;
determine that the voice signal comprises the keyword using the first score and the second score; and
transmit information identifying the keyword and a portion of the voice signal that corresponds to the keyword to a server device, wherein the server device is configured to:
perform speech recognition on the portion to obtain speech recognition results;
generate, using the speech recognition results, a model, wherein the model is one of an updated keyword model, an updated competitor model, or a second competitor model; and
transmit the generated model to the user device; and
store the generated model in the electronic data store.
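The last elements of the claim describe a feedback loop: the server generates a model (an updated keyword model, an updated competitor model, or a second competitor model), and that model ends up in the electronic data store. The following is a minimal sketch of the storing step only. The names `ModelUpdate`, `DataStore`, and the `kind` strings are hypothetical and do not come from the patent; the model payload is treated as an opaque object.

```python
from dataclasses import dataclass, field
from typing import Any, Dict

# Hypothetical labels for the three model types named in the claim.
UPDATED_KEYWORD = "updated_keyword"
UPDATED_COMPETITOR = "updated_competitor"
SECOND_COMPETITOR = "second_competitor"


@dataclass
class ModelUpdate:
    """A model generated by the server (illustrative structure only)."""
    kind: str    # one of the three labels above
    word: str    # the keyword or competitor word that the model represents
    model: Any   # opaque model payload


@dataclass
class DataStore:
    """Illustrative stand-in for the electronic data store."""
    keyword_models: Dict[str, Any] = field(default_factory=dict)
    competitor_models: Dict[str, Any] = field(default_factory=dict)

    def store(self, update: ModelUpdate) -> None:
        """Store a generated model, replacing any older model for the same word."""
        if update.kind == UPDATED_KEYWORD:
            self.keyword_models[update.word] = update.model
        elif update.kind in (UPDATED_COMPETITOR, SECOND_COMPETITOR):
            # An updated competitor model overwrites the existing entry;
            # a second competitor model adds an entry for a new word.
            self.competitor_models[update.word] = update.model
        else:
            raise ValueError(f"unknown model kind: {update.kind!r}")
```

Keying both tables by word is a design assumption: it lets an updated model overwrite its predecessor while a second competitor model simply adds an entry.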
Abstract
Keyword spotting may be improved by using a competitor model. In some embodiments, audio data is received by a device. At least a portion of the audio data may be compared with a keyword model to obtain a first score. The keyword model may model a keyword. The portion of the audio data may also be compared with a competitor model to obtain a second score. The competitor model may model a competitor word, which may be a word that is similar to the keyword. The device may compare the first score and the second score to determine if a keyword is spoken.
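As an illustration of the score comparison described in the abstract, the sketch below scores feature vectors against a keyword model and one or more competitor models, then accepts the keyword only if its score beats the best competitor score by a margin. The abstract does not say how the models are built or how the scores are compared. The diagonal Gaussian model, the average log-likelihood score, the `margin` parameter, and all names here are assumptions chosen for simplicity.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


class DiagonalGaussianModel:
    """Hypothetical stand-in for a keyword model or a competitor model."""

    def __init__(self, mean: Vector, var: Vector) -> None:
        self.mean = list(mean)
        self.var = list(var)

    def score(self, features: Sequence[Vector]) -> float:
        """Return the average per-frame log-likelihood of the feature vectors."""
        if not features:
            raise ValueError("at least one feature vector is required")
        total = 0.0
        for frame in features:
            for x, m, v in zip(frame, self.mean, self.var):
                total += -0.5 * (math.log(2.0 * math.pi * v) + (x - m) ** 2 / v)
        return total / len(features)


def keyword_detected(
    features: Sequence[Vector],
    keyword_model: DiagonalGaussianModel,
    competitor_models: List[DiagonalGaussianModel],
    margin: float = 0.0,
) -> bool:
    """Accept the keyword only if its score exceeds every competitor score by `margin`."""
    first_score = keyword_model.score(features)
    second_score = max(m.score(features) for m in competitor_models)
    return first_score - second_score > margin
```

Taking the maximum over competitor models is one possible way to handle several competitor words; with a single competitor model it reduces to a direct comparison of the first and second scores.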
125 Citations
32 Claims
1. A system comprising: (recited in full under First Claim above)
Dependent claims: 2, 3, 4, 5, 6
7. A computer-implemented method, comprising:
as implemented by one or more computing devices configured with specific computer-executable instructions,
receiving a voice signal corresponding to a first utterance;
obtaining a first score using the voice signal and a keyword model, wherein the first score indicates a degree of similarity between the voice signal and a keyword;
obtaining a second score using the voice signal and a competitor model, wherein the second score indicates a degree of similarity between the voice signal and a competitor word;
determining that the voice signal comprises the keyword using the first score and the second score; and
transmitting the keyword and a portion of the voice signal that corresponds to the keyword to a second device configured to perform speech recognition on the portion and configured to provide, based on the performed speech recognition, one of an updated keyword model or a second competitor model.
Dependent claims: 8, 9, 10, 11, 12, 13
14. A non-transitory computer-readable medium comprising one or more modules configured to execute in one or more processors of a computing device, the one or more modules being further configured to:
receive a voice signal corresponding to a first utterance;
obtain a first score using the voice signal and a keyword model, wherein the first score indicates a degree of similarity between the voice signal and a keyword;
obtain a second score using the voice signal and a competitor model, wherein the second score indicates a degree of similarity between the voice signal and a competitor word;
determine that the voice signal comprises the keyword using the first score and the second score; and
transmit the keyword and a portion of the voice signal that corresponds to the keyword to a second device configured to perform speech recognition on the portion and configured to provide a model based on the performed speech recognition.
Dependent claims: 15, 16, 17, 18, 19, 20
21. A system comprising:
a memory storing specific computer-executable instructions; and
a processor configured to execute the specific computer-executable instructions, wherein execution of the specific computer-executable instructions by the processor causes the system to:
receive, from a user device, a voice signal and a keyword identified by a keyword model as corresponding to the voice signal;
perform speech recognition on the voice signal to obtain speech recognition results;
determine that the speech recognition results do not comprise the keyword; and
determine a competitor model, wherein the competitor model models a competitor word, and wherein the speech recognition results comprise the competitor word.
Dependent claims: 22, 23, 24, 25, 26
27. A computer-implemented method, comprising:
as implemented by one or more computing devices configured with specific computer-executable instructions,
receiving, from a user device, a voice signal and a keyword identified by a keyword model as corresponding to the voice signal;
performing speech recognition on the voice signal to obtain speech recognition results;
determining that the speech recognition results do not comprise the keyword; and
determining a competitor model, wherein the competitor model models a competitor word, and wherein the speech recognition results comprise the competitor word.
Dependent claims: 28, 29, 30, 31, 32
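The server-side steps of claims 21 and 27 can be sketched as follows: check whether the speech recognition results contain the reported keyword and, if they do not, pick a word from the results to serve as the competitor word. The claims only require that the competitor word appear in the recognition results. Choosing the word whose spelling is closest to the keyword, the `min_similarity` threshold, and the function name are assumptions made for illustration, and spelling similarity is only a rough proxy for acoustic similarity.

```python
from difflib import SequenceMatcher
from typing import List, Optional


def find_competitor_word(
    keyword: str,
    recognized_words: List[str],
    min_similarity: float = 0.5,
) -> Optional[str]:
    """Return a candidate competitor word, or None if no word qualifies.

    None is also returned when the recognition results contain the keyword,
    because the device's detection was then correct.
    """
    target = keyword.lower()
    words = [w.lower() for w in recognized_words]
    if target in words:
        return None

    best_word: Optional[str] = None
    best_ratio = min_similarity
    for word in words:
        # Similarity ratio in [0, 1] based on matching character blocks.
        ratio = SequenceMatcher(None, target, word).ratio()
        if ratio >= best_ratio:
            best_word, best_ratio = word, ratio
    return best_word
```

A competitor model for the returned word would then be built or looked up by whatever means the system provides; that step is outside this sketch.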
Specification