Wake word evaluation
Abstract
Natural language controlled devices may be configured to activate command recognition in response to one or more wake words. Techniques are provided to receive a candidate word for evaluation as a wake word that activates a natural language control functionality of a computing device. The candidate word may include one or more words or sounds. Values for multiple wake word metrics are then determined. The candidate word is evaluated based on the various wake word metrics.
35 Claims
1. A computing system comprising:
one or more processors;
one or more computer-readable storage media storing computer-executable instructions;
a speech recognition module maintained in the one or more computer-readable storage media and executed by the one or more processors to interpret a signal representing the audio input as one or more words; and
a wake word evaluation module maintained in the one or more computer-readable storage media and executed by the one or more processors to:
receive a plurality of candidate wake words for evaluation, wherein the plurality of candidate wake words comprises a first candidate wake word;
obtain a corpus of text for evaluating the plurality of candidate wake words;
obtain a corpus of audio data for evaluating the plurality of candidate wake words;
perform speech recognition on the corpus of audio data using the speech recognition module to obtain speech recognition results;
determine a plurality of values from wake word metrics for the first candidate wake word using at least one of the corpus of text or the speech recognition results, at least one of the plurality of values including a value indicative of an ease of pronouncing the first candidate word;
determine an overall score for the first candidate wake word based at least in part on the plurality of values; and
select a first wake word from the plurality of candidate wake words using the overall score.
Dependent claims: 2, 3, 4, 5.
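The last two steps of claim 1 (combining metric values into an overall score, then selecting a wake word) can be illustrated with a minimal sketch. The claim does not say how the values are combined, so the weighted sum, the weights, the metric names, and the assumption that every value is normalised to the range 0 to 1 are all hypothetical choices made for this example.

```python
from typing import Dict

# Hypothetical weights; the claim does not specify how values are combined.
# Negative weights penalise metrics where a larger value is worse.
WEIGHTS: Dict[str, float] = {
    "ease_of_pronunciation": 0.5,
    "word_frequency": -0.3,
    "false_alarm_count": -0.2,
}


def overall_score(values: Dict[str, float],
                  weights: Dict[str, float] = WEIGHTS) -> float:
    """Combine per-metric values into one score (weighted sum, an assumption)."""
    return sum(weights[name] * values[name] for name in weights)


def select_wake_word(candidates: Dict[str, Dict[str, float]]) -> str:
    """Return the candidate word with the highest overall score."""
    return max(candidates, key=lambda word: overall_score(candidates[word]))


# Illustrative, made-up metric values normalised to [0, 1].
candidates = {
    "zebra": {"ease_of_pronunciation": 0.9, "word_frequency": 0.1,
              "false_alarm_count": 0.2},
    "the": {"ease_of_pronunciation": 1.0, "word_frequency": 1.0,
            "false_alarm_count": 0.9},
}
best = select_wake_word(candidates)
```

With these made-up values, a very common word such as "the" is easy to pronounce but is penalised heavily for frequency and false alarms, so the rarer candidate wins.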
6. A method comprising:
under control of one or more computer systems configured with executable instructions,
receiving an audio input including a first candidate word for evaluation as a wake word that activates functionality of a computing device;
determining a plurality of values from wake word metrics for the first candidate word, at least one of the plurality of values including a value indicative of an ease of pronouncing the first candidate word; and
determining an overall score for the first candidate word based at least in part on the plurality of values.
Dependent claims: 7, 8, 9, 10, 11, 12, 13, 14.
15. A method comprising:
under control of one or more computer systems configured with executable instructions,
receiving an audio input including a first candidate word for evaluation as a wake word that activates functionality of a computing device;
determining a plurality of values from wake word metrics for the first candidate word, wherein the plurality of values includes a false alarm count value that is determined for the first candidate word by:
comparing a phone sequence of the first candidate word to a plurality of phone sequences obtained from speech recognition results;
determining a match for each of the plurality of phone sequences obtained from speech recognition results; and
determining a count of phone sequences obtained from speech recognition results that have a match below a threshold level; and
determining an overall score for the first candidate word based at least in part on the plurality of values.
Dependent claims: 16, 17.
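The false alarm count of claim 15 can be sketched as follows. The claim does not define how a "match" is computed; this example assumes the match is a Levenshtein edit distance over phones, so that a value below the threshold means a recognized sequence is confusably close to the candidate. The function names, the threshold of 2, and the ARPAbet-style phone labels are illustrative assumptions.

```python
from typing import List, Sequence


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two phone sequences."""
    prev = list(range(len(b) + 1))
    for i, phone_a in enumerate(a, start=1):
        curr = [i]
        for j, phone_b in enumerate(b, start=1):
            cost = 0 if phone_a == phone_b else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def false_alarm_count(candidate: Sequence[str],
                      recognized: List[Sequence[str]],
                      threshold: int = 2) -> int:
    """Count recognized phone sequences whose distance to the candidate
    is below the threshold (assumed meaning of "match below a threshold")."""
    return sum(1 for seq in recognized
               if edit_distance(candidate, seq) < threshold)


# Made-up phone sequences standing in for speech recognition results.
candidate = ["AH", "L", "EH", "K", "S", "AH"]
recognized = [
    ["AH", "L", "EH", "K", "S", "AH"],  # identical, distance 0
    ["AH", "L", "EH", "K", "S"],        # one phone dropped, distance 1
    ["HH", "EH", "L", "OW"],            # unrelated word
]
count = false_alarm_count(candidate, recognized)
```

A higher count suggests that ordinary speech often sounds like the candidate, which would make it a poorer wake word.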
18. A method comprising:
under control of one or more computer systems configured with executable instructions,
receiving an audio input including a first candidate word for evaluation as a wake word that activates functionality of a computing device;
determining a plurality of values from wake word metrics for the first candidate word, wherein the plurality of values includes a word frequency value that is determined for the first candidate word by determining a frequency of occurrence of the first candidate word in a test set comprising one or more words; and
determining an overall score for the first candidate word based at least in part on the plurality of values.
Dependent claims: 19, 20.
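The word frequency value of claim 18 can be illustrated with a short sketch that computes the relative frequency of a candidate in a test set. The tokenisation rule, the function name, and the choice of relative rather than absolute frequency are assumptions for this example, and the sketch handles single-word candidates only.

```python
import re
from collections import Counter


def word_frequency(candidate: str, test_set_text: str) -> float:
    """Relative frequency of a single-word candidate in the test set text."""
    # Simple lowercase tokenisation; a real system would likely do more.
    tokens = re.findall(r"[a-z']+", test_set_text.lower())
    if not tokens:
        return 0.0
    return Counter(tokens)[candidate.lower()] / len(tokens)


# Made-up test set of nine words.
text = "Turn on the light and turn off the fan"
turn_freq = word_frequency("turn", text)    # 2 occurrences out of 9 tokens
zebra_freq = word_frequency("zebra", text)  # never occurs
```

A candidate that appears often in everyday text is more likely to be spoken unintentionally, so a lower frequency is usually preferable.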
21. A computing system comprising:
one or more processors;
one or more computer-readable storage media storing computer-executable instructions;
a wake word evaluation module maintained in the one or more computer-readable storage media and executed by the one or more processors to:
receive a candidate word for evaluation as a wake word that activates functionality of a computing device, the candidate word received as an audio input;
determine a plurality of values from wake word metrics for the candidate word, at least one of the plurality of values including a word frequency value; and
determine an overall score for the candidate wake word based at least in part on the plurality of values.
Dependent claims: 22, 23, 24, 25.
26. A computing system comprising:
one or more processors;
one or more computer-readable storage media storing computer-executable instructions;
a wake word evaluation module maintained in the one or more computer-readable storage media and executed by the one or more processors to:
receive an audio input including a candidate word for evaluation as a wake word that activates functionality of a computing device;
determine a plurality of values from wake word metrics for the candidate word, wherein the plurality of values includes a word frequency value that is determined for the candidate word by determining a frequency of occurrence of the candidate word in a test set comprising one or more words; and
determine an overall score for the candidate wake word based at least in part on the plurality of values.
Dependent claims: 27, 28.
29. A computing system comprising:
one or more processors;
one or more computer-readable storage media storing computer-executable instructions;
a wake word evaluation module maintained in the one or more computer-readable storage media and executed by the one or more processors to:
receive an audio input including a candidate word for evaluation as a wake word that activates functionality of a computing device;
determine a plurality of values from wake word metrics for the candidate word, wherein the plurality of values includes a value indicative of an ease of pronouncing the candidate word determined at least in part by determining a number of constraints on syllables in a language that the candidate word violates, wherein the determination includes a weighting of each of the constraints violated by the candidate word; and
determine an overall score for the candidate wake word based at least in part on the plurality of values.
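The weighted constraint count of claim 29 can be sketched as a sum of penalties over syllables. The claim names neither the constraints nor the weights; the four constraints below (loosely modelled on common syllable-structure constraints), their weights, and the (onset, nucleus, coda) representation are hypothetical choices for illustration only.

```python
from typing import Callable, Dict, List, Tuple

# A syllable as (onset, nucleus, coda), each a list of phone labels.
Syllable = Tuple[List[str], List[str], List[str]]

# Hypothetical constraints: name -> (weight, test returning True if violated).
CONSTRAINTS: Dict[str, Tuple[float, Callable[[Syllable], bool]]] = {
    "missing_onset": (1.0, lambda s: len(s[0]) == 0),
    "has_coda": (0.5, lambda s: len(s[2]) > 0),
    "complex_onset": (2.0, lambda s: len(s[0]) > 1),
    "complex_coda": (2.0, lambda s: len(s[2]) > 1),
}


def pronunciation_penalty(syllables: List[Syllable]) -> float:
    """Weighted count of constraint violations; lower means easier to say."""
    total = 0.0
    for syllable in syllables:
        for weight, violated in CONSTRAINTS.values():
            if violated(syllable):
                total += weight
    return total


# "strengths": one syllable with a heavy onset and coda cluster.
strengths = [(["S", "T", "R"], ["EH"], ["NG", "K", "TH", "S"])]
# "mama": two simple consonant-vowel syllables.
mama = [(["M"], ["AA"], []), (["M"], ["AA"], [])]

hard = pronunciation_penalty(strengths)  # has_coda + complex_onset + complex_coda
easy = pronunciation_penalty(mama)       # no violations
```

Weighting lets a serious violation, such as a long consonant cluster, count for more than a mild one, such as a single final consonant.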
30. One or more non-transitory computer-readable media having computer-executable instructions that, when executed by one or more processors, cause the one or more processors to perform operations comprising:
receiving an audio input including a candidate word for evaluation as a wake word that activates functionality of a computing device, the candidate word comprising a sequence of subword units;
determining a plurality of wake word values for the candidate word, at least one wake word value including a word frequency value; and
determining an overall value for the candidate word based at least in part on the plurality of wake word values.
Dependent claims: 31, 32, 33, 34, 35.
Specification