Providing an indication of the suitability of speech recognition
First Claim
1. A non-transitory computer-readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by one or more processors of an electronic device, cause the electronic device to:
- receive speech input from a user;
determine whether the speech input includes a spoken trigger;
in response to determining that the speech input includes a spoken trigger, obtain audio input from an acoustic environment;
while producing speech recognition results by performing speech recognition on the audio input, determine a speech recognition suitability value based on the audio input; and
in accordance with a determination that the speech recognition suitability value does not satisfy a predetermined criterion, provide an output to indicate that the acoustic environment is not suitable for performing speech recognition.
0 Assignments
0 Petitions
Accused Products
Abstract
This relates to providing an indication of the suitability of an acoustic environment for performing speech recognition. One process can include receiving an audio input and determining a speech recognition suitability based on the audio input. The speech recognition suitability can include a numerical, textual, graphical, or other representation of the suitability of an acoustic environment for performing speech recognition. The process can further include displaying a visual representation of the speech recognition suitability to indicate the likelihood that a spoken user input will be interpreted correctly. This allows a user to determine whether to proceed with the performance of a speech recognition process, or to move to a different location having a better acoustic environment before performing the speech recognition process. In some examples, the user device can disable operation of a speech recognition process in response to determining that the speech recognition suitability is below a threshold suitability.
-
Citations
54 Claims
-
1. A non-transitory computer-readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by one or more processors of an electronic device, cause the electronic device to:
-
receive speech input from a user; determine whether the speech input includes a spoken trigger; in response to determining that the speech input includes a spoken trigger, obtain audio input from an acoustic environment; while producing speech recognition results by performing speech recognition on the audio input, determine a speech recognition suitability value based on the audio input; and in accordance with a determination that the speech recognition suitability value does not satisfy a predetermined criterion, provide an output to indicate that the acoustic environment is not suitable for performing speech recognition. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. An electronic device, comprising:
-
one or more processors; a memory; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for; receiving speech input from a user; determining whether the speech input includes a spoken trigger; in response to determining that the speech input includes a spoken trigger, obtaining audio input from an acoustic environment; while producing speech recognition results by performing speech recognition on the audio input, determining a speech recognition suitability value based on the audio input; and in accordance with a determination that the speech recognition suitability value does not satisfy a predetermined criterion, providing an output to indicate that the acoustic environment is not suitable for performing speech recognition. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37)
-
-
20. A method, comprising:
at an electronic device with one or more processors and memory; receiving speech input from a user; determining whether the speech input includes a spoken trigger; in response to determining that the speech input includes a spoken trigger, obtaining audio input from an acoustic environment; while producing speech recognition results by performing speech recognition on the audio input, determining a speech recognition suitability value based on the audio input; and in accordance with a determination that the speech recognition suitability value does not satisfy a predetermined criterion, providing an output to indicate that the acoustic environment is not suitable for performing speech recognition. - View Dependent Claims (38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54)
Specification