Background speech recognition assistant using speaker verification
First Claim
Patent Images
1. A method comprising:
- receiving, by a computing device, an acoustic input signal at a speech recognizer;
identifying, by the computing device, a user that is speaking based on the acoustic input signal;
recognizing, by the computing device via the speech recognizer, speech uttered by the user in the acoustic input signal;
determining, by the computing device, speaker-specific information previously stored for the user;
determining, by the computing device, a set of potential responses based on the recognized speech and the speaker-specific information for the user;
ranking, by the computing device, the set of potential responses based on one or more criteria and the speaker-specific information;
determining, by the computing device for each response in the set of potential responses, whether the response should be output or should not be output based on the response'"'"'s ranking; and
if the response should be output;
selecting, by the computing device from among a plurality of preconfigured output methods, an output method for outputting the response to the user, the selecting being based on the response'"'"'s ranking; and
outputting, by the computing device, the response to the user using the selected output method.
0 Assignments
0 Petitions
Accused Products
Abstract
In one embodiment, a method includes receiving an acoustic input signal at a speech recognizer. A user is identified that is speaking based on the acoustic input signal. The method then determines speaker-specific information previously stored for the user and a set of responses based on the recognized acoustic input signal and the speaker-specific information for the user. It is determined if the response should be output and the response is outputted if it is determined the response should be output.
46 Citations
21 Claims
-
1. A method comprising:
-
receiving, by a computing device, an acoustic input signal at a speech recognizer; identifying, by the computing device, a user that is speaking based on the acoustic input signal; recognizing, by the computing device via the speech recognizer, speech uttered by the user in the acoustic input signal; determining, by the computing device, speaker-specific information previously stored for the user; determining, by the computing device, a set of potential responses based on the recognized speech and the speaker-specific information for the user; ranking, by the computing device, the set of potential responses based on one or more criteria and the speaker-specific information; determining, by the computing device for each response in the set of potential responses, whether the response should be output or should not be output based on the response'"'"'s ranking; and if the response should be output; selecting, by the computing device from among a plurality of preconfigured output methods, an output method for outputting the response to the user, the selecting being based on the response'"'"'s ranking; and outputting, by the computing device, the response to the user using the selected output method. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A non-transitory computer readable medium having stored thereon program code executable by a processor, the program code comprising:
-
code that causes the processor to receive an acoustic input signal at a speech recognizer; code that causes the processor to identify a user that is speaking based on the acoustic input signal; code that causes the processor to recognize, via the speech recognizer, speech uttered by the user in the acoustic input signal; code that causes the processor to determine speaker-specific information previously stored for the user; code that causes the processor to determine a set of potential responses based on the recognized speech and the speaker-specific information for the user; code that causes the processor to rank the set of potential responses based on one or more criteria and the speaker-specific information; code that causes the processor to determine, for each response in the set of potential responses, whether the response should be output or should not be output based on the response'"'"'s ranking; and if the response should be output; code that causes the processor to select, from among a plurality of preconfigured output methods, an output method for outputting the response to the user, the selecting being based on the response'"'"'s ranking; and code that causes the processor to output the response to the user using the selected output method.
-
-
21. A system comprising:
-
a processor; and a non-transitory computer readable medium having stored thereon program code that, when executed by the processor, causes the processor to; receive an acoustic input signal at a speech recognizer; identify a user that is speaking based on the acoustic input signal; recognize, via the speech recognizer, speech uttered by the user in the acoustic input signal; determine speaker-specific information previously stored for the user; determine a set of potential responses based on the recognized speech and the speaker-specific information for the user; rank the set of potential responses based on one or more criteria and the speaker-specific information; determine, for each response in the set of potential responses, whether the response should be output or should not be output based on the response'"'"'s ranking; and if the response should be output; select, from among a plurality of preconfigured output methods, an output method for outputting the response to the user, the selecting being based on the response'"'"'s ranking; and output the response to the user using the selected output method.
-
Specification