Background speech recognition assistant using speaker verification
First Claim
Patent Images
1. A method comprising:
- receiving, by a computing device, an acoustic input signal at a speech recognizer;
identifying, by the computing device, a user that is speaking based on the acoustic input signal;
determining, by the computing device, speaker-specific information previously stored for the user;
determining, by the computing device, a set of classifications, wherein the set of classifications are determined based on the speaker-specific information;
classifying, by the computing device, portions of the acoustic input signal into different classifications in the set of classifications;
selecting, by the computing device, a classification in the set of classifications based on a criterion associated with the classification;
determining, by the computing device, a set of responses based on the recognized acoustic input signal, the classification, and the speaker-specific information for the user;
determining, by the computing device, if the response should be output; and
outputting, by the computing device, the response if it is determined the response should be output,wherein classifying portions is performed in an always on mode, and wherein identifying the user that is speaking is performed after receiving a trigger phrase to activate the speech recognizer.
1 Assignment
0 Petitions
Accused Products
Abstract
In one embodiment, a method includes receiving an acoustic input signal at a speech recognizer. A user is identified that is speaking based on the acoustic input signal. The method then determines speaker-specific information previously stored for the user and a set of responses based on the recognized acoustic input signal and the speaker-specific information for the user. It is determined if the response should be output and the response is outputted if it is determined the response should be output.
-
Citations
25 Claims
-
1. A method comprising:
-
receiving, by a computing device, an acoustic input signal at a speech recognizer; identifying, by the computing device, a user that is speaking based on the acoustic input signal; determining, by the computing device, speaker-specific information previously stored for the user; determining, by the computing device, a set of classifications, wherein the set of classifications are determined based on the speaker-specific information; classifying, by the computing device, portions of the acoustic input signal into different classifications in the set of classifications; selecting, by the computing device, a classification in the set of classifications based on a criterion associated with the classification; determining, by the computing device, a set of responses based on the recognized acoustic input signal, the classification, and the speaker-specific information for the user; determining, by the computing device, if the response should be output; and outputting, by the computing device, the response if it is determined the response should be output, wherein classifying portions is performed in an always on mode, and wherein identifying the user that is speaking is performed after receiving a trigger phrase to activate the speech recognizer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method comprising:
-
receiving, by a computing device, a signal from a first stage recognizer based on recognition of an acoustic input signal and classification of portions of the acoustic input signal into a classification in a plurality of classifications using a first speech recognition algorithm, the first stage recognizer being configured to recognize the acoustic input signal in an always on mode; activating, by the computing device, the second stage recognizer upon receiving the signal to recognize the acoustic input signal, the second stage recognizer configured to use a second speech recognition algorithm; identifying, by the computing device, a user that is speaking based on the acoustic input signal; determining, by the computing device, speaker-specific information previously stored for the user; determining, by the computing device, a response to the recognized acoustic input signal based on the speaker-specific information; determining, by the computing device, if the response should be output based on a ranking of the response; and outputting, by the computing device, the response if it is determined the response should be output. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A system comprising:
-
a first stage recognizer configured to recognize the acoustic input signal using a first speech recognition algorithm in an always on mode, the first stage recognizer configured to; receive an acoustic input signal; identify a user that is speaking based on the acoustic input signal; determine speaker specific information previously stored for the user; classify portions of the acoustic input signal into different classifications using a first speech recognition algorithm; determine a second stage recognizer should be triggered based on a selection of a classification based on classified portions being classified with the selected classification and the speaker-specific information; and a second stage recognizer configured to; receive a signal from the first stage recognizer to activate the second stage recognizer; activate the second stage recognizer upon receiving the signal to recognize the acoustic input signal, the second stage recognizer configured to use a second speech recognition algorithm different from the first speech recognition algorithm to recognize the acoustic input signal; determine a response to the recognized acoustic input signal using the speaker-specific information; determine if the response should be output based on a ranking of the response; and output the response if it is determined the response should be output. - View Dependent Claims (19, 20)
-
-
21. A method comprising:
-
receiving, by a computing device, a trigger phrase; activating, by the computing device, a speech recognizer based on receiving the trigger phrase; receiving, by the computing device, an acoustic input signal at the speech recognizer; identifying, by the computing device, a user that is speaking based on the acoustic input signal or the trigger phrase; determining, by the computing device, speaker-specific information previously stored for the user; determining, by the computing device, a set of classifications, wherein the set of classifications are determined based on the speaker-specific information; classifying, by the computing device, portions of the acoustic input signal into different classifications in the set of classifications; selecting, by the computing device, a classification in the set of classifications based on a criterion associated with the classification; determining, by the computing device, a set of responses based on the recognized acoustic input signal, the classification, and the speaker-specific information for the user; and outputting, by the computing device, the response if it is determined the response should be output wherein classifying portions is performed in an always on mode, and wherein identifying the user that is speaking is performed after receiving the trigger phrase to activate the speech recognizer. - View Dependent Claims (22, 23, 24, 25)
-
Specification