DYNAMIC THRESHOLD FOR SPEAKER VERIFICATION
First Claim
1. A computer-implemented method comprising:
- receiving, for each of multiple utterances of a hotword, a data set including at least (i) a speaker verification confidence score associated with the utterance, and (ii) environmental context data associated with the utterance;
selecting from among the data sets, a subset of the data sets that are associated with a particular environmental context;
selecting a particular data set from among the subset of data sets based on one or more selection criteria;
selecting, as a speaker verification threshold for the particular environmental context, the speaker verification confidence score included in the particular data set; and
providing the speaker verification threshold for use in performing speaker verification of utterances that are associated with the particular environmental context.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a dynamic threshold for speaker verification are disclosed. In one aspect, a method includes the actions of receiving, for each of multiple utterances of a hotword, a data set including at least a speaker verification confidence score, and environmental context data. The actions further include selecting from among the data sets, a subset of the data sets that are associated with a particular environmental context. The actions further include selecting a particular data set from among the subset of data sets based on one or more selection criteria. The actions further include selecting, as a speaker verification threshold for the particular environmental context, the speaker verification confidence score. The actions further include providing the speaker verification threshold for use in performing speaker verification of utterances that are associated with the particular environmental context.
230 Citations
20 Claims
-
1. A computer-implemented method comprising:
-
receiving, for each of multiple utterances of a hotword, a data set including at least (i) a speaker verification confidence score associated with the utterance, and (ii) environmental context data associated with the utterance; selecting from among the data sets, a subset of the data sets that are associated with a particular environmental context; selecting a particular data set from among the subset of data sets based on one or more selection criteria; selecting, as a speaker verification threshold for the particular environmental context, the speaker verification confidence score included in the particular data set; and providing the speaker verification threshold for use in performing speaker verification of utterances that are associated with the particular environmental context. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system comprising:
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving, for each of multiple utterances of a hotword, a data set including at least (i) a speaker verification confidence score associated with the utterance, and (ii) environmental context data associated with the utterance; selecting from among the data sets, a subset of the data sets that are associated with a particular environmental context; selecting a particular data set from among the subset of data sets based on one or more selection criteria; selecting, as a speaker verification threshold for the particular environmental context, the speaker verification confidence score included in the particular data set; and providing the speaker verification threshold for use in performing speaker verification of utterances that are associated with the particular environmental context. - View Dependent Claims (15, 16, 17, 18, 19)
-
20. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
receiving, for each of multiple utterances of a hotword, a data set including at least (i) a speaker verification confidence score associated with the utterance, and (ii) environmental context data associated with the utterance; selecting from among the data sets, a subset of the data sets that are associated with a particular environmental context; selecting a particular data set from among the subset of data sets based on one or more selection criteria; selecting, as a speaker verification threshold for the particular environmental context, the speaker verification confidence score included in the particular data set; and providing the speaker verification threshold for use in performing speaker verification of utterances that are associated with the particular environmental context.
-
Specification