System and method for speaker verification using short utterance enrollments
First Claim
1. A method for verifying an identity of a speaker, comprising:
- receiving feature vectors extracted from an utterance received from a microphone and made by a speaker claiming an identity;
measuring dissimilarity between the feature vectors and a codebook associated with a version of the utterance known to be made by the identity;
analyzing the utterance to ascertain information about repeating occurrences of the feature vectors in the utterance;
comparing the information about repeating occurrences of feature vectors occurring in the utterance with information about repeating occurrences of feature vectors in a version of the utterance known to be made by the claimed identity;
assigning a penalty to the measured dissimilarity based on the comparison; and
determining whether to accept the speaker as the identity using the measured dissimilarity modified by the assigned penalty, wherein if the number of repeating occurrences for any of the feature vectors of the utterance exceeds a maximum number, rejecting the speaker as the identity, and wherein an additional penalty is assigned to the dissimilarity if any of the feature vectors of the utterance is determined to have a number of repeating occurrences exceeding the maximum number.
2 Assignments
0 Petitions
Accused Products
Abstract
In a system and method verifying an identity of a speaker, feature vectors are received that were extracted from an utterance made by a speaker claiming an identity. Dissimilarity may be measured between the feature vectors and a codebook associated with a version of the utterance known to be made by the identity. The utterance may be further analyzed to ascertain information about repeating occurrences of the feature vectors in the utterance. The information about repeating occurrences of feature vectors occurring in the utterance may be compared to information about repeating occurrences of feature vectors in a version of the utterance known to be made by the claimed identity. Based on the comparison, a penalty may be assigned to the measured dissimilarity. Using the measured dissimilarity modified by the assigned penalty, a determination may be made as to whether to accept the speaker as the identity.
28 Citations
11 Claims
-
1. A method for verifying an identity of a speaker, comprising:
-
receiving feature vectors extracted from an utterance received from a microphone and made by a speaker claiming an identity; measuring dissimilarity between the feature vectors and a codebook associated with a version of the utterance known to be made by the identity; analyzing the utterance to ascertain information about repeating occurrences of the feature vectors in the utterance; comparing the information about repeating occurrences of feature vectors occurring in the utterance with information about repeating occurrences of feature vectors in a version of the utterance known to be made by the claimed identity; assigning a penalty to the measured dissimilarity based on the comparison; and determining whether to accept the speaker as the identity using the measured dissimilarity modified by the assigned penalty, wherein if the number of repeating occurrences for any of the feature vectors of the utterance exceeds a maximum number, rejecting the speaker as the identity, and wherein an additional penalty is assigned to the dissimilarity if any of the feature vectors of the utterance is determined to have a number of repeating occurrences exceeding the maximum number. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for verifying an identity of a speaker, comprising:
-
receiving feature vectors extracted from an utterance received from a microphone and made by a speaker claiming an identity; measuring dissimilarity between the feature vectors and a codebook associated with a version of the utterance known to be made by the identity; analyzing the utterance to ascertain information about repeating occurrences of the feature vectors in the utterance; comparing the information about repeating occurrences of feature vectors occurring in the utterance with information about repeating occurrences of feature vectors in a version of the utterance known to be made by the claimed identity; assigning a penalty to the measured dissimilarity based on the comparison; and determining whether to accept the speaker as the identity using the measured dissimilarity modified by the assigned penalty, wherein if the number of repeating occurrences for any of the feature vectors of the utterance exceeds a maximum number, rejecting the speaker as the identity, and wherein the maximum number for a given feature vector is obtained by analyzing a plurality of utterances made by a plurality of speakers to identify the utterance of the plurality of utterances having the largest number of repeating occurrences of the given feature vector.
-
-
7. A method for verifying an identity of a speaker, comprising:
-
receiving feature vectors extracted from an utterance received from a microphone and made by a speaker claiming an identity; measuring dissimilarity between the feature vectors and a codebook associated with a version of the utterance known to be made by the identity; analyzing the utterance to ascertain information about repeating occurrences of the feature vectors in the utterance; comparing the information about repeating occurrences of feature vectors occurring in the utterance with information about repeating occurrences of feature vectors in a version of the utterance known to be made by the claimed identity; assigning a penalty to the measured dissimilarity based on the comparison; and determining whether to accept the speaker as the identity using the measured dissimilarity modified by the assigned penalty, wherein the assigned penalty comprises a penalty for each of the feature vectors of the utterance, wherein the assigned penalty for each feature vector is based on a difference between a number of repeating occurrences of the respective feature vector of the utterance and a number of repeating occurrences of the corresponding feature vector of the version of the utterance know to be made by the identity. - View Dependent Claims (8, 9, 10, 11)
-
Specification