Speaker identification with user-selected password phrases
First Claim
1. A speaker identification system, comprising:
- a lexicon database storing for each of a group of enrolled users a set of one or more phonetic transcriptions of a password utterance associated with the enrolled user;
an HNM database storing for each of the group of enrolled users a hidden Markov model corresponding to said password utterance;
a speaker-independent phrase recognizer, which (i) selects N best matching password utterances based on an unknown utterance, and (ii) determines a speaker-independent score for each of the N best matching password utterances;
a speaker-dependent phrase recognizer, which determines a speaker-dependent score for each of the N best matching password utterances; and
a score processor, which (i) for each of the N best matching password utterances, sums the speaker-independent score and the speaker-dependent score to generate a combined score, and (ii) determines a putative identity based on the highest combined score.
4 Assignments
0 Petitions
Accused Products
Abstract
A speaker identification system includes a speaker-independent phrase recognizer. The speaker-independent phrase recognizer scores a password utterance against all the sets of phonetic transcriptions in a lexicon database to determine the N best speaker-independent scores, determines the N best sets of phonetic transcriptions based on the N best speaker-independent scores, and determines the N best possible identities. A speaker-dependent phrase recognizer retrieves the hidden Markov model corresponding to each of the N best possible identities, and scores the password utterance against each of the N hidden Markov models to generate a speaker-dependent score for each of the N best possible identities. A score processor coupled to the outputs of the speaker-independent phrase recognizer and the speaker-dependent phrase recognizer determines a putative identity. A verifier coupled to the score processor authenticates the determined putative identity.
-
Citations
14 Claims
-
1. A speaker identification system, comprising:
-
a lexicon database storing for each of a group of enrolled users a set of one or more phonetic transcriptions of a password utterance associated with the enrolled user; an HNM database storing for each of the group of enrolled users a hidden Markov model corresponding to said password utterance; a speaker-independent phrase recognizer, which (i) selects N best matching password utterances based on an unknown utterance, and (ii) determines a speaker-independent score for each of the N best matching password utterances; a speaker-dependent phrase recognizer, which determines a speaker-dependent score for each of the N best matching password utterances; and a score processor, which (i) for each of the N best matching password utterances, sums the speaker-independent score and the speaker-dependent score to generate a combined score, and (ii) determines a putative identity based on the highest combined score. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for identifying a speaker, comprising the steps of:
-
speaking a password utterance; determining a putative identity from an enrolled user group based on said password utterance by (i) selecting N best matching password utterances, (ii) determining a speaker-independent score for each of the N best matching password utterances, (iii) determining a speaker-dependent score for each of the N best matching password utterances, (iv) for each of the N best matching password utterances, summing the speaker-independent score and the speaker-dependent score to generate a combined score, and (v) determining the putative identity based on the highest combined score; and verifying the determined putative identity using said password utterance. - View Dependent Claims (7, 8, 9, 14)
-
-
10. A speaker identification system, comprising:
-
means for selecting a putative identity from an enrolled user group based on a spoken password utterance, wherein the means for selecting includes a means for summing speaker-independent and speaker-dependent scores to obtain a combined score; and means for verifying the selected putative identity using said spoken password utterance. - View Dependent Claims (11, 12, 13)
-
Specification