Voice log-in using spoken name input
First Claim
1. A voice log-in method for logging in to a system based on computerized recognition of a spoken name input comprising the steps:
- creating an augmented name recognition model from the spoken name input for each person to be enrolled, wherein said augmented name recognition model includes constituent name-part utterances, and also includes any pause in the spoken name input, and wherein said augmented name recognition model represents a portion of said constituent name-part utterances as optional, thereby accommodating elimination of optional name-part utterances, and wherein said pause is also represented as optional, thereby accommodating unpredictable variations in said pause, and wherein said creating name recognition models is accomplished using HMM (Hidden Markov Modeling) to create HMM name recognition models;
storing said name recognition model for each person enrolled in a name recognition model database;
comparing the spoken name input with the stored name recognition models each time a person seeks access to the system by voice log-in; and
logging a person in to the system if a pattern match is found, during said comparing, between the spoken name input and one of the stored name recognition models.
1 Assignment
0 Petitions
Accused Products
Abstract
A voice log-in system is based on a person'"'"'s spoken name input only, using speaker-dependent acoustic name recognition models in a performing speaker-independent name recognition. In an enrollment phase, a dual pass endpointing procedure defines both the person'"'"'s full name (broad endpoints), and the component names separated by pauses (precise endpoints). An HMM (Hidden Markov Model) recognition model generator generates a corresponding HMM name recognition model modified by the insertion of additional skip transitions for the pauses between component names. In a recognition/update phase, a spoken-name speech signal is input to an HMM name recognition engine which performs speaker-independent name recognition--the modified HMM name recognition model permits the name recognition operation to accommodate pauses between component names of variable duration.
322 Citations
17 Claims
-
1. A voice log-in method for logging in to a system based on computerized recognition of a spoken name input comprising the steps:
-
creating an augmented name recognition model from the spoken name input for each person to be enrolled, wherein said augmented name recognition model includes constituent name-part utterances, and also includes any pause in the spoken name input, and wherein said augmented name recognition model represents a portion of said constituent name-part utterances as optional, thereby accommodating elimination of optional name-part utterances, and wherein said pause is also represented as optional, thereby accommodating unpredictable variations in said pause, and wherein said creating name recognition models is accomplished using HMM (Hidden Markov Modeling) to create HMM name recognition models; storing said name recognition model for each person enrolled in a name recognition model database; comparing the spoken name input with the stored name recognition models each time a person seeks access to the system by voice log-in; and logging a person in to the system if a pattern match is found, during said comparing, between the spoken name input and one of the stored name recognition models. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A voice log-in method for logging in to a system based on computerized recognition of a spoken name input comprising the steps:
-
creating an augmented name recognition model from the spoken name input for each person to be enrolled, wherein said augmented name recognition model includes constituent name-part utterances, and also includes any pause in the spoken name input, and wherein said augmented name recognition model represents a portion of said constituent name-part utterances as optional, thereby accommodating elimination of optional name-part utterances, and wherein said pause is also represented as optional, thereby accommodating unpredictable variations in said pause; storing a name recognition model for each person enrolled in a name recognition model database; comparing the spoken name input for each person seeking access to the system by voice log-in with the stored name recognition models; and logging a person in to the system if during said comparing, a pattern match is found between the spoken name input and one of the stored name recognition models. - View Dependent Claims (11, 12, 13, 14)
-
-
15. A voice log-in method for logging in to a system based on computerized recognition of a spoken name input comprising the steps:
-
receiving the spoken name input for each person to be enrolled; making a first endpointing pass using broad endpoint criteria to delimit the beginning and end of a corresponding full-name utterance; and
thenmaking a second endpointing pass using precise endpoint criteria to locate the precise endpoints within such delimited full-name utterance that define constituent name-part utterances and any associated pause; creating an augmented name recognition model, based on broad and precise endpoints and the full-name and name-part utterances, wherein said augmented name recognition model represents a portion of said constituent name-part utterances as optional, thereby accommodating elimination of optional name-part utterances, and wherein said pause is also represented as optional, thereby accommodating unpredictable variations in said pause; storing said augmented name recognition model for each person enrolled in a name recognition model database; comparing the spoken name input each time a person seeks access to the system by voice log-in with the stored augmented name recognition models; and logging a person in to the system if, while during said comparing, a pattern match is found between the spoken name input and one of the stored name recognition models. - View Dependent Claims (16, 17)
-
Specification