Voice authentication system
First Claim
1. A text-dependent voice authentication system that performs authentication by receiving a keyword which a user inputs by voice, comprising:
- an input part that receives a voice input of a keyword from the user, said keyword is divided into a plurality of portions with an utterable unit which is a minimum unit, said voice input is carried out over a plurality of times at a time interval for each of the portions;
a speaker model storage part that previously stores a registered keyword of the user as a speaker model created in the utterable unit;
a feature value conversion part that obtains a feature value of a voice from a portion of the keyword which is received by a first voice input in the input part;
a similarity calculation part that obtains a similarity between the feature value obtained in the feature value conversion part and the speaker model;
a speech content determination part that determines information on a speech content by the plurality of times of voice inputs, based on the similarity obtained in the similarity calculation part;
a keyword checking part that determines whether or not the speech content of the plurality of times of voice inputs is capable of configuring an entire registered keyword, based on the information on the speech content determined in the speech content determination part; and
an authentication determination part that determines whether to accept or reject authentication, based on a determination result in the keyword checking part and the similarity obtained in the similarity calculation part.
1 Assignment
0 Petitions
Accused Products
Abstract
A text-dependent voice authentication system that performs authentication by urging a user to input a keyword by voice includes: an input part (11) that receives a voice input of a keyword divided into a plurality of portions with an utterable unit being a minimum unit over a plurality of times at a time interval for each of the portions; registered speaker-specific syllable model DB (20) that previously stores a registered keyword of a user as a speaker model created in the utterable unit; a feature value conversion part (12) that obtains a feature value of a voice contained in a portion of the keyword received by the first voice input in the input part (11) from the portion; a similarity calculation part (13) that obtains a similarity between the feature value and the speaker model; a keyword checking part (17) that determines whether or not voice inputs of all the syllables or phonemes configuring an entire registered keyword by the plurality of times of voice inputs, based on the similarity obtained in the similarity calculation part; and an authentication determination part (19) that determines whether to accept or reject authentication, based on a determination result in the keyword checking part and the similarity obtained in the similarity calculation part.
46 Citations
10 Claims
-
1. A text-dependent voice authentication system that performs authentication by receiving a keyword which a user inputs by voice, comprising:
-
an input part that receives a voice input of a keyword from the user, said keyword is divided into a plurality of portions with an utterable unit which is a minimum unit, said voice input is carried out over a plurality of times at a time interval for each of the portions; a speaker model storage part that previously stores a registered keyword of the user as a speaker model created in the utterable unit; a feature value conversion part that obtains a feature value of a voice from a portion of the keyword which is received by a first voice input in the input part; a similarity calculation part that obtains a similarity between the feature value obtained in the feature value conversion part and the speaker model; a speech content determination part that determines information on a speech content by the plurality of times of voice inputs, based on the similarity obtained in the similarity calculation part; a keyword checking part that determines whether or not the speech content of the plurality of times of voice inputs is capable of configuring an entire registered keyword, based on the information on the speech content determined in the speech content determination part; and an authentication determination part that determines whether to accept or reject authentication, based on a determination result in the keyword checking part and the similarity obtained in the similarity calculation part. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer program product stored on a computer-readable medium, for causing a computer to embody a text-dependent voice authentication system that performs authentication by receiving a keyword which a user to input a keyword by voice, said computer program comprising the operations of
an input operation of receiving a voice input of a keyword from the user, said keyword is divided into a plurality of portions with an utterable unit which is a minimum unit, said voice input is carried out over a plurality of times at a time interval for each of the portions; -
a feature value conversion operation of obtaining a feature value of from a portion of the keyword which is received by a first voice input in the input part, said feature value of a voice is a speech signal contained in the portion of the keyword; a similarity calculation operation of referring to a speaker model storage part in which a keyword of a user is previously registered as a speaker model created in the utterable unit, and obtaining a similarity between the feature value obtained in the feature value conversion operation and the speaker model; a speech content determination operation of determining information on a speech content by the plurality of times of voice inputs, based on the similarity obtained in the similarity calculation part; a keyword checking operation of determining whether or not the speech content of the plurality of times of voice inputs is capable of configuring an entire registered keyword, based on the information on the speech content determined in the speech content determination operation; and an authentication determination operation of determining whether to accept or reject authentication, based on a determination result by the keyword checking operation and the similarity obtained by the similarity calculation operation.
-
Specification