SPEAKER IDENTIFICATION METHOD AND SPEAKER IDENTIFICATION DEVICE
First Claim
1. A speaker identification method that includes:
- learning mode processing in which a first database, in which a plurality of unspecified speakers and a plurality of unspecified speaker models obtained by modeling features of voices of the plurality of unspecified speakers are associated and stored, is used to create a second database, in which first speakers who are not stored in the first database and a plurality of the unspecified speaker models are associated and stored; and
identification mode processing in which the second database is used to identify a second speaker,wherein, in the learning mode processing,voice signal of each of the first speakers is acquired,first similarity degrees between a feature value in the acquired voice signal of each of the first speakers and each feature value in the plurality of unspecified speaker models stored in the first database are calculated,a plurality of the unspecified speaker models for which the calculated first similarity degrees are equal to or greater than a prescribed value are specified,and each of the first speakers and the specified plurality of unspecified speaker models are associated and stored in the second database,and in the identification mode processing,a voice signal of the second speaker is acquired,a plurality of second similarity degrees between a feature value in the acquired voice signal of the second speaker and each feature value in the plurality of unspecified speaker models associated with the first speakers and stored in the second database are calculated for each of the first speakers,and one of the first speakers stored in the second database who corresponds to the second speaker is specified based on the calculated plurality of second similarity degrees.
1 Assignment
0 Petitions
Accused Products
Abstract
A first similarity degree calculation unit calculates first similarity degrees between a feature value in voice signal of each of first speakers and each feature value in a plurality of unspecified speaker models of a plurality of unspecified speakers, a model specifying unit specifies a plurality of the unspecified speaker models for which the first similarity degrees are equal to or greater than a prescribed value, a second speaker model storage unit associates and stores each of the first speakers and the specified unspecified speaker models, a second similarity degree calculation unit calculates, for each of the first speakers, a plurality of second similarity degrees between a feature value in a voice signal of a second speaker and each feature values in the unspecified speaker models associated with the first speakers and stored in the second speaker model storage unit, and a speaker identification unit that specifies the second speaker based on the second similarity degrees.
45 Citations
16 Claims
-
1. A speaker identification method that includes:
-
learning mode processing in which a first database, in which a plurality of unspecified speakers and a plurality of unspecified speaker models obtained by modeling features of voices of the plurality of unspecified speakers are associated and stored, is used to create a second database, in which first speakers who are not stored in the first database and a plurality of the unspecified speaker models are associated and stored; and identification mode processing in which the second database is used to identify a second speaker, wherein, in the learning mode processing, voice signal of each of the first speakers is acquired, first similarity degrees between a feature value in the acquired voice signal of each of the first speakers and each feature value in the plurality of unspecified speaker models stored in the first database are calculated, a plurality of the unspecified speaker models for which the calculated first similarity degrees are equal to or greater than a prescribed value are specified, and each of the first speakers and the specified plurality of unspecified speaker models are associated and stored in the second database, and in the identification mode processing, a voice signal of the second speaker is acquired, a plurality of second similarity degrees between a feature value in the acquired voice signal of the second speaker and each feature value in the plurality of unspecified speaker models associated with the first speakers and stored in the second database are calculated for each of the first speakers, and one of the first speakers stored in the second database who corresponds to the second speaker is specified based on the calculated plurality of second similarity degrees. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A speaker identification device comprising:
-
a learning mode processing unit that uses a first database, in which a plurality of unspecified speakers and a plurality of unspecified speaker models obtained by modeling features of voices of the plurality of unspecified speakers are associated and stored, to create a second database, in which first speakers who are not stored in the first database and a plurality of the unspecified speaker models are associated and stored; and an identification mode processing unit that uses the second database to identify a second speaker, wherein, the learning mode processing unit includes a first voice acquirer that acquires voice signal of each of the first speakers, a first similarity degree calculator that calculates first similarity degrees between a feature value in the voice signal of each of the first speakers acquired by the first voice acquirer and each feature value in the plurality of unspecified speaker models stored in the first database, a first specifier that specifies a plurality of the unspecified speaker models for which the first similarity degrees calculated by the first similarity degree calculator are equal to or greater than a prescribed value, and a storage processing unit that associates and stores each of the first speakers and the plurality of unspecified speaker models specified by the first specifier in the second database, and the identification mode processing unit includes a second voice acquirer that acquires a voice signal of the second speaker, a second similarity degree calculator that calculates, for each of the first speakers, a plurality of second similarity degrees between a feature value in the voice signal of the second speaker acquired by the second voice acquirer and each feature value in the plurality of unspecified speaker models associated with the first speakers and stored in the second database, and a second specifier that, based on the plurality of second similarity degrees calculated by the second similarity degree calculator, specifies one of the first speakers stored in the second database who corresponds to the second speaker. - View Dependent Claims (14)
-
-
15. A speaker identification method that includes:
-
learning mode processing in which a first database, in which a plurality of unspecified speakers and a plurality of unspecified speaker models obtained by modeling features of voices of the plurality of unspecified speakers are associated and stored, is used to create a second database, in which first speakers who are not stored in the first database and a plurality of the unspecified speaker models are associated and stored; and identification mode processing in which the second database is used to identify a second speaker, wherein, in the learning mode processing, voice signal of each of the first speakers is acquired, first similarity degrees between a feature value in the acquired voice signal of each of the first speakers and each feature value in the plurality of unspecified speaker models of a plurality of the unspecified speakers who are different from the first speakers and are stored in the first database are calculated, a plurality of the unspecified speaker models for which the calculated first similarity degrees are equal to or greater than a prescribed value are specified, speaker model corresponding to each of the first speakers is newly created based on the specified plurality of the unspecified speaker models and the acquired voice signals of the first speakers, and the created speaker model is associated with the first speakers and stored in the second database, and in the identification mode processing, a voice signal of the second speaker is acquired, a plurality of second similarity degrees between a feature value in the acquired voice signal of the second speaker and feature values in the speaker models associated with the first speakers and stored in the second database are calculated for each of the first speakers, and one of the first speakers stored in the second database who corresponds to the second speaker is specified based on the calculated plurality of second similarity degrees. - View Dependent Claims (16)
-
Specification