Voice registration method and system, and voice recognition method and system based on voice registration method and system
First Claim
1. A voice registration method for voice recognition, comprising the steps of:
- analyzing a spectrum of a sound signal inputted from the outside;
extracting predetermined language units for a speaker recognition from a voice signal in the sound signal;
measuring the loudness of each language unit;
collecting voice data on registered speakers including loudness data of the plurality of background speakers as a reference onto voice database;
determining whether the loudness of each language unit is within a predetermined loudness range based on the voice data base;
learning each language unit by using a multi-layer perceptron in the case that at least a predetermined number of language units are within the predetermined loudness range; and
storing data on the learned language unit as data for recognizing the speaker.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed is a voice registration method for voice recognition, comprising the steps of analyzing a spectrum of a sound signal inputted from the outside; extracting predetermined language units for a speaker recognition from a voice signal in the sound signal; measuring the loudness of each language unit; collecting voice data on registered (background) speakers including loudness data of the plurality of background speakers as a reference onto voice database; determining whether the loudness of each language unit is within a predetermined loudness range based on the voice data base; learning each language unit by using a multi-layer perceptron in the case that at least a predetermined number of language units are within the predetermined loudness range; and storing data on the learned language unit as data for recognizing the speaker. With this configuration, loudness of a speaker is considered at learning for registering his/her voice and at verifying a speaker.
-
Citations
33 Claims
-
1. A voice registration method for voice recognition, comprising the steps of:
-
analyzing a spectrum of a sound signal inputted from the outside; extracting predetermined language units for a speaker recognition from a voice signal in the sound signal; measuring the loudness of each language unit; collecting voice data on registered speakers including loudness data of the plurality of background speakers as a reference onto voice database; determining whether the loudness of each language unit is within a predetermined loudness range based on the voice data base; learning each language unit by using a multi-layer perceptron in the case that at least a predetermined number of language units are within the predetermined loudness range; and storing data on the learned language unit as data for recognizing the speaker. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A speaker recognition method for recognizing whether a speaker is a registered speaker, comprising the steps of:
-
analyzing a spectrum of a sound signal inputted from the outside; extracting predetermined language units for a speaker recognition from a voice signal in the sound signal; measuring the loudness of each language unit; determining whether the loudness of each language unit is within a predetermined loudness range; calculating a speaker score by calculating the probability that the language unit will belong to the speaker through a multi-layer perceptron, and by averaging the probability, in the case that at least a predetermined number of language units are within the predetermined loudness range; and verifying that the speaker is registered when the speaker score is beyond a threshold value by comparing the calculated speaker score with the predetermined threshold value which is a predetermined minimum speaker score for verifying the registered speaker. - View Dependent Claims (19, 20)
-
-
21. A voice recognition system for voice recognition, comprising:
-
a voice analyzer analyzing a spectrum of a sound signal inputted from the outside; a voice extractor extracting a voice signal from the sound signal and extracting predetermined language units for recognizing a speaker from the voice signal; a voice database storing therein background speaker voice data including the loudness of a plurality of reference background speakers; a loudness determiner determining the loudness of each language unit, and determining whether the loudness of each language unit is within a predetermined loudness range on the basis of the voice database; a learner learning the language unit in the case that at least a predetermined number of additional ones of the language units are within the predetermined loudness range; a memory storing data on the learned language units as recognition data for the speaker; and a controller controlling operations of the voice analyzer, the voice extractor, the loudness determiner and the learner when a voice is inputted, and storing the recognition data for the speaker in the memory. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
-
-
31. A speaker recognition system for recognizing whether a speaker is a registered speaker, comprising:
-
a voice analyzer analyzing a spectrum of a voice signal inputted from external sound signals; a voice extractor picking out voice signals among inputted sound and abstracting predetermined language units for recognizing the speaker from the voice signals; a loudness determiner determining the loudness of each language unit, and determining whether the loudness of each language unit is within a predetermined loudness range; a speaker score calculator calculating a speaker score by calculating probability of that the language unit will belong to the speaker, and by averaging the probability; and a controller controlling the speaker score calculator to calculate the speaker score in the case that at least the predetermined number more among all language units is within the predetermined loudness range, and ascertaining that the speaker has been registered when the speaker score is beyond a threshold value by comparing the calculated speaker score with the predetermined threshold value which is a predetermined minimum speaker score for ascertaining the registered speaker. - View Dependent Claims (32, 33)
-
Specification