Bimodal emotion recognition method and system utilizing a support vector machine
First Claim
1. A method used for emotion recognition comprising the steps of:
- (a) establishing hyperplanes, further comprising the steps of;
(a1) establishing a plurality of training samples; and
(a2) using a means of support vector machine (SVM) to establish the hyperplanes basing upon the plurality of training samples(b) inputting at least two unknown data to be identified while enabling each unknown data to correspond to one of the hyperplanes whereas there are two emotion category being defined in the one of the hyperplanes, and each unknown data being a data selected from an image data and a vocal data;
(c) respectively performing a calculation process, using a computer, upon the at least two unknown data for assigning each with a weight, the calculation process further comprising the steps of;
(c1) basing upon the plurality of training samples used for establishing the one of the hyperplanes to acquire a standard deviation and a mean distance between the plurality of training samples and the one of the hyperplanes;
(c2) respectively calculating feature distances between the one of the hyperplanes and the at least two unknown data to be identified; and
(c3) obtaining the weights of the at least two unknown data by performing a mathematic operation upon the feature distances, the plurality of training samples, the mean distance and the standard deviation, the mathematic operation further comprising the steps of;
obtaining differences between the feature distances and the standard deviation; and
normalizing the differences for obtaining the weights, wherein weights of facial image ZFi and weights of vocal data ZAi are obtained wherein
1 Assignment
0 Petitions
Accused Products
Abstract
A method is disclosed in the present disclosure for recognizing emotion by setting different weights to at least of two kinds of unknown information, such as image and audio information, based on their recognition reliability respectively. The weights are determined by the distance between test data and hyperplane and the standard deviation of training data and normalized by the mean distance between training data and hyperplane, representing the classification reliability of different information. The method recognizes the emotion according to the unidentified information having higher weights while the at least two kinds of unidentified information have different result classified by the hyperplane and correcting wrong classification result of the other unidentified information so as to raise the accuracy while emotion recognition. Meanwhile, the present disclosure also provides a learning step with a characteristic of higher learning speed through an algorithm of iteration.
-
Citations
28 Claims
-
1. A method used for emotion recognition comprising the steps of:
-
(a) establishing hyperplanes, further comprising the steps of; (a1) establishing a plurality of training samples; and (a2) using a means of support vector machine (SVM) to establish the hyperplanes basing upon the plurality of training samples (b) inputting at least two unknown data to be identified while enabling each unknown data to correspond to one of the hyperplanes whereas there are two emotion category being defined in the one of the hyperplanes, and each unknown data being a data selected from an image data and a vocal data; (c) respectively performing a calculation process, using a computer, upon the at least two unknown data for assigning each with a weight, the calculation process further comprising the steps of; (c1) basing upon the plurality of training samples used for establishing the one of the hyperplanes to acquire a standard deviation and a mean distance between the plurality of training samples and the one of the hyperplanes; (c2) respectively calculating feature distances between the one of the hyperplanes and the at least two unknown data to be identified; and (c3) obtaining the weights of the at least two unknown data by performing a mathematic operation upon the feature distances, the plurality of training samples, the mean distance and the standard deviation, the mathematic operation further comprising the steps of; obtaining differences between the feature distances and the standard deviation; and normalizing the differences for obtaining the weights, wherein weights of facial image ZFi and weights of vocal data ZAi are obtained wherein - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A method used for emotion recognition, comprising the steps of:
-
(a) providing at least two training samples, each of the at least two training samples being defined in a specified characteristic space established by performing a transformation process upon the each of the at least two training samples with respect to its original space; (b) establishing at least two corresponding hyperplanes in the specified characteristic spaces of the at least two training samples, each of the at least two hyperplanes capable of defining two emotion categories; (c) inputting at least two unknown data to be identified in correspondence to the at least two hyperplanes, and transforming each unknown data to its corresponding characteristic space by the use of the transformation process while enabling each unknown data to correspond to one emotion category selected from the two emotion categories of the each of the at least two hyperplanes corresponding thereto, and each unknown data being a data selected from an image data and a vocal data; (d) respectively performing a calculation process, using a computer, upon the two unknown data for assigning each with a weight; (e) comparing the assigned weight of the two unknown data while using the comparison as base for selecting one emotion category out of a plurality of emotion categories as an emotion recognition result; and (f) performing a learning process with respect to a new unknown data for updating the each of the at least two hyperplanes, and further comprising the steps of; (f1) acquiring a parameter of the each of the at least two hyperplanes to be updated; (f2) transforming the new unknown data into its corresponding characteristic space by the use of the transformation process; and (f3) using feature values detected from the unknown data and the parameter to update the each of the at least two hyperplanes through an algorithm of iteration. (f4) when updating the each of the at least two hyperplanes, a critical set is determined by using a fixed number of samples close to the each of the at least two hyperplanes, and the critical set is defined by, Xi=arg min |w·
Xi+b|, wherein the Xi is a number of the samples, the W represents a normal vector of the each of the at least two hyperplanes, and the b represents an intercept. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
Specification