Emotion recognition in video conferencing
First Claim
1. A computer-implemented method for video conferencing, the method comprising:
receiving a video including a sequence of images corresponding to a videoconference between first and second users;
detecting at least one object of interest in one or more of the images;
locating feature reference points of the at least one object of interest;
determining that at least one deformation between two or more of the feature reference points refers to a facial emotion selected from a plurality of reference facial emotions;
determining that the facial emotion is a negative facial emotion; and
in response to determining that the facial emotion is the negative facial emotion, generating a communication for transmission to a non-participant of the videoconference between the first and second users, the communication bearing data associated with the negative facial emotion.
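The last two steps of the claim (deciding that a recognized emotion is negative, then generating a communication for someone outside the call) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the label set `NEGATIVE_EMOTIONS`, the function `build_communication`, and the JSON payload fields do not come from the patent, which does not prescribe a message format.

```python
import json
from typing import Optional

# Assumption: illustrative labels only. The source mentions "angry, annoyed,
# or distressed" customers as examples but does not fix a label set.
NEGATIVE_EMOTIONS = {"angry", "annoyed", "distressed"}


def build_communication(emotion: str, conference_id: str, recipient: str) -> Optional[str]:
    """Return a JSON message for a non-participant, or None if the emotion is not negative.

    All parameter names and payload fields are hypothetical.
    """
    if emotion not in NEGATIVE_EMOTIONS:
        # No communication is generated for neutral or positive emotions.
        return None
    payload = {
        "conference_id": conference_id,
        "recipient": recipient,  # e.g. a supervisor who is not on the call
        "emotion": emotion,      # data associated with the negative emotion
    }
    return json.dumps(payload, sort_keys=True)
```

In this sketch the communication is produced only when the negative-emotion check passes, mirroring the "in response to determining" wording of the claim; how the message is actually transmitted is left open.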
Abstract
Methods and systems for videoconferencing include recognition of emotions related to one videoconference participant, such as a customer. This ultimately enables another videoconference participant, such as a service provider or supervisor, to handle angry, annoyed, or distressed customers. One example method includes the steps of receiving a video that includes a sequence of images, detecting at least one object of interest (e.g., a face), locating feature reference points of the at least one object of interest, aligning a virtual face mesh to the at least one object of interest based on the feature reference points, finding over the sequence of images at least one deformation of the virtual face mesh that reflects face mimics, determining that the at least one deformation refers to a facial emotion selected from a plurality of reference facial emotions, and generating a communication bearing data associated with the facial emotion.
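The step of determining that a deformation "refers to" one of several reference facial emotions can be sketched as a nearest-reference lookup. This is a simplified illustration under stated assumptions, not the method of the patent: it skips the virtual face mesh alignment and compares raw 2D displacements of feature reference points instead, and the function names `deformation` and `closest_emotion`, the Euclidean distance measure, and all sample coordinates are hypothetical.

```python
import math
from typing import Dict, List, Tuple

Point = Tuple[float, float]


def deformation(neutral: List[Point], current: List[Point]) -> List[float]:
    """Flatten the per-point displacement between neutral and current feature points."""
    if len(neutral) != len(current):
        raise ValueError("both frames must have the same number of feature points")
    out: List[float] = []
    for (nx, ny), (cx, cy) in zip(neutral, current):
        out.extend((cx - nx, cy - ny))
    return out


def closest_emotion(observed: List[float], references: Dict[str, List[float]]) -> str:
    """Pick the reference emotion whose deformation vector is nearest (Euclidean distance)."""
    return min(references, key=lambda name: math.dist(observed, references[name]))


# Hypothetical data: two mouth corners and one brow point.
neutral = [(0.0, 0.0), (2.0, 0.0), (1.0, 2.0)]
current = [(0.0, -0.4), (2.0, -0.4), (1.0, 1.7)]  # corners drop, brow lowers

# Hypothetical reference deformations, one vector per reference emotion.
references = {
    "neutral": [0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
    "happy": [0.0, 0.4, 0.0, 0.4, 0.0, 0.0],
    "angry": [0.0, -0.4, 0.0, -0.4, 0.0, -0.3],
}

label = closest_emotion(deformation(neutral, current), references)
```

With the sample data above, `label` comes out as `"angry"`, since the observed displacement nearly coincides with that reference vector. A real system would likely normalize for head pose and scale before comparing, which this sketch does not attempt.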
20 Claims
1. A computer-implemented method for video conferencing, the method comprising:
receiving a video including a sequence of images corresponding to a videoconference between first and second users;
detecting at least one object of interest in one or more of the images;
locating feature reference points of the at least one object of interest;
determining that at least one deformation between two or more of the feature reference points refers to a facial emotion selected from a plurality of reference facial emotions;
determining that the facial emotion is a negative facial emotion; and
in response to determining that the facial emotion is the negative facial emotion, generating a communication for transmission to a non-participant of the videoconference between the first and second users, the communication bearing data associated with the negative facial emotion.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A system, comprising:
one or more processors; and
a non-transitory processor-readable medium coupled to the one or more processors, the non-transitory processor-readable medium comprising processor-executable instructions that, when executed by one or more processors of a machine, cause the machine to perform operations comprising:
receiving a video including a sequence of images corresponding to a videoconference between first and second users;
detecting at least one object of interest in one or more of the images;
locating feature reference points of the at least one object of interest;
determining that at least one deformation between two or more of the feature reference points refers to a facial emotion selected from a plurality of reference facial emotions;
determining that the facial emotion is a negative facial emotion; and
in response to determining that the facial emotion is the negative facial emotion, generating a communication for transmission to a non-participant of the videoconference between the first and second users, the communication bearing data associated with the negative facial emotion.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A non-transitory processor-readable medium comprising processor-executable instructions that, when executed by one or more processors of a machine, cause the machine to perform operations comprising:
receiving a video including a sequence of images corresponding to a videoconference between first and second users;
detecting at least one object of interest in one or more of the images;
locating feature reference points of the at least one object of interest;
determining that at least one deformation between two or more of the feature reference points refers to a facial emotion selected from a plurality of reference facial emotions;
determining that the facial emotion is a negative facial emotion; and
in response to determining that the facial emotion is the negative facial emotion, generating a communication for transmission to a non-participant of the videoconference between the first and second users, the communication bearing data associated with the negative facial emotion.
Dependent claims: 16, 17, 18, 19, 20
Specification