Voice messaging system that organizes voice messages based on detected emotion
First Claim
Patent Images
1. A method for managing voice messages based on emotion characteristics of the voice messages comprising the steps of:
- (a) receiving a plurality of voice messages transferred over a telecommunication network, wherein the voice messages include at least one voice signal;
(b) storing the voice messages on a storage medium;
(c) extracting at least one feature from the voice signal selected from a group of features consisting of a maximum value of a fundamental frequency, a standard deviation of the fundamental frequency, a range of the fundamental frequency, a mean of the fundamental frequency, a mean of a bandwidth of a first formant, a mean of a bandwidth of a second formant, a standard deviation of energy, a speaking rate, a slope of the fundamental frequency, a maximum value of the first formant, a maximum value of the energy, a range of the energy, a range of the second formant, and a range of the first formant;
(d) determining an emotion associated with the voice signals of the voice messages based on said feature of the voice signals;
(e) organizing the voice messages based on the determined emotion; and
(f) allowing access to the organized voice messages.
6 Assignments
0 Petitions
Accused Products
Abstract
A system, method and article of manufacture are provided for managing voice messages based on emotion characteristics of the voice messages. First, a plurality of voice messages that are transferred over a telecommunication network are received. Thereafter, such voice messages are stored on a storage medium. An emotion associated with voice signals of the voice messages is then determined. The voice messages are organized based on the determined emotion. Access to the organized voice messages is then permitted.
172 Citations
22 Claims
-
1. A method for managing voice messages based on emotion characteristics of the voice messages comprising the steps of:
-
(a) receiving a plurality of voice messages transferred over a telecommunication network, wherein the voice messages include at least one voice signal;
(b) storing the voice messages on a storage medium;
(c) extracting at least one feature from the voice signal selected from a group of features consisting of a maximum value of a fundamental frequency, a standard deviation of the fundamental frequency, a range of the fundamental frequency, a mean of the fundamental frequency, a mean of a bandwidth of a first formant, a mean of a bandwidth of a second formant, a standard deviation of energy, a speaking rate, a slope of the fundamental frequency, a maximum value of the first formant, a maximum value of the energy, a range of the energy, a range of the second formant, and a range of the first formant;
(d) determining an emotion associated with the voice signals of the voice messages based on said feature of the voice signals;
(e) organizing the voice messages based on the determined emotion; and
(f) allowing access to the organized voice messages. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer program embodied on a computer readable medium for managing voice messages based on emotion characteristics of the voice messages comprising:
-
(a) a code segment that receives a plurality of voice messages transferred over a telecommunication network, wherein the voice messages include at least one voice signal;
(b) a code segment that stores the voice messages on a storage medium;
(c) a code segment that extracts at least one feature from the voice signal selected from a group of features consisting of a maximum value of a fundamental frequency, a standard deviation of the fundamental frequency, a range of the fundamental frequency, a mean of the fundamental frequency, a mean of a bandwidth of a first formant, a mean of a bandwidth of a second formant, a standard deviation of energy, a speaking rate, a slope of the fundamental frequency, a maximum value of the first formant, a maximum value of the energy, a range of the energy, a range of the second formant, and a range of the first formant;
(d) a code segment that determines an emotion associated with voice signals of the voice messages;
(e) a code segment that organizes the voice messages based on the determined emotion; and
(f) a code segment that allows access to the organized voice messages. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A system for managing voice messages based on emotion characteristics of the voice messages comprising:
-
(a) logic that receives a plurality of voice messages transferred over a telecommunication network, wherein the voice messages include at least one voice signal;
(b) logic that stores the voice messages on a storage medium;
(c) logic that extracts at least one feature from the voice signal selected from a group of features consisting of a maximum value of a fundamental frequency, a standard deviation of the fundamental frequency, a range of the fundamental frequency, a mean of the fundamental frequency, a mean of a bandwidth of a first formant, a mean of a bandwidth of a second formant, a standard deviation of energy, a speaking rate, a slope of the fundamental frequency, a maximum value of the first formant, a maximum value of the energy, a range of the energy, a range of the second formant, and a range of the first formant;
(d) logic that determines an emotion associated with voice signals of the voice messages;
(e) logic that organizes the voice messages based on the determined emotion; and
(f) logic that allows access to the organized voice messages. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A method for managing voice messages based on emotion characteristics of the voice messages, comprising the steps of:
-
(a) receiving a plurality of voice messages transferred over a telecommunication network;
(b) extracting at least one segment of audio frequency from each said voice message;
(c) extracting at least one audio feature from the segment of audio frequency, wherein the at least one audio feature is selected from a group of audio features consisting of a maximum value of a fundamental frequency, a standard deviation of the fundamental frequency, a range of the fundamental frequency, a mean of the fundamental frequency, a mean of a bandwidth of a first formant, a mean of a bandwidth of a second formant, a standard deviation of energy, a speaking rate, a slope of the fundamental frequency, a maximum value of the first formant, a maximum value of the energy, a range of the energy, a range of the second formant, and a range of the first formant; and (d) determining an emotion associated with said voice message using the at least one audio feature of the voice message. - View Dependent Claims (17, 18, 19, 20, 21, 22)
-
Specification