METHOD AND APPARATUS FOR IDENTIFYING SPEAKERS AND EMPHASIZING SELECTED OBJECTS IN PICTURE AND VIDEO MESSAGES
First Claim
1. A method for emphasizing selected objects in digital data of at least one of pictures and video that is stored in digital messages, the messages including sender addresses and being stored in a memory system of a digital messaging system for a plurality of users, the method comprising:
- identifying picture regions including at least one of faces and persons in the digital data of the messages on the digital messaging system having the same sender address;
determining sender-relevant picture regions in the identified picture regions that represent a sender of the message based on at least one of;
a) a comparison with a reference picture of the sender stored on the memory system;
b) a comparison of speech data in the digital data with reference speech data using at least one of speaker recognition methods, speaker verification methods and speaker identification methods, taking into account picture data and speech data in the message; and
c) a frequency of occurrence of the identified picture regions in the messages that have the same sender address; and
modifying the digital data of the messages so as to emphasize the sender-relevant picture region.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for emphasizing selected objects in digital data of at least one of pictures and video that is stored in digital messages, the messages including sender addresses and being stored in a memory system of a digital messaging system for a plurality of users, includes the step of identifying picture regions including at least one of faces and persons in the digital data of the messages on the digital messaging system having the same sender address. Sender-relevant picture regions in the identified picture regions that represent a sender of the message are determined based on at least one of: a) a comparison with a reference picture of the sender stored on the memory system; b) a comparison of speech data in the digital data with reference speech data using at least one of speaker recognition methods, speaker verification methods and speaker identification methods, taking into account picture data and speech data in the message; and c) a frequency of occurrence of the identified picture regions in the messages that have the same sender address. The digital data of the messages is then modified so as to emphasize the sender-relevant picture region.
-
Citations
33 Claims
-
1. A method for emphasizing selected objects in digital data of at least one of pictures and video that is stored in digital messages, the messages including sender addresses and being stored in a memory system of a digital messaging system for a plurality of users, the method comprising:
-
identifying picture regions including at least one of faces and persons in the digital data of the messages on the digital messaging system having the same sender address; determining sender-relevant picture regions in the identified picture regions that represent a sender of the message based on at least one of; a) a comparison with a reference picture of the sender stored on the memory system; b) a comparison of speech data in the digital data with reference speech data using at least one of speaker recognition methods, speaker verification methods and speaker identification methods, taking into account picture data and speech data in the message; and c) a frequency of occurrence of the identified picture regions in the messages that have the same sender address; and modifying the digital data of the messages so as to emphasize the sender-relevant picture region. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A digital messaging system having a plurality of person-related messages stored on a memory system, the messages being provided with sender addresses and at least a portion of the messages including digital data of at least one of pictures and video, the digital messaging system comprising:
-
an identifying unit configured to identify picture regions having at least one of faces and persons in the digital data of messages that have the same sender address; at least one calculation unit configured to determine, from the identified picture regions, sender-relevant picture regions that represent a sender of the message based on at least one of; a) reference pictures that are stored on the memory system, b) a comparison of speech data in the digital data with reference speech data using speaker recognition methods and taking into account speech data in the message, and c) a frequency of occurrence of sender-relevant picture regions in the messages that have the same sender address; and a modification unit configured to modify the digital data of the messages that are received by emphasizing the sender-relevant picture regions. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. A method for controlling a digital messaging system in which a plurality of person-related messages are stored on a memory system, the messages being provided with sender addresses and at least a portion of the messages further comprising digital data of at least one of pictures and video, the method comprising:
-
identifying picture regions including at least one of faces and persons in the digital data of the messages on the digital messaging system having the same sender address; determining sender-relevant picture regions in the identified picture regions that represent a sender of the message based on at least one of; a) a comparison with a reference picture of the sender stored on the memory system, b) a comparison of speech data in the digital data with reference speech data using at least one of speaker recognition methods, speaker verification methods and speaker identification methods, taking into account picture data and speech data in the message and c) a frequency of occurrence of the identified picture regions in the messages that have the same sender address; and generating a digital, graphic model of a speaker using the sender-relevant picture region so as to reproduce content of the message through animation of mouth regions in the reproduction of the message. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25)
-
-
26. A digital messaging system having a plurality of person-related messages are stored on a memory system, the messages being provided with sender addresses and at least a portion of the messages including digital data of at least one of pictures and video, the digital messaging system comprising:
-
an identifying unit configured to identify picture regions having at least one of faces and persons in the digital data of messages that have the same sender address; at least one calculation unit configured to determine, from the identified picture regions, sender-relevant picture regions that represent a sender of the message based on at least one of; a) reference pictures that are stored on the memory system, b) a comparison of speech data in the digital data with reference speech data using speaker recognition methods and taking into account speech data in the message, and c) a frequency of occurrence of sender-relevant picture regions in the messages that have the same sender address; and a generation unit configured to generate a digital, graphic model of a speaker using the sender-relevant picture region so as to reproduce content of the message through animation of mouth regions. - View Dependent Claims (27, 28, 29, 30, 31, 32, 33)
-
Specification