INDEXING MULTIMEDIA COMMUNICATIONS
Abstract
A network based platform uses face recognition, speech recognition, background change detection and key scene events to index multimedia communications. Before the multimedia communication begins, active participants register their speech and face models with a server. The process consists of creating a speech sample, capturing a sample image of the participant and storing the data in a database. The server provides an indexing function for the multimedia communication. During the multimedia communication, metadata including time stamping is retained along with the multimedia content. The time stamping information is used for synchronizing the multimedia elements. The multimedia communication is then processed through the server to identify the multimedia communication participants based on speaker and face recognition models. This allows the server to create an index table that becomes an index of the multimedia communication. In addition, through scene change detection and background recognition, certain backgrounds and key scene information can be used for indexing. Therefore, through this indexing apparatus and method, a specific participant can be recognized as speaking and the content that the participant discussed can also be used for indexing.
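The index table described in the abstract can be pictured with a minimal sketch. It assumes recognition has already run upstream and produced time-stamped segments labeled with a participant and a scene; the `Segment` type, its field names, and `build_index` are hypothetical and are not taken from the patent.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One time-stamped stretch of the communication (hypothetical structure)."""
    start: float       # seconds from the start of the communication
    end: float
    participant: str   # identity assumed to come from speaker/face recognition
    scene: str         # background or scene label, also assumed to be given


def build_index(segments):
    """Group segment time ranges by participant and by scene, in time order."""
    by_participant = defaultdict(list)
    by_scene = defaultdict(list)
    for seg in sorted(segments, key=lambda s: s.start):
        by_participant[seg.participant].append((seg.start, seg.end))
        by_scene[seg.scene].append((seg.start, seg.end))
    return {"participant": dict(by_participant), "scene": dict(by_scene)}


# Illustrative data only; names and times are made up.
segments = [
    Segment(12.0, 20.5, "bob", "office"),
    Segment(0.0, 12.0, "alice", "office"),
    Segment(20.5, 31.0, "alice", "whiteboard"),
]
index = build_index(segments)
```

Looking up `index["participant"]["alice"]` then yields the time ranges in which that participant was recognized, and the time stamps are what would let audio, video, and data elements be lined up against those ranges.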
162 Citations
21 Claims
1. A method for indexing a multimedia communication, comprising:
receiving the multimedia communication, the multimedia communication including a plurality of multimedia data packets;
processing the plurality of multimedia data packets to identify distinguishing features; and
indexing the plurality of multimedia data packets based on the identified distinguishing features, wherein the processing step comprises associating each of the plurality of multimedia data packets with one of a plurality of objects within the multimedia communication.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. An apparatus for indexing a multimedia communication, comprising:
a server that receives multimedia communications in multimedia data packets including audio, visual and data communications and identifies distinguishing features in the multimedia communication based on audio and video recognition and a source of the multimedia communications;
a header function module connected to the server, the header function module entering metadata in a header segment corresponding to the multimedia data packets received by the server, the metadata being related to the distinguishing features; and
a storage device that stores the multimedia data packets.
Dependent claims: 15, 16, 17, 18, 19
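As a rough illustration of the header function module and storage device in claim 14, the sketch below writes feature metadata into a per-packet header and then retrieves stored packets by feature. The class names, header keys, and feature strings are assumptions made for the example; the claim does not specify a header layout.

```python
from dataclasses import dataclass, field


@dataclass
class MediaPacket:
    """A multimedia data packet with a header segment (hypothetical layout)."""
    payload: bytes
    timestamp: float
    header: dict = field(default_factory=dict)


def enter_metadata(packet, features, source):
    """Record distinguishing features and the source in the packet header."""
    packet.header.update(
        {
            "timestamp": packet.timestamp,
            "source": source,
            "features": sorted(features),
        }
    )
    return packet


class PacketStore:
    """In-memory stand-in for the storage device."""

    def __init__(self):
        self._packets = []

    def store(self, packet):
        self._packets.append(packet)

    def find(self, feature):
        """Return stored packets whose header lists the given feature."""
        return [p for p in self._packets if feature in p.header.get("features", [])]


# Illustrative data only; feature labels are assumed to come from recognition.
store = PacketStore()
store.store(enter_metadata(MediaPacket(b"...", 0.0), {"speaker:alice"}, "room-1"))
store.store(enter_metadata(MediaPacket(b"...", 0.5), {"speaker:bob"}, "room-2"))
hits = store.find("speaker:alice")
```

Keeping the metadata in the header means a later search can filter on features without decoding the payload, which is one plausible reason for placing it there.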
20. A method of identifying participants to a multimedia communication, comprising:
comparing audio speech patterns for each participant to speech models;
comparing video face patterns for each participant to face models; and
determining an identity of a particular participant when both the audio speech patterns and the video face patterns match speech and face models for the particular participant.
Dependent claims: 21
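The both-must-match condition in claim 20 can be sketched as follows. The example assumes each comparison yields a similarity score per registered participant and that a "match" means the score reaches a threshold; the scores, the threshold value, and the tie-breaking rule are illustrative assumptions, not details from the claim.

```python
def identify(speech_scores, face_scores, threshold=0.8):
    """Return the participant whose speech AND face scores both match, else None.

    speech_scores / face_scores map participant name -> similarity in [0, 1].
    The threshold and the scoring scale are assumptions for this sketch.
    """
    matches = [
        name
        for name in speech_scores
        if name in face_scores
        and speech_scores[name] >= threshold
        and face_scores[name] >= threshold
    ]
    if not matches:
        return None
    # If several participants pass, prefer the one with the strongest weaker score.
    return max(matches, key=lambda n: min(speech_scores[n], face_scores[n]))


# Illustrative scores only.
who = identify({"alice": 0.9, "bob": 0.95}, {"alice": 0.85, "bob": 0.4})
nobody = identify({"alice": 0.9}, {"alice": 0.5})
```

Here `bob` has the higher speech score but fails the face comparison, so only `alice` satisfies both conditions; a strong result in one modality alone is not enough.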
Specification