Apparatus and methods for semantic representation and retrieval of multimedia content
First Claim
1. A method of representing multimedia content, comprising:
- performing feature extraction on one or more modalities of the multimedia content to extract one or more features of the multimedia content;
identifying one or more generic cues based on the one or more extracted features;
identifying a semantic based on a combination of the one or more generic cues; and
generating a model for the multimedia content based on the identified semantic.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus and method for analyzing multimedia content to identify the presence of audio, visual and textual cues that together correspond to one or more high-level semantics are provided. The apparatus and method make use of one or more analysis models that are trained to analyze audio, visual and textual portions of multimedia content to generate scores associated with the audio, visual and textual portions with respect to various high-level semantic concepts. These scores are used to generate a vector of scores. The apparatus is trained with regard to relationships between audio, visual and textual scores to thereby take the vector of scores generated for the multimedia content and classify the multimedia content into one or more high-level semantic concepts. Based on the scores for the various audio, video and textual portions of the multimedia content, a level of certainty regarding the high-level semantic concepts may be generated. These high-level semantic concepts are then used to generate one or more labels for the multimedia content that may be used to retrieve the multimedia content using a conceptual search engine. These semantic concept labels and their associated certainty levels may be stored in a file, associated with the multimedia content, for use in retrieving the multimedia content using the conceptual search engine.
94 Citations
27 Claims
-
1. A method of representing multimedia content, comprising:
-
performing feature extraction on one or more modalities of the multimedia content to extract one or more features of the multimedia content;
identifying one or more generic cues based on the one or more extracted features;
identifying a semantic based on a combination of the one or more generic cues; and
generating a model for the multimedia content based on the identified semantic. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program product in a computer readable medium for representing multimedia content, comprising:
-
first instructions for performing feature extraction on one or more modalities of the multimedia content to extract one or more features of the multimedia content;
second instructions for identifying one or more generic cues based on the one or more extracted features;
third instructions for identifying a semantic based on a combination of the one or more generic cues; and
fourth instructions for generating a model for the multimedia content based on the identified semantic. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. An apparatus for representing multimedia content, comprising:
-
means for performing feature extraction on one or more modalities of the multimedia content to extract one or more features of the multimedia content;
means for identifying one or more generic cues based on the one or more extracted features;
means for identifying a semantic based on a combination of the one or more generic cues; and
means for generating a model for the multimedia content based on the identified semantic.
-
-
18. A method of searching for multimedia content, comprising:
-
providing an interface for entering a search request, wherein the interface includes a field for entering a search term and a field for designating a modality corresponding to the search term;
receiving a search request from a client device via the interface, wherein the search request includes a search term and a corresponding modality;
searching a data structure of multimedia content models based on the identified search term and corresponding modality; and
returning results of searching the data structure to the client device. - View Dependent Claims (19, 20, 21, 22)
-
-
23. A computer program product in a computer readable medium for searching for multimedia content, comprising:
-
first instructions for providing an interface for entering a search request, wherein the interface includes a field for entering a search term and a field for designating a modality corresponding to the search term;
second instructions for receiving a search request from a client device via the interface, wherein the search request includes a search term and a corresponding modality;
third instructions for searching a data structure of multimedia content models based on the identified search term and corresponding modality; and
fourth instructions for returning results of searching the data structure to the client device. - View Dependent Claims (24, 25, 26, 27)
-
Specification