Multimedia content description framework
First Claim
1. A method for describing a multimedia content source including at least one terminal object, said at least one terminal object including one or more modalities, each of said one or more modalities having one or more fidelities associated therewith, said method comprising the steps of:
- generating a distribution of one or more modalities and fidelities, said distribution corresponding to an audience of said multimedia content source;
grouping said multimedia content source into one or more target modalities and target fidelities according to said distribution;
generating a modality-fidelity dependency representation for a terminal object in said multimedia content source, said dependency representation including a description scheme comprising predetermined transformation rules for describing at least one of a relationship between two modalities and a relationship between two fidelities;
decomposing the multimedia content source according to said description scheme to create an InfoPyramid representation of each modality;
transforming said multimedia content source according to said modality-fidelity transformation rules;
generating annotations for each object in said multimedia content source; and
repeating said decomposing step, said transforming step and said step of generating annotations until every terminal object in said multimedia content source has been processed.
1 Assignment
0 Petitions
Accused Products
Abstract
A framework is provided for describing multimedia content and a system in which a plurality of multimedia storage devices employing the content description methods of the present invention can interoperate. In accordance with one form of the present invention, the content description framework is a description scheme (DS) for describing streams or aggregations of multimedia objects, which may comprise audio, images, video, text, time series, and various other modalities. This description scheme can accommodate an essentially limitless number of descriptors in terms of features, semantics or metadata, and facilitate content-based search, index, and retrieval, among other capabilities, for both streamed or aggregated multimedia objects.
611 Citations
2 Claims
-
1. A method for describing a multimedia content source including at least one terminal object, said at least one terminal object including one or more modalities, each of said one or more modalities having one or more fidelities associated therewith, said method comprising the steps of:
-
generating a distribution of one or more modalities and fidelities, said distribution corresponding to an audience of said multimedia content source;
grouping said multimedia content source into one or more target modalities and target fidelities according to said distribution;
generating a modality-fidelity dependency representation for a terminal object in said multimedia content source, said dependency representation including a description scheme comprising predetermined transformation rules for describing at least one of a relationship between two modalities and a relationship between two fidelities;
decomposing the multimedia content source according to said description scheme to create an InfoPyramid representation of each modality;
transforming said multimedia content source according to said modality-fidelity transformation rules;
generating annotations for each object in said multimedia content source; and
repeating said decomposing step, said transforming step and said step of generating annotations until every terminal object in said multimedia content source has been processed.
-
-
2. A method for creating a multimedia content source including at least one terminal object, said at least one terminal object including one or more modalities, each of said one or more modalities having one or more fidelities associated therewith, said method comprising the steps of:
-
generating a distribution of one or more modalities and fidelities, said distribution corresponding to an audience of said multimedia content source;
selecting one or more source modalities and associated source fidelities, said source modalities and source fidelities being selected according to a union of said distribution;
generating a modality-fidelity dependency representation for a terminal object in said multimedia content source, said dependency representation including a description scheme comprising predetermined transformation rules for describing at least one of a relationship between two modalities and a relationship between two fidelities;
synthesizing said multimedia content source according to the description scheme and including predetermined intra-object and inter-object relationships;
transforming said multimedia content source according to said modality-fidelity transformation rules;
generating-annotations for each object in said multimedia content source; and
repeating said synthesizing step, said transforming step and said step of generating annotations until every terminal object in said multimedia content source has been processed.
-
Specification