MULTIMEDIA METADATA ANALYSIS USING INVERTED INDEX WITH TEMPORAL AND SEGMENT IDENTIFYING PAYLOADS
5 Assignments
0 Petitions
Accused Products
Abstract
The addition of relative term positions, temporal positions, and segment identifiers to an inverted index allows for temporal and phrase queries of multimedia assets. Segment identifiers enable any search results to be examined in context. The system makes advantageous use of Lucene'"'"'s binary payload functionality to store temporal data and segment identifiers as additional binary data for each term instance in the inverted index. The payloads are made up of three variable-length integers, which account for twelve extra bytes of metadata, which are stored for each term instance. A content database on a Master/Administrator server node provides the indexes for search into content in response to user events, returning results in JSON format. The search results may then be used to locate and present content segments to a user containing both requested search term results and the time location within the multimedia asset in which the search term(s) is found.
45 Citations
63 Claims
-
1-39. -39. (canceled)
-
40. A system for indexing multimedia digital content, comprising:
-
receiving at a data aggregator time-based metadata associated with the multimedia digital content, the time-based metadata being organized into a plurality of raw content segments; storing the plurality of raw content segments in a database in electronic communication with the data aggregator, each of the raw content segments being retrievable from the database based on a segment identifier assigned to each of the respective raw content segments; using a computer processor to normalize the plurality of raw content segments; and creating a searchable inverted index for the multimedia digital content that defines a segment instance for each occurrence of the textual description of the plurality of normalized content segments associated with the time-based metadata, where each segment instance is associated with at least one of the plurality of raw content segments stored in the database. - View Dependent Claims (41, 42, 43, 44, 45, 46, 47)
-
-
48. A method for identification and indexing of time-based portions of a multimedia digital content asset, comprising:
-
receiving time-based metadata associated with a multimedia digital content asset, the time-based metadata being organized into a plurality of raw content segments; storing the plurality of raw content segments in a database, each of the raw content segments being retrievable from the database based on a segment identifier assigned to each of the respective raw content segments; normalizing in a computer processor the plurality of raw content segments, where the textual description of each raw content segment includes one or more terms; creating a searchable inverted index for the multimedia digital content that defines a segment instance for each occurrence of the one or more terms from the textual description of the plurality of normalized content segments associated with the time-based metadata, where each segment instance is associated with at least one of the plurality of raw content segment stored in the database. - View Dependent Claims (49, 50, 51, 52, 53, 54, 55)
-
-
56. A computer program product embodied in a computer readable medium that when executed within a computer processor provides for identification of time-based portions of a multimedia digital content asset, comprising:
-
receiving time-based metadata associated with a multimedia digital content asset, the time-based metadata being organized into a plurality of raw content segments; storing the plurality of raw content segments in a database, each of the raw content segments being retrievable from the database based on a segment identifier assigned to each of the respective raw content segments; normalizing through the use of a computer processor the plurality of raw content segments, where the textual description of each raw content segment includes one or more terms; creating a searchable inverted index for the multimedia digital content that defines a segment instance for each occurrence of the one or more terms from the textual description of the plurality of normalized content segments associated with the time-based metadata, where each segment instance is associated with at least one of the plurality of raw content segment stored in the database. - View Dependent Claims (57, 58, 59, 60, 61, 62, 63)
-
Specification