ACCESSING MEDIA DATA USING METADATA REPOSITORY
First Claim
1. A computer-implemented method comprising:
- tagging, in dialog and action text from an input script document regarding video content, at least some grammatical units of each sentence according to part-of-speech to generate tagged verb and noun phrases;
submitting the tagged verb and noun phrases to a named entity recognition (NER) extractor;
identifying and classifying, by the NER extractor, entities and actions in the tagged verb and noun phrases, the NER extractor using one or more external world knowledge ontologies in performing the identification and classification;
generating an entity-relationship data model that represents the entities and actions identified and classified by the NER extractor;
processing the generated entity-relationship data model to generate a metadata repository;
receiving, in a computer system, a user query comprising at least a first term;
parsing the user query to at least determine whether the user query assigns an action field defining the first term, the action field being a description of an action performed by an entity in a video;
converting the user query into a parsed query that conforms to a predefined format;
performing a search in the metadata repository using the parsed query, the metadata repository embodied in a computer readable medium and being generated based on multiple modes of metadata for the video content, the search identifying a set of candidate scenes from the video content;
ranking the set of candidate scenes according to a scoring metric into a ranked scene list; and
generating an output from the computer system that includes at least part of the ranked scene list, the output generated in response to the user query.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-implemented method includes receiving, in a computer system, a user query comprising at least a first term, parsing the user query to at least determine whether the user query assigns a field to the first term, the parsing resulting in a parsed query that conforms to a predefined format, performing a search in a metadata repository using the parsed query, the metadata repository embodied in a computer readable medium and including triplets generated based on multiple modes of metadata for video content, the search identifying a set of candidate scenes from the video content, ranking the set of candidate scenes according to a scoring metric into a ranked scene list, and generating an output from the computer system that includes at least part of the ranked scene list, the output generated in response to the user query.
401 Citations
20 Claims
-
1. A computer-implemented method comprising:
-
tagging, in dialog and action text from an input script document regarding video content, at least some grammatical units of each sentence according to part-of-speech to generate tagged verb and noun phrases; submitting the tagged verb and noun phrases to a named entity recognition (NER) extractor; identifying and classifying, by the NER extractor, entities and actions in the tagged verb and noun phrases, the NER extractor using one or more external world knowledge ontologies in performing the identification and classification; generating an entity-relationship data model that represents the entities and actions identified and classified by the NER extractor; processing the generated entity-relationship data model to generate a metadata repository; receiving, in a computer system, a user query comprising at least a first term; parsing the user query to at least determine whether the user query assigns an action field defining the first term, the action field being a description of an action performed by an entity in a video; converting the user query into a parsed query that conforms to a predefined format; performing a search in the metadata repository using the parsed query, the metadata repository embodied in a computer readable medium and being generated based on multiple modes of metadata for the video content, the search identifying a set of candidate scenes from the video content; ranking the set of candidate scenes according to a scoring metric into a ranked scene list; and generating an output from the computer system that includes at least part of the ranked scene list, the output generated in response to the user query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer program product tangibly embodied in a computer-readable storage medium and comprising instructions executable by a processor to perform a method comprising:
-
tagging, in dialog and action text from an input script document regarding video content, at least some grammatical units of each sentence according to part-of-speech to generate tagged verb and noun phrases; identifying and classifying, by the named entity recognition (NER) extractor, entities and actions in the tagged verb and noun phrases, the NER extractor using one or more external world knowledge ontologies in performing the identification and classification; generating an entity-relationship data model that represents the entities and actions identified and classified by the NER extractor; processing the generated entity-relationship data model to generate a metadata repository; receiving, in a computer system, a user query comprising at least a first term; parsing the user query to at least determine whether the user query assigns an action field defining the first term, the action field being a description of an action performed by an entity in a video; converting the user query into a parsed query that conforms to a predefined format; performing a search in the metadata repository using the parsed query, the metadata repository embodied in a computer readable medium and being generated based on multiple modes of metadata for the video content, the search identifying a set of candidate scenes from the video content; ranking the set of candidate scenes according to a scoring metric into a ranked scene list; and generating an output from the computer system that includes at least part of the ranked scene list, the output generated in response to the user query.
-
-
18. A computer system comprising:
-
a metadata repository embodied in a computer readable medium and being generated based on multiple modes of metadata for video content, including; tagging, in dialog and action text from an input script document regarding video content, at least some grammatical units of each sentence according to part-of-speech to generate tagged verb and noun phrases; submitting the tagged verb and noun phrases to a named entity recognition (NER) extractor; identifying and classifying, by the NER extractor, entities and actions in the tagged verb and noun phrases, the NER extractor using one or more external world knowledge ontologies in performing the identification and classification; generating an entity-relationship data model that represents the entities and actions identified and classified by the NER extractor; and processing the generated entity-relationship data model to generate a metadata repository; a multimodal query engine embodied in a computer readable medium and configured for searching the metadata repository based on a user query, the multimodal query engine comprising; a parser configured to parse the user query to at least determine whether the user query assigns an action field defining the first term, the action field being a description of an action performed by an entity in a video; converting the user query into a parsed query that conforms to a predefined format; a scene searcher configured to perform a search in the metadata repository using the parsed query, the search identifying a set of candidate scenes from the video content; and a scene scorer configured to rank the set of candidate scenes according to a scoring metric into a ranked scene list; and a user interface embodied in a computer readable medium and configured to receive the user query from a user and generate an output that includes at least part of the ranked scene list in response to the user query. - View Dependent Claims (19, 20)
-
Specification