Hierarchical video search and recognition system
First Claim
1. A system for content recognition, search, and retrieval in visual data, the system comprising one or more processors configured to perform operations of:
- receiving unlabeled visual data as an input;
extracting a set of activity-agnostic content descriptors from the unlabeled, visual data with a hierarchical content descriptor module having a plurality of levels, resulting in a set of generated activity-agnostic content descriptors, wherein each level of the hierarchical content descriptor module generates a distinct set of activity-agnostic content descriptors;
indexing the generated activity-agnostic content descriptors with a hierarchical content indexing module having a plural of levels corresponding to the plurality of levels of the hierarchical content descriptor module, resulting in a set of indexed content descriptors;
wherein each level of the content indexing module comprises a distinct set of indexed content descriptors from a corresponding level in the hierarchical content descriptor module;
storing the unlabeled visual data, the set of generated activity-agnostic content descriptors, and the set of indexed content descriptors in a storage module;
searching the storage module and retrieving visual data containing, a content of interest based on a user query;
refining the user query based on feature weighting; and
presenting the visual data containing the content of interest to the user.
1 Assignment
0 Petitions
Accused Products
Abstract
Described is a system for content recognition, search, and retrieval in visual data. The system is configured to perform operations of receiving visual data as an input, processing the visual data, and extracting distinct activity-agnostic content descriptors from the visual data at each level of a hierarchical content descriptor module. The resulting content descriptors are then indexed with a hierarchical content indexing module, wherein each level of the content indexing module comprises a distinct set of indexed content descriptors. The visual data, generated content descriptors, and indexed content descriptors are then stored in a storage module. Finally, based on a content-based query by a user, the storage module is searched, and visual data containing the content of interest is retrieved and presented to the user. A method and computer program product for content recognition, search, and retrieval in visual data are also described.
-
Citations
18 Claims
-
1. A system for content recognition, search, and retrieval in visual data, the system comprising one or more processors configured to perform operations of:
-
receiving unlabeled visual data as an input; extracting a set of activity-agnostic content descriptors from the unlabeled, visual data with a hierarchical content descriptor module having a plurality of levels, resulting in a set of generated activity-agnostic content descriptors, wherein each level of the hierarchical content descriptor module generates a distinct set of activity-agnostic content descriptors; indexing the generated activity-agnostic content descriptors with a hierarchical content indexing module having a plural of levels corresponding to the plurality of levels of the hierarchical content descriptor module, resulting in a set of indexed content descriptors; wherein each level of the content indexing module comprises a distinct set of indexed content descriptors from a corresponding level in the hierarchical content descriptor module; storing the unlabeled visual data, the set of generated activity-agnostic content descriptors, and the set of indexed content descriptors in a storage module; searching the storage module and retrieving visual data containing, a content of interest based on a user query; refining the user query based on feature weighting; and presenting the visual data containing the content of interest to the user. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-implemented method for content recognition, search, and retrieval in visual data, the method comprising an act of causing a processor to perform operations of:
-
receiving unlabeled visual data as an input; extracting a set of activity-agnostic content descriptors from the unlabeled visual data with a hierarchical content descriptor module having a plurality of levels, resulting in a set of generated activity-agnostic content descriptors, wherein each level of the hierarchical content descriptor module generates a distinct set of activity-agnostic content descriptors; indexing the generated activity-agnostic content descriptors with a hierarchical content indexing module having a plurality of levels corresponding to the plurality of levels of the hierarchical content descriptor module, resulting in a set of indexed content descriptors; wherein each level of the content indexing module comprises a distinct set of indexed content descriptors from a corresponding level in the hierarchical content descriptor module; storing the unlabeled visual data, the set of generated activity-agnostic content descriptors, and the set of indexed content descriptors in a storage module; searching the storage module and retrieving visual data containing a content of interest based on a user query; refining the user query based on feature weighting; and presenting the visual data containing the content of interest to the user. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer program product for content recognition, search, and retrieval in visual data, the computer program product comprising computer-readable instruction means stored on a computer-readable medium that are executable by a computer having a processor for causing the processor to perform operations of:
-
receiving unlabeled visual data as an input; extracting a set of activity-agnostic content descriptors from the unlabeled visual data with a hierarchical content descriptor module having a plurality of levels, resulting in a set of generated activity-agnostic content descriptors, wherein each level of the hierarchical content descriptor module generates a distinct set of activity-agnostic content descriptors; indexing the generated activity-agnostic content descriptors with a hierarchical content indexing module having a plurality of levels corresponding to the plurality of levels of the hierarchical content descriptor module, resulting in a set of indexed content descriptors; wherein each level of the content indexing module comprises a distinct set of indexed content descriptors from a corresponding level in the hierarchical content descriptor module; storing the unlabeled visual data, the set of generated activity-agnostic content descriptors, and the set of indexed content descriptors in a storage module; searching the storage module and retrieving visual data containing a content of interest based on a user query; refining the user query based on feature weighting; and presenting the visual, data containing the content of interest to the user. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification