MIXED MEDIA REALITY INDEXING AND RETRIEVAL FOR REPEATED CONTENT
First Claim
1. An apparatus for use in recognizing documents from an image patch, the apparatus comprising:
- an image recognition unit having and input and an output, the input of the image recognition unit coupled to receive the image patch, the image recognition unit extracting features from the image patch and comparing the extracted features;
a hierarchical shared content index coupled to the image recognition unit, the hierarchical shared content index having a hierarchical description of document pages; and
a hierarchical MMR index coupled and communicating with the hierarchical shared content index, the hierarchical MMR index being a tree with a plurality of nodes, each node is an index.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for indexing and retrieval of document images in an MMR system having repeated content is described. The system provides one or more hierarchical shared content indices that produce faster and/or more accurate search results. The system is also advantageous because the number and configuration of the hierarchical shared content indices is automated, scalable and efficient for processing documents with partially repeated content. In particular, the MMR matching unit includes a hierarchical shared content index (HSCI) and associated methods of use for processing images where the MMR system includes repeated content. The present invention also includes a number of novel methods including a method for adding an image to a hierarchical shared content index; a method for deleting an image from the hierarchical shared content index, and a method for using the hierarchical shared content index for image recognition, as well as a method for combining multiple MMR indexes into a hierarchical MMR index.
119 Citations
20 Claims
-
1. An apparatus for use in recognizing documents from an image patch, the apparatus comprising:
-
an image recognition unit having and input and an output, the input of the image recognition unit coupled to receive the image patch, the image recognition unit extracting features from the image patch and comparing the extracted features; a hierarchical shared content index coupled to the image recognition unit, the hierarchical shared content index having a hierarchical description of document pages; and a hierarchical MMR index coupled and communicating with the hierarchical shared content index, the hierarchical MMR index being a tree with a plurality of nodes, each node is an index. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method of creating a hierarchical index, the method comprising:
-
receiving an image; comparing the received image to the hierarchical shared content index; determining whether there is repeated content in the received image; identifying the repeated content; and updating the hierarchical index for the identified repeated content. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. A method of using a hierarchical index to recognize a document, the method comprising:
-
receiving an image patch; determining a document correspond to the image patch; determining a node in the hierarchical index corresponding to the document; and producing a list of matching documents by traversing the hierarchical index. - View Dependent Claims (17, 18, 19, 20)
-
Specification