×

Method and system for document fingerprint matching in a mixed media environment

  • US 8,335,789 B2
  • Filed: 07/31/2006
  • Issued: 12/18/2012
  • Est. Priority Date: 10/01/2004
  • Status: Active Grant
First Claim
Patent Images

1. A method of image matching, comprising:

  • receiving an image of at least part of a first media type;

    generating a horizontal profile from the image, the horizontal profile identifying words in the image;

    generating a plurality of bounding boxes, each bounding box surrounding a word in the horizontal profile;

    horizontally classifying the plurality of bounding boxes in the image;

    vertically classifying the plurality of bounding boxes in the image;

    determining at least one spatial relationship between the plurality of bounding boxes by associating a first length of a first word with a second length of a second word in the horizontal profile and combining the horizontal and vertical classifications;

    generating at least one horizontal grouping of bounding boxes and at least one vertical grouping of bounding boxes based on the spatial relationship;

    generating a list of documents from a database of one or more documents, the list of documents including at least one common bounding box comprising an overlap of the at least one horizontal grouping of bounding boxes and the at least one vertical grouping of bounding boxes at a location in each document in the list;

    determining a number of votes for each document in the list based on a number of common bounding boxes; and

    identifying a matching document with a most number of votes from the list as a document containing the image.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×