×

High-speed retrieval by example

  • US 5,867,597 A
  • Filed: 09/05/1995
  • Issued: 02/02/1999
  • Est. Priority Date: 09/05/1995
  • Status: Expired due to Term
First Claim
Patent Images

1. A document retrieval apparatus, wherein a target document is input and a matching document is retrieved from a document database, comprising:

  • character detecting means for detecting character bounds in the target document based on image content of the target document;

    discrimination means, coupled to the character detecting means, for discriminating the character bounds into classes, wherein discrimination is done based on at least one unambiguous characteristic of the character bound including the pixel density over an area of the character bound;

    descriptor generating means, coupled to receive class indications of character bounds from the character detecting means, for generating target document descriptors based on patterns of class indications;

    searching means, coupled to receive the target document descriptors from the descriptor generating means, for searching the document database for potentially matching documents which have descriptors in common with the target document;

    evaluation means, coupled to receive a set of potentially matching documents from the searching means, for determining at least one matching document from among the potentially matching documents; and

    output means, coupled to the evaluation means, for outputting the at least one matching document or indication thereof as a result of a retrieval request wherein the target document is input.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×