×

METHODS FOR EFFICIENTLY AND SYSTEMATICALLY SEARCHING STOCK, IMAGE, AND OTHER NON-WORD-BASED DOCUMENTS

  • US 20100153402A1
  • Filed: 02/04/2010
  • Published: 06/17/2010
  • Est. Priority Date: 05/24/2006
  • Status: Active Grant
First Claim
Patent Images

1. A computer method for searching image documents containing non-word-based data, comprising the computer executed steps of:

  • (a) collecting a group of image documents to form a collection of image documents;

    (b) dividing each document in said group of collected documents into an array of cells of same type, each of said cells of said array comprises a plurality of pixels;

    (c) defining a plurality of non-word-based token patterns;

    (d) tokenizing said documents by matching said array of cells against said plurality of defined non-word-based token patterns to generate a collection of tokens for each of said documents, and providing a name for each of said tokens;

    (e) combining the collections of tokens for said documents into a master collection of tokens;

    (f) providing a query image, said query image is a part of an image document, and dividing said query into an array of cells, each of said cells of said array comprises a plurality of pixels;

    (g) tokenizing said query image by matching said array of cells of said query image against said plurality of defined non-word-based token patterns to generate a collection of tokens of said query image, and providing a name for each of said tokens;

    (h) searching for image documents in said collection of documents that have the same tokens with the same position arrangement as said tokens of said query image by searching said query token names in said master collection of tokens, to provide a plurality of matching documents with respective scores; and

    (i) displaying matching documents in the order of their matching scores.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×