×

Method and system for commercial document image classification

  • US 8,831,361 B2
  • Filed: 03/09/2012
  • Issued: 09/09/2014
  • Est. Priority Date: 03/09/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method of automatic commercial document image classification using a computer for performing the steps of:

  • automatically obtaining the document layout using the salient features of the document images layout, said features consisting of text blocks, geometric line segments and the contents (sets of words) of said text blocks;

    defining distances between such layouts;

    automatically creating a plurality of classification template layouts directly from said distances;

    Ordering said plurality of classification template layouts and generating a ranked list of candidate template layouts best matching a given input document layout in such a way that the classification template layout most similar to the document layout to be classified is at the top of the list (highest ranked layout), the next likely candidate template layout is in the second position and so forth;

    classifying the input image as belonging to the class of the top template layout in said ranked list of template layouts.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×