×

Polygon-based technique for the automatic classification of text and graphics components from digitized paper-based forms

  • US 5,050,222 A
  • Filed: 05/21/1990
  • Issued: 09/17/1991
  • Est. Priority Date: 05/21/1990
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of classifying components of an image into text or graphics comprising the steps of:

  • a) digitizing the image to form a bit map representation of the image;

    b) extracting a set of contour vectors from the bit map image; and

    c) extracting from the set of contour vectors a set of polygon features;

    d) employing the set of polygon features to classify a first set of graphics components;

    e) separating the image contour vectors into inner and outer contours;

    f) sorting all of the inner and outer contours according to horizontal location in their respective group;

    g) employing the inner and outer contours and the row segregation to classify a second set of graphic components;

    h) employing the polygon features and row segmentation to detect space between polygons in a horizontal projection between two consecutive polygons to identify a group of object strings;

    i) extracting from the group of textual strings a third set of graphic components in the form of single like text strings.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×