
Method for classifying non-running text in an image

  • US 6,009,196 A
  • Filed: 11/28/1995
  • Issued: 12/28/1999
  • Est. Priority Date: 11/28/1995
  • Status: Expired due to Term
First Claim

1. A method comprising the steps of:

  • retrieving an input image, the image comprising an array of image signals and associated data defining a set of boundaries of a plurality of text-blocks represented therein, and storing the array of image signals in a bitmap array and the data defining the set of boundaries in a second array;

    automatically partitioning the text-blocks defined by the set of boundaries stored in the second array into text groups;

    automatically classifying the text-groups to determine those text-groups which represent running text regions of the image and those which represent non-running text regions of the image;

    automatically regrouping at least one non-running text region of the image based upon locations of the text blocks within the non-running text region, said step of regrouping at least one non-running text region further comprising the sub-steps of:

    determining the locations of the text blocks within the non-running text region;

    grouping the text blocks within the non-running text region into horizontally and vertically aligned groups of text blocks based upon at least one common coordinate;

    assigning a common identifier to all text blocks in each horizontally and vertically aligned group;

    counting the number of items in the horizontally and vertically aligned groups, where an item of a horizontally aligned group is a set of text blocks in the horizontally aligned group belonging to a single vertically aligned group, and an item of a vertically aligned group is a set of text blocks in the vertically aligned group belonging to a single horizontally aligned group; and

    storing the number of items in each group in memory;

    further automatically classifying a non-running text region as to the extent to which such a text region is tabularized.
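
The regrouping sub-steps recited above (locating blocks, grouping them by a common coordinate, assigning group identifiers, and counting items) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not taken from the patent: the `Block` type, the choice of the top edge and left edge as the "common coordinate", the pixel tolerance `tol`, the chained clustering rule, and in particular `tabular_score`, which is only one hypothetical way to express "the extent to which a text region is tabularized".

```python
from collections import Counter, defaultdict
from typing import NamedTuple


class Block(NamedTuple):
    """Hypothetical text-block bounding box in pixel coordinates."""
    left: int
    top: int
    right: int
    bottom: int


def cluster_by_coordinate(values, tol):
    """Return one group id per value.

    Values are visited in sorted order; a new group starts whenever the gap
    to the previous sorted value exceeds `tol` (an assumed tolerance).
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    ids = [0] * len(values)
    group = -1
    prev = None
    for i in order:
        if prev is None or values[i] - prev > tol:
            group += 1
        ids[i] = group
        prev = values[i]
    return ids


def count_items(own_ids, other_ids):
    """Count items per group.

    An item of a group is the set of its blocks that share a single group id
    in the other direction, so the item count is the number of distinct
    other-direction ids seen in the group.
    """
    members = defaultdict(set)
    for own, other in zip(own_ids, other_ids):
        members[own].add(other)
    return {g: len(s) for g, s in members.items()}


def analyze(blocks, tol=3):
    """Group blocks into rows and columns and count the items in each."""
    # Assumption: blocks sharing a top edge are horizontally aligned (a row),
    # blocks sharing a left edge are vertically aligned (a column).
    row_ids = cluster_by_coordinate([b.top for b in blocks], tol)
    col_ids = cluster_by_coordinate([b.left for b in blocks], tol)
    row_items = count_items(row_ids, col_ids)
    col_items = count_items(col_ids, row_ids)
    return row_ids, col_ids, row_items, col_items


def tabular_score(row_items):
    """Hypothetical measure: share of rows having the most common item count."""
    counts = Counter(row_items.values())
    modal, freq = counts.most_common(1)[0]
    if modal < 2:  # single-item rows do not suggest a table
        return 0.0
    return freq / len(row_items)
```

For a region laid out as three rows of two blocks each, `analyze` would place the blocks in three horizontally aligned groups and two vertically aligned groups, and the per-group item counts would be stored in the returned dictionaries. A real system would likely need more robust alignment tests (for example center or right-edge alignment) than this sketch uses.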
