×

Method for determining boundaries of words in text

  • US 5,321,770 A
  • Filed: 11/19/1991
  • Issued: 06/14/1994
  • Est. Priority Date: 11/19/1991
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of isolating at least one word object composed of multiple adjacent symbols within data defining a first image, comprising the steps of:

  • (a) determining a skew angle corresponding to the orientation of the word object within the data defining the first image(b) producing a second image, as a function of the first image, where adjacent symbols which form the work object are represented as having at least one point of contact between one another, thereby representing the word object as a group of connected symbols;

    (c) locating the connected symbols within the second image;

    (d) identifying boundaries about each group of connected symbols within the second image, so as to segment the second image by word objects oriented with respect to the skew angle;

    (e) applying the boundaries identified in step (d), to the first image to group the data associated with each word object, thereby isolating the word object within the data defining the first image;

    (f) constructing a first line through the image, said first line being composed of a plurality of discrete points and oriented in a direction perpendicular to the orientation of the word object, as defined by the skew angle;

    (g) projecting the data defining the first image onto said first line, to produce an indication of the frequency of occurrence of symbol extents within the projected data as a function of the displacement along the line, including the steps of;

    (1) counting, along a branch line extending orthogonally from the first line, the number of pixels lying within the bounds of symbols which are intersected by the branch line;

    (2) counting, along the branch line extending orthogonally from the first line, the number of pixels which lie at a vertical extend of a symbol and are intersected by the branch line;

    (3) calculating a ratio between the number of intersected vertical extend pixels determined in step (g)(2) and the number intersected symbol pixels determined in step (g)(1);

    (4) storing the ratio calculated in step (g)(3) in a unique memory position;

    (5) repeating steps (g)(1) through (g)(4) for all discrete points lying along the first line, thereby determining ratios for the first image; and

    (h) identifying the locations of local maxima within the projected data, wherein the maxima are representative of characteristic lines for the word objects and are oriented with respect to the skew angle.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×