×

Method for line and word segmentation for handwritten text images

  • US 10,643,094 B2
  • Filed: 07/23/2018
  • Issued: 05/05/2020
  • Est. Priority Date: 09/29/2016
  • Status: Active Grant
First Claim
Patent Images

1. A method implemented on a computer for segmenting an input image into line segments and word segments, the input image being a binary image containing text, the method comprising:

  • (a) down sampling the input image along a first direction using a first down-sampling ratio;

    (b) detecting connected regions in the down-sampled image obtained in step (a);

    (c) identifying neighboring connected regions that are neighbors of each other along the first direction that belong to same lines to form line lists containing such neighboring connected regions;

    (d) segmenting the input image into a plurality of line segments of the input image, each line segment of the input image being a region of the input image that corresponds to a bounding box in the down-sampled image containing all connected regions in a corresponding line list obtained in step (c); and

    for each of the line segments of the input image obtained in step (d),(e) down sampling the line segment of the input image along the first direction using a second down-sampling ratio;

    (f) detecting connected regions in the down-sampled line segment obtained in step (e); and

    (g) segmenting the line segment of the input image obtained from step (d) into word segments at one or more word segmentation positions using the connected regions obtained in step (f), wherein the word segmentation positions are a subset of positions corresponding to locations in gaps between the connected regions in the down-sampled line segment of step (e) that have been detected in step (f).

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×