×

System and method for detection and segmentation of touching characters for OCR

  • US 9,922,263 B2
  • Filed: 03/20/2013
  • Issued: 03/20/2018
  • Est. Priority Date: 04/12/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method for detection of touching characters in a media by segmentation of adjoining character spaces, the method comprising:

  • acquiring each component of the media in a predetermined sequence, each component having at least two touching characters;

    determining an aspect ratio of each component; and

    performing a component investigation for each aspect ratio higher than a threshold aspect ratio, the component investigation comprising;

    determining a candidate touching position of the at least two touching characters in a plurality of geometric orientations of the at least two touching characters;

    computing a number of pixels representing a text of the at least two touching characters;

    computing a length of a longest run of the number of pixels representing the text of the at least two touching characters for each column of the component;

    determining a candidate cut column based on a relation between a column pixel density and a corresponding length of the column; and

    segmenting the at least two touching characters with a referential boundary of the candidate cut column in the component.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×