×

Method of producing character templates using unsegmented samples

  • US 5,706,364 A
  • Filed: 04/28/1995
  • Issued: 01/06/1998
  • Est. Priority Date: 04/28/1995
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of operating a machine to train a set of character templates;

  • the machine including a processor and a memory device for storing data;

    the data stored in the memory device including instruction data the processor executes to operate the machine;

    the processor being connected to the memory device for accessing the data stored therein;

    the method comprising;

    operating the processor to receive and store, in the memory device of the machine, an image definition data structure including a plurality of image pixel positions defining an image;

    the image definition data structure, hereafter referred to as an image glyph source, including a plurality of glyphs occurring therein;

    each glyph being an image instance of a respective one of a plurality of characters in a character set;

    the set of character templates being trained representing respective ones of the plurality of characters in the character set;

    operating the processor to receive and store, in the memory device of the machine, a plurality of labeled glyph position data items;

    each labeled glyph position data item indicating one of the plurality of image pixel positions in the image glyph source as an image glyph source location of a glyph occurring therein;

    each labeled glyph position data item further indicating a glyph label, hereafter referred to as a respectively paired glyph label, paired with the image pixel position identifying the glyph indicated by the image pixel position as a respective one of the plurality of characters in the character set;

    operating the processor to determine a sample image region included in the image glyph source for each labeled glyph position data item;

    the sample image region including the image pixel position indicating the image glyph source location of a glyph and being identified as a training data sample for the character template indicated by the respectively paired glyph label;

    each sample image region including a plurality of image pixel positions in the image glyph source, hereafter referred to as sample pixel positions, each indicating a sample pixel value;

    operating the processor to produce, for each respective character template to be trained, a template image region including a plurality of template pixel positions for storing the respective character template; and

    operating the processor to produce the set of character templates using the template image regions and the sample image regions;

    producing the set of character templates including(a) computing template pixel scores for respective ones of the template pixel positions in the template image regions using the sample pixel values indicated by the sample pixel positions included in the sample image regions;

    (b) assigning a foreground pixel value to a template pixel position, referred to as an assigned template pixel position, in one of the template image regions;

    the assigned template pixel position being selected on the basis of the template pixel scores;

    (c) modifying the sample pixel values of the sample pixel positions used in computing the template pixel score for the assigned template pixel position to indicate modified sample pixel values that, when used in computing a subsequent template pixel score for an unassigned template pixel position, reduce the chance that a foreground pixel value will be assigned to the unassigned template pixel position on the basis of the subsequent template pixel score; and

    (d) repeating substeps (a), (b) and (c) until a stopping condition indicates that the set of character templates is complete.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×