×

Removing character from text in non-image form where location of character in image of text falls outside of valid content boundary

  • US 8,682,075 B2
  • Filed: 12/28/2010
  • Issued: 03/25/2014
  • Est. Priority Date: 12/28/2010
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving, by a processor, data representing an image of text, and data representing the text in non-image form;

    determining, by the processor, a valid content boundary within the image of the text, the valid content boundary dividing a portion of the image corresponding to valid text of the image from and excluding another portion of the image corresponding to one or more of stray marks, dirt, debris, and handwritten notes;

    for each character of a plurality of characters within the text in the non-image form,determining, by the processor, a location of the character within the image of the text; and

    where the location of the character within the image of the text falls outside the valid content boundary, removing the character from the data representing the text in the non-image form, by the processor.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×