×

Methods and systems for assessing the quality of automatically generated text

  • US 8,442,813 B1
  • Filed: 02/05/2009
  • Issued: 05/14/2013
  • Est. Priority Date: 02/05/2009
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer-implemented method of assessing the quality of computer-generated text, the method comprising:

  • receiving a plurality of characters generated from an image of a document;

    determining, for the plurality of characters generated from the image of the document, language-conditional character probabilities based on a set of language models and an ordering of the characters, a language-conditional character probability for a target character in the plurality of characters describing a degree to which the target character and an ordered set of characters preceding the target character concord with a given language model in the set of language models;

    identifying, for the target character, neighbor characters proximate to a location of the target character in the image of the document, wherein the neighbor characters have associated language-conditional character probabilities and are within a defined distance from the location of the target character in the image of the document;

    combining the language-conditional character probabilities associated with the neighbor characters and the language-conditional character probabilities associated with the target character to generate a local language-conditional likelihood for the target character; and

    storing the local language-conditional likelihood for the target character.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×