SEGMENTAL RESCORING IN TEXT RECOGNITION
Abstract
A method for text recognition includes generating a number of text hypotheses for an image, for example, using an HMM-based approach with fixed-width analysis features. For each text hypothesis, one or more segmentations are generated and scored at the segmental level, for example, according to character or character-group segments of the text hypothesis. In some embodiments, multiple alternative segmentations are considered for each text hypothesis. In some examples, the scores determined in generating the text hypotheses and the segmental scores are combined to select an overall text recognition of the image.
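The rescoring idea in the abstract can be sketched in a few lines. This is a minimal illustration, not the patented implementation: the `Hypothesis` layout, the use of log-domain scores, taking the best alternative segmentation with `max`, and the interpolation `weight` are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Hypothetical segment layout: (start_column, end_column, text_part).
Segment = Tuple[int, int, str]


@dataclass
class Hypothesis:
    text: str
    first_score: float                  # first-pass score, assumed to be in the log domain
    segmentations: List[List[Segment]]  # one or more alternative segmentations


def overall_segmental_score(hyp: Hypothesis,
                            segment_scorer: Callable[[Segment], float]) -> float:
    """Sum segment scores within each segmentation, then keep the best
    segmentation (assumed rule; other combinations are possible)."""
    return max(sum(segment_scorer(seg) for seg in segmentation)
               for segmentation in hyp.segmentations)


def rescore(hyps: List[Hypothesis],
            segment_scorer: Callable[[Segment], float],
            weight: float = 0.5) -> List[Tuple[float, Hypothesis]]:
    """Combine first-pass and segmental scores, best hypothesis first.
    The weighted sum is an assumed combination rule."""
    scored = [(hyp.first_score + weight * overall_segmental_score(hyp, segment_scorer), hyp)
              for hyp in hyps]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored


def toy_scorer(seg: Segment) -> float:
    """Toy scorer: penalize segments whose width differs from 10 columns per character."""
    start, end, part = seg
    return -abs((end - start) - 10 * len(part))


hyps = [
    Hypothesis("cot", -0.5, [[(0, 10, "c"), (10, 25, "o"), (25, 30, "t")]]),
    Hypothesis("cat", -1.0, [[(0, 10, "c"), (10, 20, "a"), (20, 30, "t")]]),
]
ranked = rescore(hyps, toy_scorer)
print(ranked[0][1].text)
```

In this toy run the first pass prefers "cot", but its segments fit the assumed width model poorly, so the combined score ranks "cat" first.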
20 Claims
1. A method for text recognition comprising:
- generating a plurality of text hypotheses for an image that includes text, each text hypothesis being associated with a first score;
- for each text hypothesis of the generated hypotheses, forming data representing one or more segmentations of the image associated with the hypothesis, each segmentation including a series of segments of the image, each segment corresponding to a part of the text hypothesis;
- for each of the one or more segmentations, for each segment in the segmentation, forming data representing segmental features of the segment;
- determining a segmental score for each segment according to the segmental features of the segment and the corresponding part of the text hypothesis associated with the segmentation including the segment;
- for each text hypothesis, determining an overall segmental score according to the determined segmental scores for the segments of the one or more segmentations associated with the text hypothesis, and determining an overall score by combining the overall segmental score and the first score associated with the hypothesis; and
- providing data representing a text recognition of the image according to the determined overall score for each of the generated text hypotheses for the image.
Dependent claims: 2 to 18.
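The step "forming data representing segmental features of the segment" can be illustrated with a small sketch. The claim does not say which features are used, so the ones below (width, ink density, aspect ratio) are assumptions chosen for illustration, as are the binary-image representation and the function name `segment_features`.

```python
from typing import Dict, Sequence

# Assumed representation: rows of 0/1 pixels, where 1 marks ink.
Image = Sequence[Sequence[int]]


def segment_features(image: Image, start: int, end: int) -> Dict[str, float]:
    """Compute illustrative features for image columns [start, end)."""
    height = len(image)
    width = end - start
    if height == 0 or width <= 0:
        raise ValueError("segment must cover at least one column of a non-empty image")

    # Count ink pixels inside the segment.
    ink = sum(row[col] for row in image for col in range(start, end))

    # Vertical extent of the ink inside the segment.
    ink_rows = [r for r, row in enumerate(image) if any(row[start:end])]
    ink_height = (ink_rows[-1] - ink_rows[0] + 1) if ink_rows else 0

    return {
        "width": float(width),
        "ink_density": ink / (width * height),
        "aspect_ratio": width / ink_height if ink_height else 0.0,
    }


image = [
    [0, 0, 0, 0, 0, 0],
    [1, 1, 0, 0, 1, 0],
    [1, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]
left = segment_features(image, 0, 2)
right = segment_features(image, 4, 6)
print(left, right)
```

Because each feature is computed over a whole segment of variable width, it can capture properties such as overall glyph shape that fixed-width analysis features do not express directly. A segment scorer would then compare these features with what is expected for the corresponding part of the text hypothesis.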
19. A text recognition system comprising:
- a first text recognition system configured to generate a plurality of text hypotheses for an input image, each text hypothesis being associated with a first score, the first recognition system being further configured, for each text hypothesis of the generated hypotheses, to form data representing one or more segmentations of the image associated with the hypothesis, each segmentation including a series of segments of the image, each segment corresponding to a part of the text hypothesis;
- a segment processor configured to accept the generated text hypotheses and associated segmentations from the first recognition system, and, for each text hypothesis, form one or more segmentations of the image associated with the hypothesis, each segmentation including a series of segments of the image, each segment corresponding to a part of the text hypothesis, and for each of the one or more segmentations, for each segment in the segmentation, form data representing segmental features of the segment;
- wherein the segment processor includes a segment scorer for determining a segmental score for each segment according to the segmental features of the segment and the corresponding part of the text hypothesis associated with the segmentation including the segment;
- wherein the segment processor is further configured, for each text hypothesis, to determine an overall segmental score according to the determined segmental scores for the segments of the one or more segmentations associated with the text hypothesis;
- the system further comprising a scorer configured, for each text hypothesis, to determine an overall score by combining the overall segmental score and the first score generated by the first recognition system, and to output data representing a text recognition of the image according to the determined overall score for each of the generated text hypotheses for the image.
20. Software instructions embodied on a computer readable medium for causing a data processing system to:
- generate a plurality of text hypotheses for an image that includes text, each text hypothesis being associated with a first score;
- for each text hypothesis of the generated hypotheses, form data representing one or more segmentations of the image associated with the hypothesis, each segmentation including a series of segments of the image, each segment corresponding to a part of the text hypothesis;
- for each of the one or more segmentations, for each segment in the segmentation, form data representing segmental features of the segment;
- determine a segmental score for each segment according to the segmental features of the segment and the corresponding part of the text hypothesis associated with the segmentation including the segment;
- for each text hypothesis, determine an overall segmental score according to the determined segmental scores for the segments of the one or more segmentations associated with the text hypothesis, and determine an overall score by combining the overall segmental score and the first score associated with the hypothesis; and
- provide data representing a text recognition of the image according to the determined overall score for each of the generated text hypotheses for the image.
Specification