Method and system for character recognition
Abstract
Character recognition is described. In one embodiment, it may use matched sequences rather than character shape to determine a computer-legible result.
208 Citations
18 Claims
1. An article of manufacture comprising a non-transitory computer-readable medium with instructions encoded thereon, the instructions configured to cause one or more processors to perform a method comprising:
receiving an image of a sequence of glyphs;
for each of a subset of glyphs of the sequence;
determining whether the glyph matches a retained glyph;
assigning to the glyph a character identity associated with the retained glyph in response to determining that the glyph matches a retained glyph;
retaining the unmatched glyph, selecting a character identity for the unmatched glyph, and assigning the selected character identity to the unmatched glyph in response to determining that the glyph does not match a retained glyph;
generating a representation of the image of the sequence of glyphs with the assigned character identities; and
identifying the sequence of glyphs using the generated representation.
Dependent claims: 2, 3, 4, 5
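The steps recited in claim 1 can be read as a single pass over the glyphs with a table of retained glyphs. The Python sketch below is illustrative only and is not taken from the specification: the function names, the representation of a glyph as a string bitmap, the `max_diff` tolerance, the placeholder letters used as character identities, and the vocabulary lookup used for the final identification step are all assumptions.

```python
def glyphs_match(a, b, max_diff=0):
    """Hypothetical matcher: glyphs are equal-length strings (e.g. flattened
    bitmaps) and match when at most max_diff positions differ."""
    return len(a) == len(b) and sum(x != y for x, y in zip(a, b)) <= max_diff


def assign_identities(glyphs, max_diff=0):
    """Return a representation with one placeholder identity per glyph."""
    retained = []   # (glyph, identity) pairs retained so far
    assigned = []
    for glyph in glyphs:
        for kept, identity in retained:
            if glyphs_match(glyph, kept, max_diff):
                # Matched a retained glyph: reuse its identity.
                assigned.append(identity)
                break
        else:
            # Unmatched: retain the glyph and select a fresh identity.
            # Assumption: identities are placeholder characters A, B, C, ...
            identity = chr(ord("A") + len(retained))
            retained.append((glyph, identity))
            assigned.append(identity)
    return "".join(assigned)


def identify(representation, vocabulary):
    """Assumed identification step: return the vocabulary words whose
    repetition pattern equals the generated representation."""
    return [w for w in vocabulary if assign_identities(w) == representation]


# Five glyph images in which the third and fourth are identical.
rep = assign_identities(["g1", "g2", "g3", "g3", "g4"])
print(rep)                                                 # ABCCD
print(identify(rep, ["hello", "world", "jolly", "abbey"]))  # ['hello', 'jolly']
```

In this sketch the representation depends only on which glyphs repeat, not on what any glyph looks like, so several words can share one pattern; a practical system would need further context to choose among the candidates.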
6. A method comprising:
storing an image of a text sequence in a non-transitory computer-readable medium associated with a computing system, the text sequence including multiple glyphs unknown to the computing system;
using one or more processors associated with the computing system to associate an identity to a subset of the multiple glyphs based on (1) similarities in shape between the glyphs within the image of the text sequence, and (2) relative locations of the glyphs within the image of the text sequence; and
using the one or more processors to generate a text representation by replacing the glyphs with the associated identities.
Dependent claims: 7, 8, 9
10. An article of manufacture comprising a non-transitory computer-readable medium with instructions encoded thereon, the instructions configured to cause one or more processors associated with a computing system to perform a method comprising:
receiving an image of a text sequence, the text sequence including multiple glyphs unknown to the computing system;
associating an identity to a subset of the multiple glyphs based on (1) similarities in shape between the glyphs within the image of the text sequence, and (2) relative locations of the glyphs within the image of the text sequence; and
generating a text representation by replacing the glyphs with the associated identities.
11. A method comprising:
storing an image of a sequence of glyphs in a non-transitory computer-readable medium associated with one or more processors;
for each of at least a subset of the glyphs, in the order of the sequence;
determining whether an identifier has been attributed to the glyph;
in response to determining that no identifier has been attributed to the glyph;
selecting an unassigned identifier;
assigning the selected identifier to the glyph;
attributing the selected identifier to the glyph; and
attributing the selected identifier to any glyphs that follow the glyph in the sequence and match the glyph; and
generating an encoding of the image that is based on the identifiers attributed to the glyphs of the sequence.
Dependent claims: 12, 13, 14
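The forward attribution recited in claim 11 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the function name `encode_by_identifier`, the use of consecutive integers as identifiers, and the default equality test standing in for glyph matching are all hypothetical.

```python
def encode_by_identifier(glyphs, matches=lambda a, b: a == b):
    """Attribute an identifier to each glyph, in sequence order.

    Assumption: identifiers are consecutive integers starting at 0, and
    `matches` is a stand-in for whatever glyph comparison is used.
    """
    identifiers = [None] * len(glyphs)
    next_id = 0
    for i, glyph in enumerate(glyphs):
        if identifiers[i] is not None:
            continue  # an identifier was already attributed by an earlier glyph
        # Select an unassigned identifier and attribute it to this glyph...
        identifiers[i] = next_id
        # ...and to every later glyph in the sequence that matches it.
        for j in range(i + 1, len(glyphs)):
            if identifiers[j] is None and matches(glyphs[j], glyph):
                identifiers[j] = next_id
        next_id += 1
    return identifiers


# Characters stand in for glyph images here.
print(encode_by_identifier(list("banana")))  # [0, 1, 2, 1, 2, 1]
```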
15. An article of manufacture comprising a non-transitory computer-readable medium with instructions encoded thereon, the instructions configured to cause one or more processors to perform a method comprising:
receiving an image of a sequence of glyphs;
initializing an encoding of the image;
for each of at least a subset of the glyphs, in the order of the sequence;
determining an offset to a glyph earlier in the sequence that matches the glyph; and
storing a value in the encoding that indicates the determined offset.
Dependent claims: 16, 17
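The offset-based encoding of claim 15 might look like the sketch below. The claim does not say which earlier matching glyph is used or how a glyph with no earlier match is recorded, so two assumptions are made here and marked in the code: the nearest earlier match is chosen, and an offset of 0 means "no earlier match". The function name and the equality-based matcher are likewise hypothetical.

```python
def encode_by_offset(glyphs, matches=lambda a, b: a == b):
    """Encode each glyph as the distance back to an earlier matching glyph."""
    encoding = []  # initialize the encoding
    for i, glyph in enumerate(glyphs):
        offset = 0  # assumption: 0 marks a glyph with no earlier match
        # Assumption: search backwards so the nearest earlier match is used.
        for j in range(i - 1, -1, -1):
            if matches(glyphs[j], glyph):
                offset = i - j
                break
        encoding.append(offset)
    return encoding


# Characters stand in for glyph images here.
print(encode_by_offset(list("banana")))  # [0, 0, 0, 2, 2, 2]
print(encode_by_offset(list("hello")))   # [0, 0, 0, 1, 0]
```

Like the identifier-based encoding, this value sequence records only where glyphs repeat, so two images of different text with the same repetition structure produce the same encoding.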
18. A method comprising:
storing an image of a sequence of glyphs in a non-transitory computer-readable medium associated with one or more processors;
using the one or more processors to initialize an encoding of the image; and
for each of at least a subset of the glyphs, in the order of the sequence, using the one or more processors to determine an offset to a glyph earlier in the sequence that matches the glyph and store an indication of the determined offset in the encoding.
Specification