Method and system for character recognition
First Claim
Patent Images
1. A method of character recognition comprising:
- capturing, by a character-recognition system, a scan of a document;
segmenting, by the character-recognition system, the scan into a sequence of character images;
generating, by the character-recognition system, a sequence of identifiers based on the character images, wherein each identifier in the sequence of identifiers represents a particular character, and wherein the generating comprises;
assigning an identifier to a first character image in the sequence of images;
for each subsequent character image in the sequence of images, comparing the subsequent character image to one or more character images that precede the subsequent character image in the sequence of character images;
for each subsequent character image that matches one of the previous character images based on the comparison, assigning the identifier that is assigned to the one character image to the subsequent image; and
for each subsequent character image that does not match one of the previous character images based on the comparison, assigning a new identifier that is different from each identifier previously assigned; and
obtaining, by the character-recognition system, text based on the sequence of identifiers, the obtaining comprising;
comparing the sequence of identifiers to a plurality of identifier sequences, each identifier sequence corresponding to a sequence of characters; and
identifying as the text a sequence of characters corresponding to an identifier sequence that matches the sequence of identifiers.
4 Assignments
0 Petitions
Accused Products
Abstract
A method and system for character recognition are described. In one embodiment, it may use matched sequences rather than character shape to determine a computer legible result.
764 Citations
22 Claims
-
1. A method of character recognition comprising:
-
capturing, by a character-recognition system, a scan of a document; segmenting, by the character-recognition system, the scan into a sequence of character images; generating, by the character-recognition system, a sequence of identifiers based on the character images, wherein each identifier in the sequence of identifiers represents a particular character, and wherein the generating comprises; assigning an identifier to a first character image in the sequence of images; for each subsequent character image in the sequence of images, comparing the subsequent character image to one or more character images that precede the subsequent character image in the sequence of character images; for each subsequent character image that matches one of the previous character images based on the comparison, assigning the identifier that is assigned to the one character image to the subsequent image; and for each subsequent character image that does not match one of the previous character images based on the comparison, assigning a new identifier that is different from each identifier previously assigned; and obtaining, by the character-recognition system, text based on the sequence of identifiers, the obtaining comprising; comparing the sequence of identifiers to a plurality of identifier sequences, each identifier sequence corresponding to a sequence of characters; and identifying as the text a sequence of characters corresponding to an identifier sequence that matches the sequence of identifiers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 21, 22)
-
-
15. A character-recognition system, comprising:
-
memory, configured to store at least a scan of a document; and a processor, configured to; segment the scan into character images; generate a sequence of identifiers based on the character images, wherein each identifier in the sequence of identifiers represents a particular character, and wherein the generating comprises; assigning an identifier to a first character image in the sequence of images; for each subsequent character image in the sequence of images, comparing the subsequent character image to one or more character images that precede the subsequent character image in the sequence of character images; for each subsequent character image that matches one of the previous character images based on the comparison, assigning the identifier that is assigned to the one character image to the subsequent image; and for each subsequent character image that does not match one of the previous character images based on the comparison, assigning a new identifier that is different from each identifier previously assigned; and obtain text based on the sequence of identifiers by; comparing the sequence of identifiers to a plurality of identifier sequences, each identifier sequence corresponding to a sequence of characters; and identifying as the text a sequence of characters corresponding to an identifier sequence that matches the sequence of identifiers. - View Dependent Claims (16, 17, 18)
-
-
19. A system, comprising:
-
means for capturing a scan of a document; means for segmenting the scan into a sequence of character images; means for generating a sequence of identifiers based on the character images, wherein each identifier in the sequence of identifiers represents a particular character, and wherein the generating comprises; assigning an identifier to a first character image in the sequence of images; for each subsequent character image in the sequence of images, comparing the subsequent character image to one or more character images that precede the subsequent character image in the sequence of character images; for each subsequent character image that matches one of the previous character images based on the comparison, assigning the identifier that is assigned to the one character image to the subsequent image; and for each subsequent character image that does not match one of the previous character images based on the comparison, assigning a new identifier that is different from each identifier previously assigned; and means for obtaining text based on the sequence of identifiers, the obtaining comprising; comparing the sequence of identifiers to a plurality of identifier sequences, each identifier sequence corresponding to a sequence of characters; and identifying as the text a sequence of characters corresponding to an identifier sequence that matches the sequence of identifiers. - View Dependent Claims (20)
-
Specification