Document recognition method and system
First Claim
1. A document recognition system wherein characters on a document are recognized from an image of the document, and at least the recognized characters are displayed on a display unit, comprising:
- input means for entering the document image which is an object to be recognized;
character line extraction means for obtaining character line data which indicate limits of a character line formed of a character string, including either of a frame and a block surrounding said character string, on said document image entered by said input means, wherein said frame and said block indicate a set character line displayed as a polygon in superposition of the document image by utilizing graphics;
character line correction means for presenting the limits of the character line obtained by said character line extraction means to a user of said document recognition system, superimposed on said document image on said display unit, and for correcting said limits of said character line in accordance with an instruction given by the user;
character segmentation means for deriving character patterns of the individual characters contained in said character line, from said document image on the basis of the character line data;
character recognition means for recognizing the character patterns derived by said character segmentation means, and for converting the recognized character patterns into respectively corresponding character codes; and
display means for causing said display unit to display the character codes of said characters recognized by said character recognition means.
0 Assignments
0 Petitions
Accused Products
Abstract
A document recognition system includes various recognitive steps for document recognition, and various correctional steps corresponding to the recognitive steps. The operation modes of the system are a "sequential mode" (sequential recognition operation) in which the respective recognitive steps are executed in "step by step" fashion, an "auto mode" (batch recognition operation) in which all the recognitive steps are collectively executed, and a "retry mode" (re-recognition operation) in which the execution of any of the correctional steps is automatically followed by the execution of a necessary one of the recognitive steps. In the "sequential mode", any of the recognitive steps having been executed can be shifted, not only to the correctional step corresponding to the executed recognitive step, but also to the correctional step preceding the execution. When an error is involved in a recognized result because the limits of a character line recognized by the system are incorrect, the user of the system corrects the limits of the character line. Then, the system can execute the necessary recognitive step again based on the corrected limits of the character line.
-
Citations
21 Claims
-
1. A document recognition system wherein characters on a document are recognized from an image of the document, and at least the recognized characters are displayed on a display unit, comprising:
-
input means for entering the document image which is an object to be recognized; character line extraction means for obtaining character line data which indicate limits of a character line formed of a character string, including either of a frame and a block surrounding said character string, on said document image entered by said input means, wherein said frame and said block indicate a set character line displayed as a polygon in superposition of the document image by utilizing graphics; character line correction means for presenting the limits of the character line obtained by said character line extraction means to a user of said document recognition system, superimposed on said document image on said display unit, and for correcting said limits of said character line in accordance with an instruction given by the user; character segmentation means for deriving character patterns of the individual characters contained in said character line, from said document image on the basis of the character line data; character recognition means for recognizing the character patterns derived by said character segmentation means, and for converting the recognized character patterns into respectively corresponding character codes; and display means for causing said display unit to display the character codes of said characters recognized by said character recognition means. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A document recognition system wherein characters on a document are recognized from an image of the document, and at least the recognized characters are displayed on a display unit, comprising:
-
input means for entering the document image which is an object to be recognized; character line extraction means for obtaining character line data which indicate limits of a character line formed of a character string, including either of a frame and a block surrounding said character string, on said document image entered by said input means, wherein said frame and block indicate a set character line displayed as a polygon in superposition of the document image by utilizing graphics; layout analysis means for recognizing blocks including a plurality of character lines, and for determining a sequence in which the plurality of blocks are read; character segmentation means for detecting first character segmentation positions being the most likely to be actual boundaries of characters included in said character line and second character segmentation positions being next most likely to be actual boundaries of characters included in said character line, and for successively deriving character patterns of the individual characters contained in said character lines, divided at the first character segmentation positions, from said document image, on the basis of the sequence in which the plurality of blocks are read; character recognition means for recognizing the character patterns derived by said character segmentation means, and for converting the recognized character patterns into respectively corresponding character codes; language processing means for recognizing any inappropriate phrase in text data which includes the character codes of the character string recognized by said character recognition means, with reference to a language dictionary; display means for causing said display unit to display a processed result of said language processing means; character line correction means for presenting the limits of the character line obtained by said character line extraction means to a user of said document recognition system, superimposed on said document image on said display unit, and for correcting said limits of said character line in accordance with an instruction given by the user; skew correction means for correcting a skew angle of said document image displayed on said display unit which has been detected by said system, in accordance with an instruction given by said user; layout correction means for presenting limits of the blocks recognized by said layout analysis means, to said user superimposed on said document image on said display unit, and for correcting the presented limits of said blocks in accordance with an instruction given by said user; character segmentation correction means for presenting said first and second character segmentation positions to said user superimposed on said document image on said display unit, for correcting the character segmentation; character correction means for presenting said characters corresponding to said character code obtained by said character recognition means, to said user on said display unit, and for correcting said character codes in accordance with an instruction given by said user; phrase correction means for presenting the inappropriate phrase recognized by said language processing means to said user, and for correcting said inappropriate phrase in accordance with an instruction given by said user; means for selectively starting any selectable one of said character line correction means, said layout correction means, said character segmentation correction means, said character correction means and said phrase correction means, immediately after any of the processing steps of said character line extraction means, said layout analysis means, said character segmentation means, said character recognition means and said language processing means; and retry control means which includes; a control table for defining, for each of said character line correction means, said layout correction means, said character segmentation correction means, said character correction means and said phrase correction means, combinations of a plurality of means to perform processing after an operation of said each of said correction means, each of said combinations being selected from a group including said character line extraction means, said layout analysis means, said character segmentation means, said character recognition means and said language processing means, and start means for automatically activating, after an operation of one of said correction means, said means included in one of the combinations selected with respect to said one of said correction means, with reference to said control table. - View Dependent Claims (13, 16, 17, 18, 21)
-
-
14. A document recognition method wherein characters on a document are recognized from an image of the document, and at least the recognized characters are displayed on a display unit, said method comprising the steps of:
-
inputting the document image which is an object to be recognized; gaining character line data which indicate limits of a character line formed of a character string, including either of a frame and a block surrounding said character string, on the input document image, thereby extracting the character line, wherein said frame and block indicate a set character line displayed as a polygon in superposition of the document image by utilizing graphics; displaying the limits of the extracted character line superimposed on said document image on said display unit; altering said limits of said character line displayed on said display unit, thereby correcting the character line data; deriving character patterns of the individual characters contained in said character line, from said document image based on the corrected character line data; recognizing the derived character patterns, and converting the recognized character patterns into respectively corresponding character codes; and display the character codes on said display unit, wherein the step of correcting said character line data alters the limits of said character line on said display unit so that two extracted character lines may be connected into a single character line, thereby correcting said character line data. - View Dependent Claims (15, 19, 20)
-
Specification