Document processing apparatus, document processing method and scanner
Abstract
The disclosure provides a document processing apparatus, a document processing method, and a scanner. The document processing apparatus includes: a text line extraction unit that extracts a text line from an input document; a language classification unit that determines whether an OCR process is necessary for a language of the input document; an OCR unit that, when the OCR process is determined to be necessary, performs the OCR process to determine an OCR confidence; a graphic feature recognition unit that determines a graphic feature recognition confidence; and a determination unit that determines a combination confidence based on at least one of the determined graphic feature recognition confidences and the determined OCR confidences, and determines an orientation of the input document based on the combination confidences. This technical solution can better determine the orientation of a document, and is especially applicable when the image quality of the document has deteriorated.
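The determination step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the two confidences are combined or how per-line results are aggregated, so everything specific here is an assumption made for illustration: the four candidate directions, the weighted average in `combine`, the `ocr_weight` parameter, and the summation over text lines in `document_orientation`.

```python
from typing import Dict, List, Optional, Tuple

# Assumed candidate directions in degrees; the source only says "each candidate direction".
DIRECTIONS = (0, 90, 180, 270)

# For one text line: direction -> (graphic confidence, OCR confidence or None if OCR was skipped).
LineConfidences = Dict[int, Tuple[float, Optional[float]]]


def combine(graphic: float, ocr: Optional[float], ocr_weight: float = 0.5) -> float:
    """Combination confidence for one text line and one candidate direction.

    Assumption: a weighted average when both confidences exist, and the
    graphic confidence alone when OCR was not performed.
    """
    if ocr is None:
        return graphic
    return (1.0 - ocr_weight) * graphic + ocr_weight * ocr


def document_orientation(lines: List[LineConfidences]) -> int:
    """Sum the combination confidences over all text lines and return the best direction.

    Assumption: simple summation; the source does not specify the aggregation.
    """
    totals = {d: 0.0 for d in DIRECTIONS}
    for per_direction in lines:
        for d in DIRECTIONS:
            graphic, ocr = per_direction[d]
            totals[d] += combine(graphic, ocr)
    return max(totals, key=totals.get)


# Two text lines: the first has both confidences, the second has graphic confidences only.
example_lines = [
    {0: (0.6, 0.9), 90: (0.1, 0.0), 180: (0.2, 0.1), 270: (0.1, 0.0)},
    {0: (0.3, None), 90: (0.1, None), 180: (0.5, None), 270: (0.1, None)},
]
best_direction = document_orientation(example_lines)
```

In this example the second line on its own favors 180 degrees, but the first line's strong OCR confidence for 0 degrees outweighs it once the per-line values are summed.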
19 Claims
1. A document processing apparatus, comprising:
a memory;
a processor coupled to the memory and configured to:
extract at least one text line from an input document;
determine, by performing a graphic feature recognition process, whether or not an optical character recognition process is necessary for a language of the input document;
determine, by performing the optical character recognition process, an optical character recognition confidence with respect to each candidate direction for each of at least some of the text lines, in the case that it is determined that the optical character recognition process is necessary for the language of the input document;
determine, by performing the graphic feature recognition process, a graphic feature recognition confidence with respect to each candidate direction for each of the text lines; and
determine, based on at least one of the determined graphic feature recognition confidences and the determined optical character recognition confidences, a combination confidence with respect to each candidate direction for each of the at least some of the text lines, and determine, based on the combination confidences, an orientation of the input document.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A document processing method, comprising:
extracting at least one text line from an input document;
determining, by performing a graphic feature recognition process, whether or not an optical character recognition process is necessary for a language of the input document;
determining, by performing the optical character recognition process, an optical character recognition confidence with respect to each candidate direction for each of at least some of the text lines, in the case that it is determined that the optical character recognition process is necessary for the language of the input document;
determining, by performing the graphic feature recognition process, a graphic feature recognition confidence with respect to each candidate direction for each of the text lines; and
determining, based on at least one of the determined graphic feature recognition confidences and the determined optical character recognition confidences, a combination confidence with respect to each candidate direction for each of the at least some of the text lines, and determining, based on the combination confidences, an orientation of the input document.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19
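The conditional structure of the method can be sketched as follows: graphic feature recognition runs for every text line and every candidate direction, while OCR runs only when it has been found necessary for the document's language, and then only for some of the lines. This is a hedged illustration, not the patented implementation. The function name `collect_confidences`, the two recognizer callables, the `ocr_needed` flag (standing in for the language determination), and the rule of applying OCR to the first `ocr_line_limit` lines are all hypothetical.

```python
from typing import Any, Callable, Dict, List, Optional, Tuple

# For one text line: direction -> (graphic confidence, OCR confidence or None).
LineConfidences = Dict[int, Tuple[float, Optional[float]]]


def collect_confidences(
    text_lines: List[Any],
    directions: Tuple[int, ...],
    ocr_needed: bool,
    graphic_conf: Callable[[Any, int], float],
    ocr_conf: Callable[[Any, int], float],
    ocr_line_limit: int = 2,
) -> List[LineConfidences]:
    """Gather per-line, per-direction confidences before they are combined.

    Assumption: "at least some of the text lines" is modeled as the first
    `ocr_line_limit` lines; the source does not say how lines are selected.
    """
    results: List[LineConfidences] = []
    for index, line in enumerate(text_lines):
        # OCR is attempted only if required for the language and the line is selected.
        use_ocr = ocr_needed and index < ocr_line_limit
        per_direction: LineConfidences = {}
        for direction in directions:
            graphic = graphic_conf(line, direction)  # always computed
            ocr = ocr_conf(line, direction) if use_ocr else None
            per_direction[direction] = (graphic, ocr)
        results.append(per_direction)
    return results
```

Passing the recognizers in as callables keeps the sketch independent of any particular OCR or feature extraction library, which the source does not name.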
Specification