Document search method and apparatus
Abstract
In a document search method for searching for a document, a character recognition process is applied to an image of a search document, and text data which is estimated to be correctly recognized is extracted from the text data obtained by the character recognition process. Text feature information is generated based on the extracted text data, and a plurality of documents are searched for a document corresponding to the search document using the generated text feature information as a query.
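The flow summarized above (recognize, extract reliable text, generate features, search) can be illustrated with a minimal Python sketch. It is not the patented implementation. Several things are assumptions made only for illustration: the OCR output is simulated as hypothetical (word, confidence) pairs, "estimated to be correctly recognized" is approximated by a confidence threshold, the text feature information is a simple term-frequency vector, and the search ranks documents by cosine similarity. All function names and sample data are hypothetical.

```python
import math
from collections import Counter


def extract_reliable_text(ocr_words, min_confidence=0.8):
    """Extraction step (assumed criterion): keep words whose OCR
    confidence meets the threshold, lowercased for matching."""
    return [word.lower() for word, conf in ocr_words if conf >= min_confidence]


def generate_text_features(words):
    """Generation step (assumed form): a term-frequency vector."""
    return Counter(words)


def cosine_similarity(a, b):
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[term] * b[term] for term in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search_documents(query_features, corpus):
    """Search step: rank registered documents by similarity to the query."""
    scored = [
        (doc_id, cosine_similarity(query_features,
                                   generate_text_features(text.lower().split())))
        for doc_id, text in corpus.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Simulated character recognition result for the search document image.
# The low-confidence entries stand in for misrecognized words.
ocr_result = [
    ("Quarterly", 0.97), ("sales", 0.95), ("rep0rt", 0.41),
    ("for", 0.92), ("Tokyo", 0.90), ("0ffice", 0.38),
]

# Hypothetical registered documents to be searched.
corpus = {
    "doc_a": "quarterly sales report for the tokyo office",
    "doc_b": "employee handbook and vacation policy",
}

reliable = extract_reliable_text(ocr_result)
features = generate_text_features(reliable)
ranking = search_documents(features, corpus)
best_match = ranking[0][0]
print(reliable, best_match)
```

Because the misrecognized words are dropped before the query is built, they cannot pull the ranking toward unrelated documents; the trade-off is that a threshold set too high leaves too few terms to discriminate between candidates.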
18 Claims
1. A document search method for searching for a document, comprising:
a character recognition step of executing a character recognition process for an image of a search document;
an extraction step of extracting text data which is estimated to be correctly recognized from text data obtained in the character recognition step;
a generation step of generating text feature information on the basis of the text data extracted in the extraction step; and
a search step of searching a plurality of documents for a document corresponding to the search document using the text feature information generated in the generation step as a query.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 17, 18
9. A document search apparatus for searching for a document, comprising:
a character recognition unit configured to execute a character recognition process for an image of a search document;
an extraction unit configured to extract text data which is estimated to be correctly recognized from text data obtained by said character recognition unit;
a generation unit configured to generate text feature information on the basis of the text data extracted by said extraction unit; and
a search unit configured to search a plurality of documents for a document corresponding to the search document using the text feature information generated by said generation unit as a query.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
Specification