Method and system for extracting information from documents by document segregation
First Claim
Patent Images
1. A computer-implemented method for extracting information from a document, comprising:
- analyzing handwritten entries from a plurality of fields on a plurality of documents, wherein more than one of the plurality of fields includes information identifying a recipient of the set of documents;
extracting information from the handwritten entries of the documents and comparing extracted information to preset information;
weighting the information extracted from each of the plurality of fields based on a consistency of information extracted from each of the plurality of fields; and
instructing a processor of the computer to segregate a set of documents from the plurality of documents based on a likelihood that at least one document in said set of documents carries the preset information, said preset information being directed to the recipient of the set of documents.
2 Assignments
0 Petitions
Accused Products
Abstract
A method (and system) for extracting information from a document, includes segregating a set of documents from a plurality of documents based on a likelihood that at least one document in the set of documents carries an instance of a preset information.
196 Citations
30 Claims
-
1. A computer-implemented method for extracting information from a document, comprising:
-
analyzing handwritten entries from a plurality of fields on a plurality of documents, wherein more than one of the plurality of fields includes information identifying a recipient of the set of documents; extracting information from the handwritten entries of the documents and comparing extracted information to preset information; weighting the information extracted from each of the plurality of fields based on a consistency of information extracted from each of the plurality of fields; and instructing a processor of the computer to segregate a set of documents from the plurality of documents based on a likelihood that at least one document in said set of documents carries the preset information, said preset information being directed to the recipient of the set of documents. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A system of extracting information from a document, comprising:
-
an analyzing unit that analyzes handwritten entries from a plurality of fields on a plurality of documents, wherein more than one of the plurality of fields includes information identifying a recipient of the set of documents; an extracting unit that extracts information from the handwritten entries of the documents and compares extracted information to preset information; a weighting unit that weights the information extracted from each of the plurality of fields based on a consistency of information extracted from each of the plurality of fields; and a segregation unit that segregates a set of documents from the plurality of documents based on a likelihood that at least one document in said set of documents carries the preset information, said preset information being directed to the recipient of the set of documents.
-
-
25. A system of extracting information from a document, comprising:
-
means for recognizing handwritten indicia from a plurality of fields in a plurality of documents, wherein more than one of the plurality of fields includes information identifying a recipient of the set of documents, wherein information obtained from each of the plurality of fields is weighted based on its consistency; and means, coupled to said recognizing means, for segregating a set of documents from said plurality of documents based on a likelihood that at least one document in said set of documents carries an instance of a preset information in said indicia, said preset information being directed to the recipient of the set of documents.
-
-
26. A computer-readable medium tangibly embodying a program of machine readable instructions executable by a digital processing apparatus to perform a method for extracting information from a document, said method comprising:
-
analyzing handwritten entries on a plurality of documents; extracting information from the handwritten entries from a plurality of fields of the documents and comparing extracted information to preset information, wherein more than one of the plurality of fields includes information identifying a recipient of the set of documents; weighting the information extracted from each of the plurality of fields based on a consistency of information extracted from each of the plurality of fields; and segregating a set of documents from the plurality of documents based on a likelihood that at least one document in said set of documents carries the preset information, said preset information being directed to the recipient of the set of documents.
-
-
27. A method for deploying computing infrastructure, comprising integrating computer-readable code into a computing system, wherein the computer readable code in combination with the computing system is capable of performing a method for extracting information from a document, said method for extracting information from a document, comprising:
-
analyzing handwritten entries from a plurality of fields on a plurality of documents; extracting information from the handwritten entries of the documents and comparing extracted information to preset information, wherein more than one of the plurality of fields includes information identifying a recipient of the set of documents; weighting the information extracted from each of the plurality of fields based on a consistency of information extracted from each of the plurality of fields; and segregating a set of documents from the plurality of documents based on a likelihood that at least one document in said set of documents carries the preset information, said preset information being directed to the recipient of the set of documents.
-
-
28. A system of extracting information from a document, comprising:
-
a recognition unit that recognizes indicia including handwritten characters from a plurality of fields on a document, wherein more than one of the plurality of fields includes information identifying a recipient of the set of documents; a weighting unit that weights the characters from each of the plurality of fields based on a consistency of characters recognized from each of the plurality of fields; and a segregating unit that segregates a batch of documents from a plurality of documents based on a likelihood that at least one document in said batch of documents carries an instance of a preset information in said indicia, wherein said recognition unit recognizes said handwritten text on a front side and a rear side of said document, said preset information being directed to the recipient of the set of documents. - View Dependent Claims (29, 30)
-
Specification