Systems, methods and computer program products for determining document validity
First Claim
1. A method, comprising:
- performing optical character recognition (OCR) on an image of a first document;
- generating a list of hypotheses mapping the first document to a complementary document using:
  - textual information from the first document,
  - textual information from the complementary document, and
  - predefined business rules;
- at least one of: correcting OCR errors in the first document, and normalizing data from the complementary document, using at least one of the textual information from the complementary document and the predefined business rules;
- determining a validity of the first document based on the hypotheses; and
- outputting an indication of the determined validity.
Abstract
A method according to one embodiment includes performing optical character recognition (OCR) on an image of a first document; generating a list of hypotheses mapping the first document to a complementary document using: textual information from the first document, textual information from the complementary document, and predefined business rules; at least one of: correcting OCR errors in the first document, and normalizing data from the complementary document, using at least one of the textual information from the complementary document and the predefined business rules; determining a validity of the first document based on the hypotheses; and outputting an indication of the determined validity. Additional systems, methods and computer program products are also presented.
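
The hypothesis-generation and validity steps summarized in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Hypothesis` class, the `same_field_rule` business rule, the string-similarity score, and the 0.8 threshold are all assumptions introduced here, with an invoice standing in for the first document and a purchase order for the complementary document.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class Hypothesis:
    field: str        # field name in the first document (e.g. an invoice)
    ocr_text: str     # value as read by OCR
    reference: str    # candidate value from the complementary document
    score: float      # string similarity in [0, 1]


def similarity(a: str, b: str) -> float:
    """Case-insensitive similarity ratio between two strings."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def same_field_rule(ocr_field: str, ref_field: str) -> bool:
    """Hypothetical business rule: only compare like-named fields."""
    return ocr_field == ref_field


def generate_hypotheses(ocr_fields, reference_fields, rules):
    """List every OCR/reference pairing allowed by all rules, best first."""
    hypotheses = [
        Hypothesis(name, text, ref_text, similarity(text, ref_text))
        for name, text in ocr_fields.items()
        for ref_name, ref_text in reference_fields.items()
        if all(rule(name, ref_name) for rule in rules)
    ]
    return sorted(hypotheses, key=lambda h: h.score, reverse=True)


def determine_validity(hypotheses, required_fields, threshold=0.8):
    """Valid if every required field has a hypothesis at or above threshold."""
    best = {}
    for h in hypotheses:
        best[h.field] = max(best.get(h.field, 0.0), h.score)
    return all(best.get(f, 0.0) >= threshold for f in required_fields)


# Assumed sample data: OCR misread "PO" as "P0" and kept a thousands separator.
invoice = {"po_number": "P0-12345", "total": "1,250.00"}
purchase_order = {"po_number": "PO-12345", "total": "1250.00"}
hyps = generate_hypotheses(invoice, purchase_order, [same_field_rule])
print("valid" if determine_validity(hyps, ["po_number", "total"]) else "invalid")
```

Scoring hypotheses rather than requiring exact matches lets small OCR errors pass while a genuinely different value, such as a mismatched total, still fails the threshold.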
67 Claims
1. A method, comprising:
- performing optical character recognition (OCR) on an image of a first document;
- generating a list of hypotheses mapping the first document to a complementary document using:
  - textual information from the first document,
  - textual information from the complementary document, and
  - predefined business rules;
- at least one of: correcting OCR errors in the first document, and normalizing data from the complementary document, using at least one of the textual information from the complementary document and the predefined business rules;
- determining a validity of the first document based on the hypotheses; and
- outputting an indication of the determined validity.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19
20. A method, comprising:
- determining a validity of a first document by simultaneously considering:
  - textual information from the first document,
  - textual information from a complementary document, and
  - predefined business rules;
- at least one of: correcting OCR errors in the first document, and normalizing data from the first document prior to determining the validity, using at least one of the textual information from the complementary document and the predefined business rules; and
- outputting an indication of the determined validity.
Dependent claims: 21, 22, 23, 24, 25, 26, 27, 28, 29
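
The correcting and normalizing steps of claim 20 might look like the sketch below. It rests on assumptions not found in the claim: the `CONFUSIONS` table of look-alike characters is hypothetical, the known values are taken to come from the complementary document, and the business rule is taken to be "amounts carry two decimals and no currency symbol or thousands separator".

```python
from decimal import Decimal
from itertools import product

# Hypothetical table of characters that OCR engines commonly confuse.
CONFUSIONS = {"O": "0", "0": "O", "l": "1", "1": "l", "S": "5", "5": "S"}


def ocr_variants(text):
    """Yield the text itself, then every spelling reachable by swapping
    confusable characters. Exponential in the number of such characters,
    so only suitable for short fields."""
    options = [(ch, CONFUSIONS[ch]) if ch in CONFUSIONS else (ch,) for ch in text]
    for combo in product(*options):
        yield "".join(combo)


def correct_ocr(text, known_values):
    """Return the first variant found in the complementary document's
    values, or the unchanged text when no variant matches."""
    for variant in ocr_variants(text):
        if variant in known_values:
            return variant
    return text


def normalize_amount(text):
    """Apply the assumed business rule: strip '$' and ',' and keep two decimals."""
    cleaned = text.replace("$", "").replace(",", "").strip()
    return Decimal(cleaned).quantize(Decimal("0.01"))


# Assumed sample values; "PO-12345" stands in for a purchase order number.
corrected = correct_ocr("P0-12345", {"PO-12345"})
amount = normalize_amount("$1,250")
print(corrected, amount)
```

Correcting against values that are already known keeps the search small: a variant is accepted only when the complementary document confirms it, so the routine never invents a value of its own.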
30. A method, comprising:
- receiving an image of a document;
- performing optical character recognition (OCR) on the image of the document;
- extracting an address of a sender of the document from the image based on the OCR;
- comparing the extracted address with content in a first database;
- identifying complementary textual information in a second database based on the address; and
- at least one of:
  - extracting additional content from the image of the document;
  - correcting OCR errors in the document using the complementary textual information, and
  - normalizing data from the document prior to determining a validity of the document using at least one of the complementary textual information and predefined business rules.
Dependent claims: 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48
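
The two-database lookup in claim 30 can be pictured with a short sketch. Everything concrete here is an assumption made for illustration: the US-style address pattern, the in-memory dictionaries standing in for the first and second databases, and the sample sender and purchase order numbers. The OCR step is taken as already done, so the input is plain text.

```python
import re

# Assumed pattern: "<number> <street>, <city>, <ST> <5-digit ZIP>".
ADDRESS_RE = re.compile(
    r"\d+\s+[A-Za-z0-9 .]+,\s*[A-Za-z .]+,\s*[A-Z]{2}\s+\d{5}"
)


def extract_sender_address(ocr_text):
    """Return the first address-like span in the OCR text, or None."""
    match = ADDRESS_RE.search(ocr_text)
    return match.group(0) if match else None


def normalize_address(address):
    """Lowercase and collapse punctuation so lookups ignore formatting."""
    return re.sub(r"[^a-z0-9]+", " ", address.lower()).strip()


def find_complementary(ocr_text, sender_db, complementary_db):
    """Look the sender up by address (first database), then fetch that
    sender's complementary records (second database)."""
    address = extract_sender_address(ocr_text)
    if address is None:
        return None
    sender_id = sender_db.get(normalize_address(address))
    return complementary_db.get(sender_id) if sender_id else None


# Hypothetical stand-ins for the first and second databases.
sender_db = {"12 main st springfield il 62701": "ACME"}
complementary_db = {"ACME": {"open_po_numbers": ["PO-12345", "PO-12399"]}}

page = "ACME Supplies\n12 Main St., Springfield, IL 62701\nInvoice 4471"
print(find_complementary(page, sender_db, complementary_db))
```

Identifying the sender first narrows the complementary records to a handful of candidates, which is what makes later OCR correction and normalization against them practical.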
49. A method, comprising:
- receiving an image of a part or all of a document selected from a group consisting of: an invoice, a bill, a receipt, a sales order, an insurance claim, a medical insurance document, and a benefits document;
- performing optical character recognition (OCR) on the image;
- extracting at least a partial address of a sender of the document;
- comparing the at least partial address of the sender to a plurality of addresses in a first database; and
- identifying one or more of:
  - textual information specific to the sender; and
  - data formatting specific to the sender.
Dependent claims: 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67
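
Claim 49 compares a partial sender address against many stored addresses and then identifies sender-specific formatting. A rough sketch of that idea follows, using fuzzy string matching as an assumed comparison technique; the `SENDERS` table, its date formats, and the 0.5 cutoff are hypothetical values chosen for the example.

```python
from datetime import datetime
from difflib import get_close_matches

# Hypothetical first database: normalized address -> sender-specific details.
SENDERS = {
    "12 main st springfield il 62701": {
        "name": "ACME Supplies",
        "date_format": "%d.%m.%Y",   # this sender writes day.month.year
    },
    "900 harbor blvd oakland ca 94607": {
        "name": "Bay Freight",
        "date_format": "%m/%d/%Y",   # this sender writes month/day/year
    },
}


def match_sender(partial_address, cutoff=0.5):
    """Return the record of the closest stored address, or None when
    nothing is similar enough to the (possibly truncated) input."""
    matches = get_close_matches(
        partial_address.lower(), list(SENDERS), n=1, cutoff=cutoff
    )
    return SENDERS[matches[0]] if matches else None


def parse_sender_date(text, sender):
    """Interpret a date string using the sender's own formatting."""
    return datetime.strptime(text, sender["date_format"]).date()


sender = match_sender("12 main st springf")      # address cut off by the scan
invoice_date = parse_sender_date("03.04.2011", sender)
print(sender["name"], invoice_date.isoformat())
```

Knowing the sender's formatting resolves fields that are otherwise ambiguous: "03.04.2011" reads as 3 April under the first sender's convention, where a month-first convention would give 4 March.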
Specification