System for dynamic optical character recognition tuning
First Claim
1. A system for dynamically tuning optical character recognition processes, the system comprising:
- a memory device; and
a processing device operatively coupled to the memory device, wherein the processing device is configured to execute computer-readable program code to;
receive an image of a resource document comprising image coordinates associated with the resource document;
apply a general optical character recognition process to the image of the resource document to identify a resource document source or a resource document type;
identify an expected image coordinate area of the image of the resource document associated with a data field of the resource document;
apply a data field-specific optical character recognition process to the expected image coordinate area of the image of the resource document;
determine that the data field is missing from the expected image coordinate area of the image of the resource document;
in response to determining that the data field is missing, cause a user interface of a computing device to display the image of the resource document;
receive, from the user interface of the computing device, a user input associated with an updated expected image coordinate area for the data field;
apply the data field-specific optical character recognition process to the updated expected image coordinate area for the data field within the image of the resource document to extract a value of the data field;
replace the expected image coordinate area in a database with the updated expected image coordinate area; and
associate the stored updated expected image coordinate area for the data field with the identified resource document source or the resource document type.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments of the present invention provide a system for dynamically tuning optical character recognition (OCR) processes. The system receives or captures an image of a resource document and uses a general or default OCR process to identify a source of the document and values of multiple data fields in the image of the document. When the system determines that a data field is missing or cannot be extracted, it causes a computing device to display the image of the resource document and requests user input of a coordinate area of the missing data field from an associated specialist. Once the user input is received, the system applies a data field-specific OCR process on the coordinate area of the missing data field to extract the value of the data field. This value of the missing data field can be transmitted to a processing system for further processing.
71 Citations
20 Claims
-
1. A system for dynamically tuning optical character recognition processes, the system comprising:
-
a memory device; and a processing device operatively coupled to the memory device, wherein the processing device is configured to execute computer-readable program code to; receive an image of a resource document comprising image coordinates associated with the resource document; apply a general optical character recognition process to the image of the resource document to identify a resource document source or a resource document type; identify an expected image coordinate area of the image of the resource document associated with a data field of the resource document; apply a data field-specific optical character recognition process to the expected image coordinate area of the image of the resource document; determine that the data field is missing from the expected image coordinate area of the image of the resource document; in response to determining that the data field is missing, cause a user interface of a computing device to display the image of the resource document; receive, from the user interface of the computing device, a user input associated with an updated expected image coordinate area for the data field; apply the data field-specific optical character recognition process to the updated expected image coordinate area for the data field within the image of the resource document to extract a value of the data field; replace the expected image coordinate area in a database with the updated expected image coordinate area; and associate the stored updated expected image coordinate area for the data field with the identified resource document source or the resource document type. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program product for dynamically tuning optical character recognition processes, the computer program product comprising at least one non-transitory computer readable medium comprising computer readable instructions that, when executed, perform steps of:
-
receiving an image of a resource document comprising image coordinates associated with the resource document; applying a general optical character recognition process to the image of the resource document to identify a resource document source or a resource document type; identifying an expected image coordinate area of the image of the resource document associated with a data field of the resource document; applying a data field-specific optical character recognition process to the expected image coordinate area of the image of the resource document; determining that the data field is missing from the expected image coordinate area of the image of the resource document; in response to determining that the data field is missing, causing a user interface of a computing device to display the image of the resource document; receiving, from the user interface of the computing device, a user input associated with an updated expected image coordinate area for the data field; applying the data field-specific optical character recognition process to the updated expected image coordinate area for the data field within the image of the resource document to extract a value of the data field; replacing the expected image coordinate area in a database with the updated expected image coordinate area; and associating the stored updated expected image coordinate area for the data field with the identified resource document source or the resource document type. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer implemented method for dynamically tuning optical character recognition processes, said computer implemented method comprising:
providing a computing system comprising a computer processing device and a non-transitory computer readable medium, where the computer readable medium comprises configured computer program instruction code, such that when said instruction code is operated by said computer processing device, said computer processing device performs the following operations; receiving an image of a resource document comprising image coordinates associated with the resource document; applying a general optical character recognition process to the image of the resource document to identify a resource document source or a resource document type; identifying an expected image coordinate area of the image of the resource document associated with a data field of the resource document; applying a data field-specific optical character recognition process to the expected image coordinate area of the image of the resource document; determining that the data field is missing from the expected image coordinate area of the image of the resource document; in response to determining that the data field is missing, causing a user interface of a computing device to display the image of the resource document; receiving, from the user interface of the computing device, a user input associated with an updated expected image coordinate area for the data field; applying the data field-specific optical character recognition process to the updated expected image coordinate area for the data field within the image of the resource document to extract a value of the data field; replacing the expected image coordinate area in a database with the updated expected image coordinate area; and associating the stored updated expected image coordinate area for the data field with the identified resource document source or the resource document type. - View Dependent Claims (18, 19, 20)
Specification