AUTOMATED DOCUMENT RECOGNITION, IDENTIFICATION, AND DATA EXTRACTION
First Claim
1. A processor-implemented method for automated document recognition, identification and data extraction, the method comprising:
- receiving a video stream associated with the document, the document being associated with a user;
detecting an image of the document in the video stream, the detecting including recognizing a shape corresponding to the document overall;
improving the detected image of the document in the video stream by adjusting colors, adjusting brightness, and removing blurring;
extracting the detected image of the document from the video stream, the image being a still image;
analyzing the extracted image using optical character recognition to produce image data, the image data including text zones, each of the text zones being associated with one or more distances to other text zones and one or more borders of the document, the one or more distances being determined using coordinates;
comparing the extracted image to one or more document templates using the image data;
determining a document template having a highest degree of coincidence with the extracted image using the comparison;
matching the text zones of the extracted image with text zones of the document template to determine a type of data in each text zone; and
structuring the data into a standard format to obtain structured data.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for automated document recognition, identification, and data extraction is described herein. The method comprises receiving, by the processor, an image of a document associated with a user. The image is analyzed using optical character recognition to obtain image data, wherein the image data includes text zones. Based on the image data, the image is compared to one or more document templates. Based on the comparison, a document template having the highest degree of coincidence with the image is determined. The text zones of the image are associated with text zones of the document template to determine a type of data in each text zone. The data is structured into a standard format to obtain structured data.
-
Citations
25 Claims
-
1. A processor-implemented method for automated document recognition, identification and data extraction, the method comprising:
-
receiving a video stream associated with the document, the document being associated with a user; detecting an image of the document in the video stream, the detecting including recognizing a shape corresponding to the document overall; improving the detected image of the document in the video stream by adjusting colors, adjusting brightness, and removing blurring; extracting the detected image of the document from the video stream, the image being a still image; analyzing the extracted image using optical character recognition to produce image data, the image data including text zones, each of the text zones being associated with one or more distances to other text zones and one or more borders of the document, the one or more distances being determined using coordinates; comparing the extracted image to one or more document templates using the image data; determining a document template having a highest degree of coincidence with the extracted image using the comparison; matching the text zones of the extracted image with text zones of the document template to determine a type of data in each text zone; and structuring the data into a standard format to obtain structured data. - View Dependent Claims (2, 3, 4, 6, 7, 8, 10)
-
-
5. (canceled)
-
9. (canceled)
-
11. (canceled)
-
12. A system for automated document recognition, identification and data extraction, the system comprising:
-
a processor; a memory coupled to the processor, the memory storing instructions, the instructions being executable by the processor to perform a method, the method comprising; receiving a video stream associated with a document associated with a user, detecting an image of the document in the video stream, the detecting including recognizing a shape corresponding to the identification document overall, improving the detected image of the document in the video stream by adjusting colors, adjusting brightness, and removing blurring, extracting the detected image of the document from the video stream, the image being a still image, analyzing the extracted image using optical character recognition to produce image data, the image data including text zones, each of the text zones being associated with one or more distances to other text zones and one or more borders of the document, the one or more distances being determined using coordinates, comparing the extracted image to one or more document templates using the image data, determining a document template having a highest degree of coincidence with the extracted image using the comparison, matching the text zones of the image with text zones of the document template to determine a type of data in each text zone; and structuring the data into a standard format to obtain structured data; and a database communicatively coupled to the processor, the database storing the one or more document templates. - View Dependent Claims (13, 15, 16, 17, 19)
-
-
14. (canceled)
-
18. (canceled)
-
20. A non-transitory computer-readable storage medium having embodied thereon a program, the program being executable by one or more processors to perform the a method, the method comprising:
-
receiving a video stream associated with a document, the document being associated with a user; detecting an image of the document, the detecting including recognizing a shape corresponding to the document overall; improving the detected image of the document in the video stream by adjusting colors, adjusting brightness, and removing blurring; extracting the detected image of the document from the video stream, the image being a still image; analyzing the extracted image using optical character recognition to produce image data, the image data including text zones, each of the text zones being associated with one or more distances to other text zones and one or more borders of the document, the one or more distances being determined using coordinates; comparing the extracted image to one or more document templates using the image data; determining a document template having a highest degree of coincidence with the extracted image using the comparison; matching the text zones of the image with text zones of the document template to determine a type of data in each text zone; and structuring the data into a standard format to obtain structured data. - View Dependent Claims (21, 22, 23, 24, 25)
-
Specification