Optical receipt processing
First Claim
1. A system, comprising:
- one or more servers configured to;
receive a first transformed image of a receipt from a consumer device, wherein the first transformed image is created by programmatically transforming image data captured by a camera using first image parameter values optimized for logo detection;
determine a training logo associated with a merchant;
perform a logo identification based on determining whether the first transformed image includes a logo corresponding with the training logo;
subsequent to determining that the first transformed image includes a logo that corresponds with the training logo associated with the merchant;
determine second image parameter values optimized for text extraction;
create a second transformed image by programmatically transforming the image data using the second image parameter values; and
perform a text extraction using the second transformed image to create receipt text data.
4 Assignments
0 Petitions
Accused Products
Abstract
Techniques for providing improved optical character recognition (OCR) for receipts are discussed herein. Some embodiments may provide for a system including one or more servers configured to perform receipt image cleanup, logo identification, and text extraction. The image cleanup may include transforming image data of the receipt by using image parameters values that optimize the logo identification, and performing logo identification using a comparison of the image data with training logos associated with merchants. When a merchant is identified, a second image clean up may be performed by using image parameter values optimized for text extraction. A receipt structure may be used to categorize the extracted text. Improved OCR accuracy is also achieved by applying on format rules of the receipt structure to the extracted text.
-
Citations
20 Claims
-
1. A system, comprising:
one or more servers configured to; receive a first transformed image of a receipt from a consumer device, wherein the first transformed image is created by programmatically transforming image data captured by a camera using first image parameter values optimized for logo detection; determine a training logo associated with a merchant; perform a logo identification based on determining whether the first transformed image includes a logo corresponding with the training logo; subsequent to determining that the first transformed image includes a logo that corresponds with the training logo associated with the merchant; determine second image parameter values optimized for text extraction; create a second transformed image by programmatically transforming the image data using the second image parameter values; and perform a text extraction using the second transformed image to create receipt text data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
9. A method comprising:
-
receiving a first transformed image of a receipt from a consumer device, wherein the first transformed image is created by programmatically transforming image data captured by a camera using first image parameter values optimized for logo detection; determining a training logo associated with a merchant; performing a logo identification based on determining whether the first transformed image includes a logo corresponding with the training logo; subsequent to determining that the first transformed image includes a logo that corresponds with the training logo associated with the merchant; determining second image parameter values optimized for text extraction; creating a second transformed image by programmatically transforming the image data using the second image parameter values; and performing a text extraction using the second transformed image to create receipt text data. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A computer program product comprising at least one non-transitory computer-readable storage medium having computer-executable program code instruction stored therein, the computer-executable program code instructions comprising program code instructions configured to:
-
receive a first transformed image of a receipt from a consumer device, wherein the first transformed image is created by programmatically transforming image data captured by a camera using first image parameter values optimized for logo detection; determine a training logo associated with a merchant; perform a logo identification based on determining whether the first transformed image includes a logo corresponding with the training logo; subsequent to determining that the first transformed image includes a logo that corresponds with the training logo associated with the merchant; determine second image parameter values optimized for text extraction; create a second transformed image by programmatically transforming the image data using the second image parameter values; and perform a text extraction using the second transformed image to create receipt text data. - View Dependent Claims (17, 18, 19, 20)
-
Specification