Hierarchical classification in credit card data extraction
First Claim
1. A computer-implemented method to extract card information, comprising:
- receiving, by one or more computing devices, an image of a card from a camera;
identifying, by the one or more computing devices, a first area of the image, the first area being selected as a potential location of a digit on the card in the image and of a size that will encompass not more than a single complete digit, the potential location and the size of the first area being identified from a comparison of the image to a database of card layouts stored on the one or more computing devices;
performing, by the one or more computing devices, a linear classification algorithm on data encompassed by the first area;
determining, by the one or more computing devices, a confidence level of a first result of the application of the linear classification algorithm to the first area, wherein the confidence level of the first result indicates a likelihood that the first area encompasses the single complete digit;
determining, by the one or more computing devices, that the first area does not encompasses a single complete digit upon determining that the confidence level of the first result is under a configured threshold;
identifying, by the one or more computing devices, a second area of the image, the second area being in a different location from the first area and of a size that will encompass not more than a single complete digit;
performing, by the one or more computing devices, a linear classification algorithm on data encompassed by the second area;
determining, by the one or more computing devices, a confidence level of a second result of the application of the linear classification algorithm to the second area indicating that the second area encompasses a single complete digit, wherein the confidence level of the second result indicates a likelihood that the second area encompasses the single complete digit;
determining, by the one or more computing devices, that the second area encompasses the single complete digit upon determining that the confidence level of the second result is over a configured threshold; and
performing, by the one or more computing devices, an optical character recognition algorithm on the second area upon a determination that the second area encompasses the single complete digit.
2 Assignments
0 Petitions
Accused Products
Abstract
Embodiments herein provide computer-implemented techniques for allowing a user computing device to extract financial card information using optical character recognition (“OCR”). Extracting financial card information may be improved by applying various classifiers and other transformations to the image data. For example, applying a linear classifier to the image to determine digit locations before applying the OCR algorithm allows the user computing device to use less processing capacity to extract accurate card data. The OCR application may train a classifier to use the wear patterns of a card to improve OCR algorithm performance. The OCR application may apply a linear classifier and then a nonlinear classifier to improve the performance and the accuracy of the OCR algorithm. The OCR application uses the known digit patterns used by typical credit and debit cards to improve the accuracy of the OCR algorithm.
-
Citations
14 Claims
-
1. A computer-implemented method to extract card information, comprising:
-
receiving, by one or more computing devices, an image of a card from a camera; identifying, by the one or more computing devices, a first area of the image, the first area being selected as a potential location of a digit on the card in the image and of a size that will encompass not more than a single complete digit, the potential location and the size of the first area being identified from a comparison of the image to a database of card layouts stored on the one or more computing devices; performing, by the one or more computing devices, a linear classification algorithm on data encompassed by the first area; determining, by the one or more computing devices, a confidence level of a first result of the application of the linear classification algorithm to the first area, wherein the confidence level of the first result indicates a likelihood that the first area encompasses the single complete digit; determining, by the one or more computing devices, that the first area does not encompasses a single complete digit upon determining that the confidence level of the first result is under a configured threshold; identifying, by the one or more computing devices, a second area of the image, the second area being in a different location from the first area and of a size that will encompass not more than a single complete digit; performing, by the one or more computing devices, a linear classification algorithm on data encompassed by the second area; determining, by the one or more computing devices, a confidence level of a second result of the application of the linear classification algorithm to the second area indicating that the second area encompasses a single complete digit, wherein the confidence level of the second result indicates a likelihood that the second area encompasses the single complete digit; determining, by the one or more computing devices, that the second area encompasses the single complete digit upon determining that the confidence level of the second result is over a configured threshold; and performing, by the one or more computing devices, an optical character recognition algorithm on the second area upon a determination that the second area encompasses the single complete digit. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer program product, comprising:
-
a non-transitory computer-readable storage device having computer-executable program instructions embodied thereon that when executed by a computer cause the computer to extract card information, the computer-executable program instructions comprising; computer-executable program instructions to receive an image of a card from a camera; computer-executable program instructions to identify a first area of the image, the first area being selected as a potential location of a digit on the card in the image and of a size that will encompass not more than a single complete digit, the potential location and the size of the first area being identified from a comparison of the image to a database of card layouts stored on the non-transitory computer-readable storage device; computer-executable program instructions to perform a linear classification algorithm on data encompassed by the first area; computer-executable program instructions to determine a confidence level of a first result of the application of the linear classification algorithm to the first area wherein the confidence level of the first result indicates a likelihood that the first area encompasses the single complete digit; computer-executable program instructions to determine that the first area does not encompasses a single complete digit upon determining that the confidence level of the first result is under a configured threshold; computer-executable program instructions to identify a second area of the image, the second area being in a different location from the first area and of a size that will encompass not more than a single complete digit; computer-executable program instructions to perform a linear classification algorithm on data encompassed by the second area; computer-executable program instructions to determine a confidence level of a second result of the application of the linear classification algorithm to the second area indicating that the second area encompasses a single complete digit, wherein the confidence level of the second result indicates a likelihood that the second area encompasses the single complete digit; computer-executable program instructions to determine that the second area encompasses the single complete digit upon determining that the confidence level of the second result is over a configured threshold; and computer-executable program instructions to perform an optical character recognition algorithm on the second area upon a determination that the second area encompasses the single complete digit. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A system to extract card information, comprising:
-
a storage device; a processor communicatively coupled to the storage device, wherein the processor executes application code instructions that are stored in the storage device to cause the system to; receive an image of a card from a camera; identify a first area of the image, the first area being selected as a potential location of a digit on the card in the image and of a size that will encompass not more than a single complete digit, the potential location and the size of the first area being identified from a comparison of the image to a database of card layouts stored on the storage device; perform a linear classification algorithm on data encompassed by the first area; determine a confidence level of a first result of the application of the linear classification algorithm to the first area, wherein the confidence level of the first result indicates a likelihood that the first area encompasses the single complete digit; determine that the first area does not encompasses a single complete digit upon determining that the confidence level of the first result is under a configured threshold; identify a second area of the image, the second area being in a different location from the first area and of a size that will encompass not more than a single complete digit; perform a linear classification algorithm on data encompassed by the second area; determine a confidence level of a second result of the application of the linear classification algorithm to the second area indicating that the second area encompasses a single complete digit; determine that the second area encompasses a single complete digit upon determining that the confidence level of the second result is over a configured threshold; and perform an optical character recognition algorithm on the second area upon a determination that the second area encompasses the single complete digit. - View Dependent Claims (12, 13, 14)
-
Specification