ITEMIZED RECEIPT EXTRACTION USING MACHINE LEARNING
First Claim
Patent Images
1. A method, comprising:
- retrieving, by a computer, a transaction receipt comprising unstructured data;
extracting features indicating details of the transaction from the unstructured data;
applying, using a receipt language model, weights to the features;
associating, based on the features and the weights, labels with tokens in the receipt, the tokens comprising values stored in the unstructured data; and
updating the receipt language model with the extracted features, the applied weights and the associated labels.
1 Assignment
0 Petitions
Accused Products
Abstract
A method, including retrieving a transaction receipt, wherein the transaction receipt includes unstructured data. Features indicating details of the transaction are extracted from the unstructured data, and using a receipt language model, weights are applied to the features. Based on the features and the weights, labels are associated with tokens in the receipt, and the receipt language model is updated with the extracted features, the applied weights and the associated labels.
93 Citations
21 Claims
-
1. A method, comprising:
-
retrieving, by a computer, a transaction receipt comprising unstructured data; extracting features indicating details of the transaction from the unstructured data; applying, using a receipt language model, weights to the features; associating, based on the features and the weights, labels with tokens in the receipt, the tokens comprising values stored in the unstructured data; and updating the receipt language model with the extracted features, the applied weights and the associated labels. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An apparatus, comprising:
-
a memory configured to store a transaction receipt comprising unstructured data; and a processor configured to extract features indicating details of the transaction from the unstructured data, to apply, using a receipt language model, weights to the features, to associate, based on the features and the weights, labels with tokens in the receipt, the tokens comprising values stored in the unstructured data, and to update the receipt language model with the extracted features, the applied weights and the associated labels. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer software product comprising a non-transitory computer-readable medium, in which program instructions are stored, which instructions, when read by a computer executing a user interface, cause the computer to retrieve a transaction receipt comprising unstructured data, to extract features indicating details of the transaction from the unstructured data, to apply, using a receipt language model, weights to the features, associate, based on the features and the weights, labels with tokens in the receipt, the tokens comprising values stored in the unstructured data, and to update the receipt language model with the extracted features, the applied weights and the associated labels.
Specification