SYSTEMS AND METHODS FOR MATCHING TRANSACTIONAL DATA
First Claim
1. A computer implemented method comprising:
- receiving transactional data for a first user, the transactional data comprising a plurality of records, and each record comprising a first plurality of fields;
receiving second data for the first user, the second data comprising a second plurality of fields corresponding to at least a portion of the first plurality of fields;
selecting transactional data records for the first user from a data store of transactional data for a plurality of users;
determining a plurality of similarities between a plurality of fields from the first plurality of fields of each of the transactional data records for the first user and a corresponding plurality of fields from the second plurality of fields of second data for the first user;
determining a likelihood of a match between the first plurality of fields of the transactional data records for the first user and the second plurality of fields for the second data for the first user based on the plurality of similarities using a machine learning model, and in accordance therewith, identifying one record in the transactional data records for the first user that generates said likelihood of the match above a first threshold;
replacing one or more of values in the second plurality of fields with one or more corresponding values in the first plurality of fields from the identified one record; and
storing the second plurality of fields in a record in a database.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments of the present disclosure pertain to matching transactional data. In one embodiment, the present disclosure includes a computer implemented method comprising receiving transactional data for a first user and second data for the first user, selecting transactional data records for the first user from a data store of transactional data for a plurality of users, determining a plurality of similarities between fields of the transactional data and second data, determining a likelihood of a match between a transactional data field and a second data field based on the plurality of similarities using a machine learning model, and in accordance therewith, identifying one record in the transactional data records for the first user that generates said likelihood of the match above a first threshold, and replacing values second data fields with corresponding values in the one record.
8 Citations
20 Claims
-
1. A computer implemented method comprising:
-
receiving transactional data for a first user, the transactional data comprising a plurality of records, and each record comprising a first plurality of fields; receiving second data for the first user, the second data comprising a second plurality of fields corresponding to at least a portion of the first plurality of fields; selecting transactional data records for the first user from a data store of transactional data for a plurality of users; determining a plurality of similarities between a plurality of fields from the first plurality of fields of each of the transactional data records for the first user and a corresponding plurality of fields from the second plurality of fields of second data for the first user; determining a likelihood of a match between the first plurality of fields of the transactional data records for the first user and the second plurality of fields for the second data for the first user based on the plurality of similarities using a machine learning model, and in accordance therewith, identifying one record in the transactional data records for the first user that generates said likelihood of the match above a first threshold; replacing one or more of values in the second plurality of fields with one or more corresponding values in the first plurality of fields from the identified one record; and storing the second plurality of fields in a record in a database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A non-transitory machine-readable medium storing a program executable by at least one processing unit of a computer, the program comprising sets of instructions for:
-
receiving transactional data for a first user, the transactional data comprising a plurality of records, and each record comprising a first plurality of fields; receiving OCR data for the first user corresponding to optical character recognition of a physical transaction receipt from a picture of the receipt taken on a mobile device, the OCR data comprising a second plurality of fields corresponding to at least a portion of the first plurality of fields; selecting transactional data records for the first user from a data store of transactional data for a plurality of users; determining a plurality of similarities between a plurality of fields from the first plurality of fields of each of the transactional data records for the first user and a corresponding plurality of fields from the second plurality of fields of OCR data for the first user, wherein determining at least one similarity of the plurality of similarities comprises; determining a first similarity, the first similarity comprising a similarity between a first character string and a second character string, wherein the first character string is an intersection of a first character field in the first plurality of fields and a corresponding first character field in the second plurality of fields, and wherein the second character string is the first character field; determining a second similarity, the second similarity comprising a similarity between the first character string and a third character string, wherein the third character string is the second character field; determining a third similarity, the third similarity comprising a similarity between the first character field and the second character field; and selecting the maximum similarity from the first similarity, the second similarity, and the third similarity; determining a likelihood of a match between the first plurality of fields of the transactional data records for the first user and the second plurality of fields for the OCR data for the first user based on the selected maximum similarity and similarities between a plurality of other fields of the first and second plurality of fields using a random forest machine learning model, and in accordance therewith, identifying one record in the transactional data records that corresponds to the OCR data for the first user; and replacing one or more of values in the second plurality of fields with one or more corresponding values in the first plurality of fields from the identified one record. - View Dependent Claims (13, 14, 15)
-
-
16. A computer system comprising:
-
a processor; and a non-transitory machine-readable medium storing a program executable by the processor, the program comprising sets of instructions for; receiving transactional data for a first user, the transactional data comprising a plurality of records, and each record comprising a first plurality of fields; receiving second data for the first user, the second data comprising a second plurality of fields corresponding to at least a portion of the first plurality of fields; selecting transactional data records for the first user from a data store of transactional data for a plurality of users; determining a plurality of similarities between a plurality of fields from the first plurality of fields of each of the transactional data records for the first user and a corresponding plurality of fields from the second plurality of fields of second data for the first user; determining a likelihood of a match between the first plurality of fields of the transactional data records for the first user and the second plurality of fields for the second data for the first user based on the plurality of similarities using a machine learning model, and in accordance therewith, identifying one record in the transactional data records for the first user that generates said likelihood of the match above a first threshold; replacing one or more of values in the second plurality of fields with one or more corresponding values in the first plurality of fields from the identified one record; and storing the second plurality of fields in a record in a database. - View Dependent Claims (17, 18, 19, 20)
-
Specification