Resolving similar entities from a transaction database
First Claim
1. A method for identifying related transaction records from a database storing transaction records for multiple entities performed by one or more processors of a computer system, the method comprising:
- acquiring a plurality of transaction record sets, wherein each transaction record set includes one or more of the transaction records sharing a common attribute value and wherein the transaction records are credit or debit transactions processed by a financial institution for a merchant;
receiving a selection of or selecting an exemplar record set of the plurality of transaction record sets, wherein the exemplar record set comprises a plurality of the transaction records associated with a first entity of the multiple entities;
for at least one of the acquired plurality of transaction record sets;
determining a probability that the transaction record set stores transaction records associated with the first entity based at least in part on a machine learning classifier, the machine learning classifier being trained using one or more first pairs of transaction record sets and one or more second pairs of transaction record sets, wherein a first pair of transaction record sets represents a common entity and a second pair of transaction record sets represents unrelated entities, andupon determining the probability exceeds a threshold, resolving the transaction record set as storing transaction records associated with the first entity, the resolving including merging the transaction records of the transaction record set into the exemplar record set that comprises the plurality of the transaction records associated with the first entity.
8 Assignments
0 Petitions
Accused Products
Abstract
A technique for identifying related transaction records from a database storing transaction records for multiple entities includes grouping transaction records with a common attribute value into transaction record sets, receiving a selection of an exemplar record set and determining the probability the transaction record set stores transaction records associated with a first entity. Other operations include resolving the transaction record set as storing transaction records associated with the first entity. This improves the process of identifying related transaction records because related transaction records missed by string comparisons transaction record attributes are detected.
430 Citations
20 Claims
-
1. A method for identifying related transaction records from a database storing transaction records for multiple entities performed by one or more processors of a computer system, the method comprising:
-
acquiring a plurality of transaction record sets, wherein each transaction record set includes one or more of the transaction records sharing a common attribute value and wherein the transaction records are credit or debit transactions processed by a financial institution for a merchant; receiving a selection of or selecting an exemplar record set of the plurality of transaction record sets, wherein the exemplar record set comprises a plurality of the transaction records associated with a first entity of the multiple entities; for at least one of the acquired plurality of transaction record sets; determining a probability that the transaction record set stores transaction records associated with the first entity based at least in part on a machine learning classifier, the machine learning classifier being trained using one or more first pairs of transaction record sets and one or more second pairs of transaction record sets, wherein a first pair of transaction record sets represents a common entity and a second pair of transaction record sets represents unrelated entities, and upon determining the probability exceeds a threshold, resolving the transaction record set as storing transaction records associated with the first entity, the resolving including merging the transaction records of the transaction record set into the exemplar record set that comprises the plurality of the transaction records associated with the first entity. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A non-transitory computer-readable storage medium storing instructions that, when executed by a processor, cause the processor to perform an operation for identifying related transaction records from a database storing transaction records for multiple entities, the method comprising:
-
acquiring a plurality of transaction record sets, wherein each transaction record set includes one or more of the transaction records sharing a common attribute value and wherein the transaction records are credit or debit transactions processed by a financial institution for a merchant; receiving a selection of or selecting an exemplar record set of the plurality of transaction record sets, wherein the exemplar record set comprises a plurality of the transaction records associated with a first entity of the multiple entities; for at least one of the acquired plurality of transaction record sets; determining a probability that the transaction record set stores transaction records associated with the first entity based at least in part on a machine learning classifier, the machine learning classifier being trained using one or more first pairs of transaction record sets and one or more second pairs of transaction record sets, wherein a first pair of transaction record sets represents a common entity and a second pair of transaction record sets represents unrelated entities, and upon determining the probability exceeds a threshold, resolving the transaction record set as storing transaction records associated with the first entity, the resolving including merging the transaction records of the transaction record set into the exemplar record set that comprises the plurality of the transaction records associated with the first entity. - View Dependent Claims (10, 11, 12, 13, 14, 20)
-
-
15. A computer system, comprising:
-
a memory; and a processor storing one or more programs configured to perform an operation for identifying related transaction records from a database storing transaction records for multiple entities, the method comprising; acquiring a plurality of transaction record sets, wherein each transaction record set includes one or more of the transaction records sharing a common attribute value and wherein the transaction records are credit or debit transactions processed by a financial institution for a merchant; receiving a selection of or selecting an exemplar record set of the plurality of transaction record sets, wherein the exemplar record set comprises a plurality of the transaction records associated with a first entity of the multiple entities;
for at least one of the acquired plurality of transaction record sets;determining a probability that the transaction record set stores transaction records associated with the first entity based at least in part on a machine learning classifier, the machine learning classifier being trained using one or more first pairs of transaction record sets and one or more second pairs of transaction record sets, wherein a first pair of transaction record sets represents a common entity and a second pair of transaction record sets represents unrelated entities, and upon determining the probability exceeds a threshold, resolving the transaction record set as storing transaction records associated with the first entity, the resolving including merging the transaction records of the transaction record set into the exemplar record set that comprises the plurality of the transaction records associated with the first entity. - View Dependent Claims (16, 17, 18, 19)
-
Specification