Methods, systems, and computer program products for estimating accuracy of linking of customer relationships
First Claim
1. A method comprising:
- selecting, by a computer based system for estimating accuracy of customer relationships, from a database comprising customer relationships data a limited stratified subset of customer relationships data to form a sample data set;
identifying, by the computer based system, linked relationships associated with different customers and unlinked relationships associated with a same customer as potential linking errors in the sample data set;
validating, by the computer based system, the potential linking errors to identify actual linking errors of the sample data set;
storing, by the computer based system, an error type designation of the actual linking errors of the sample data set; and
estimating, by the computer based system, the linking errors within the database by extrapolating the identified actual linking errors of the sample data set to estimate the linking errors within a data set of the database larger than the sample data set, wherein the estimate includes the linking error type designation.
3 Assignments
0 Petitions
Accused Products
Abstract
The disclosed methods, systems, and computer-program products allow a business to estimate linking errors in customer relationships in a database and to identify metrics that improve the linking accuracy. In an embodiment, a plurality of sample customer relationships are selected from a database to form a sample data set that is statistically representative of the database. Potential linking errors are then identified within the sample data set. The identified potential linking errors are then validated to identify actual linking errors in the sample data set. Once validated, the actual linking errors within the sample data set are used to estimate linking errors within the database. Further, the estimated linking errors in the database may be analyzed to identify one or more factors that contribute to the linking errors.
-
Citations
19 Claims
-
1. A method comprising:
-
selecting, by a computer based system for estimating accuracy of customer relationships, from a database comprising customer relationships data a limited stratified subset of customer relationships data to form a sample data set; identifying, by the computer based system, linked relationships associated with different customers and unlinked relationships associated with a same customer as potential linking errors in the sample data set; validating, by the computer based system, the potential linking errors to identify actual linking errors of the sample data set; storing, by the computer based system, an error type designation of the actual linking errors of the sample data set; and estimating, by the computer based system, the linking errors within the database by extrapolating the identified actual linking errors of the sample data set to estimate the linking errors within a data set of the database larger than the sample data set, wherein the estimate includes the linking error type designation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An article of manufacture including a non-transitory, tangible computer readable medium having instructions stored thereon that, in response to execution by a computer-based system for estimating the accuracy of the customer relationships, cause the computer-based system to perform operations comprising:
-
selecting, by the computer based system, from a database comprising customer relationships data a limited stratified subset of customer relationships data to form a sample data set; identifying, by the computer based system, linked relationships associated with different customers and unlinked relationships associated with a same customer as potential linking errors in the sample data set; validating, by the computer based system, the potential linking errors to identify actual linking errors of the sample data set; storing, by the computer based system, an error type designation of the actual linking errors of the sample data set; and estimating, by the computer based system, the linking errors within the database by extrapolating the identified actual linking errors of the sample data set to estimate the linking errors within a data set of the database larger than the sample data set, wherein the estimate includes the linking error type designation. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A system comprising:
-
a tangible, non-transitory memory communicating with a processor for estimating the accuracy of the customer relationships, the tangible, non-transitory memory having instructions stored thereon that, in response to execution by the processor, cause the processor to perform operations comprising; select, by the processor, from a database comprising customer relationships data a limited stratified subset of customer relationships data to form a sample data set; identify by the processor, linked relationships associated with different customers and unlinked relationships associated with a same customer as potential linking errors in the sample data set; validate by the processor, the potential linking errors to identify actual linking errors of the sample data set; store, by the processor, an error type designation of the actual linking errors of the sample data set; and estimate by the processor, the linking errors within the database by extrapolating the identified actual linking errors of the sample data set to estimate the linking errors within a data set of the database larger than the sample data set, wherein the estimate includes the linking error type designation. - View Dependent Claims (18, 19)
-
Specification