System and method for analyzing and correcting retail data
First Claim
1. A method comprising:
- identifying a plurality of data sources, wherein at least a first data source is more accurate than a second data source;
identifying a plurality of overlapping attribute segments to use for comparing the data sources;
calculating a factor as a function of each of the plurality of overlapping attribute segments; and
using the factors to update a first group of values in the second data source to reduce bias.
20 Assignments
0 Petitions
Accused Products
Abstract
A computer system and method is disclosed that analyzes and corrects retail data. The system and method includes several client workstations and one or more servers coupled together over a network. A database stores various data used by the system. A business logic server uses competitive and complementary fusion to analyze and correct some of the data sources stored in database server. The data fusion process itself is an iterative one—utilizing both competitive and complementary fusion methods. In competitive fusion, two or more data sources that provide overlapping attributes are compared against each other. More accurate/reliable sources are used to correct less accurate/reliable sources. In complementary fusion, relationships modeled where data sources overlap are projected to areas of the data framework in which fewer sources exist—enhancing the accuracy/reliability of those fewer sources even in the absence of the other sources upon which the models were based.
64 Citations
80 Claims
-
1. A method comprising:
-
identifying a plurality of data sources, wherein at least a first data source is more accurate than a second data source;
identifying a plurality of overlapping attribute segments to use for comparing the data sources;
calculating a factor as a function of each of the plurality of overlapping attribute segments; and
using the factors to update a first group of values in the second data source to reduce bias. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A method comprising:
-
receiving point-of-sale data and panel data on a periodic basis;
identifying a plurality of product identifiers aid a plurality of attributes to analyze;
retrieving and summarizing the point-of-sale data and the panel data by the plurality of product identifiers, the plurality of attributes, and a plurality of corresponding attribute segments for a specified time period;
calculating a factor for each attribute segment of a particular attribute; and
applying the factors for the particular attribute segment to the panel data to correct panel bias. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32)
-
-
33. A method comprising:
-
receiving point-of-sale data and panel data on a periodic basis;
identifying a plurality of product identifiers and a plurality of attributes to analyze;
retrieving and summarizing the point-of-sale data and the panel data by the plurality of product identifiers, the plurality of attributes, and a plurality of corresponding attribute segments for a specified the period;
calculating a factor for each attribute segment of the plurality of attributes; and
applying the factors for each particular attribute segment to the panel data to correct panel bias. - View Dependent Claims (34, 35, 36, 37, 38, 40)
-
-
39. The method of claim 331 wherein the product identifiers are uniform product codes.
-
41. A method, comprising:
-
identifying a plurality of product identifiers and a plurality of attributes to analyze for at least two data sources, wherein at least a first data source is more accurate than a second data source;
retrieving and summarizing the first data source and the second data source by the plurality of product identifiers, the plurality of attributes, and a plurality of corresponding attribute segments for a specified time period;
calculating a plurality of factors, wherein one factor is calculated for each attribute segment of the plurality of attributes;
applying the factors to the second data source to reduce bias; and
applying the factors to the second data source to reduce incompleteness. - View Dependent Claims (43, 44, 45, 46)
-
-
42. The method of 41, further comprising:
-
determining that additional data sources are available for comparison;
using the additional data sources to calculate additional factors; and
applying the additional factors to the second data source.
-
-
47. A system comprising:
-
one or more servers being operable to store retail data from at least two data sources, store product identifier and attribute categorizations, and store a plurality of factor calculations;
wherein the at least two data sources includes a first data source that is more accurate than a second data source; and
wherein one or more of said servers contains business logic that is operable to identify and retrieve a plurality of overlapping attribute segments to use for comparing the at least two data sources, compare each of the overlapping attribute segments, calculate a factor for each of the overlapping attribute segments, and use the factors to update a first group of values in the second data source to reduce bias. - View Dependent Claims (48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60)
-
-
61. An apparatus, comprising:
- a device encoded with logic executable by one or more processors to;
identify and retrieve a plurality of overlapping attribute segments to use for comparing at least two data sources, wherein the at least two data sources includes a first data source that is more accurate than a second data source, compare each of the overlapping attribute segments, calculate a factor for each of the overlapping attribute segments, and use the factors to update a first group of values in the second data source to reduce bias. - View Dependent Claims (62, 63, 64, 65)
- a device encoded with logic executable by one or more processors to;
-
66. A method comprising:
-
identifying a plurality of data sources in at least one computer database, wherein at least a first data source is more accurate than a second data source;
identifying a plurality of overlapping attribute segments to use for comparing the data sources;
calculating at least one factor as a function of at least one of the overlapping attribute segments; and
using the at least one factor to create modified values of a first group of values in the second data source, said modified values being more accurate than the first group of values. - View Dependent Claims (67, 68, 69, 70, 71, 72, 73, 74, 75)
-
-
76. A system comprising:
-
at least one computer being operable to store retail data from at least two data sources, store product identifier and attribute categorizations, and store a plurality of factor calculations;
wherein the at least two data sources includes a first data source that is more accurate than a second data source; and
wherein the at least one computer contains business logic that is operable to identify and retrieve a plurality of overlapping attribute segments to use for comparing the at least two data sources, compare selected overlapping attribute segments, calculate at least one factor for at least one of the overlapping attribute segments, and use the at least one factor to create modified values of a first group of values in the second data source, said modified values being more accurate than the first group of values. - View Dependent Claims (77, 78, 79)
-
-
80. An apparatus, comprising:
- a device encoded with logic executable by one or more processors to;
identify and retrieve a plurality of overlapping attribute segments to use for comparing at least two data sources, wherein the at least two data sources includes a first data source that is more accurate than a second data source, compare selected overlapping attribute segments, calculate at least one factor for at least one of the overlapping attribute segments, and use the at least one factor to create modified values of a first group of values in the second data source, said modified values being more accurate than the first group of values.
- a device encoded with logic executable by one or more processors to;
Specification