System and Method for Transaction Log Cleansing and Aggregation
Abstract
A computer-implemented method prepares data for modeling. The method comprises storing data from customer sales transactions in a database and retrieving a dataset of the data from the database. The dataset may include promotion and merchandizing entries. The method includes cleansing the dataset to remove erroneous and anomalous entries. Cleansing the dataset may include determining a threshold value from the dataset and determining whether a value of the dataset exceeds the threshold value, and determining an out-of-stock status for a product from the dataset. The method includes aggregating the dataset over a plurality of dimensions of the transactional space including store, product, and time dimensions, and analyzing the dataset following the cleansing and aggregating steps within a model to predict attributes of subsequent sales transactions.
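The abstract does not state how the threshold value or the out-of-stock status is derived from the dataset. The following is a minimal sketch under stated assumptions: the row layout, the median-multiple threshold, and the zero-sales-run heuristic are all hypothetical choices made for illustration and are not taken from the patent.

```python
from statistics import median

# Hypothetical transaction rows: (store, product, day, units, price).
ROWS = [
    ("S1", "P1", 1, 4, 2.50),
    ("S1", "P1", 2, 5, 2.50),
    ("S1", "P1", 3, 400, 2.50),  # anomalous quantity
    ("S1", "P1", 4, -1, 2.50),   # erroneous negative quantity
    ("S1", "P1", 5, 6, 2.50),
]


def cleanse(rows, factor=10.0):
    """Drop erroneous rows, then rows whose units exceed a data-derived threshold.

    Assumption: the threshold is `factor` times the median unit count.
    """
    # Erroneous entries: negative quantities or non-positive prices.
    valid = [r for r in rows if r[3] >= 0 and r[4] > 0]
    threshold = factor * median(r[3] for r in valid)
    # Anomalous entries: quantities above the threshold.
    kept = [r for r in valid if r[3] <= threshold]
    return kept, threshold


def out_of_stock_days(units_by_day, min_run=3):
    """Return the day indices inside zero-sales runs of at least `min_run` days.

    Assumption: a sustained run of zero sales indicates an out-of-stock
    condition rather than a genuine lack of demand.
    """
    flagged, run = [], []
    for day, units in enumerate(units_by_day):
        if units == 0:
            run.append(day)
            continue
        if len(run) >= min_run:
            flagged.extend(run)
        run = []
    if len(run) >= min_run:
        flagged.extend(run)
    return flagged


clean_rows, unit_threshold = cleanse(ROWS)
oos_days = out_of_stock_days([5, 0, 0, 0, 4, 0, 6])
```

In this sketch the single zero-sales day at index 5 is not flagged, because it falls short of the assumed minimum run length.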
25 Claims
1. A computer-implemented method of preparing data for modeling, comprising:
storing data from customer sales transactions in a database;
retrieving a dataset of the data from the database;
cleansing the dataset to remove erroneous and anomalous entries;
aggregating the dataset over a plurality of dimensions of the transactional space including store, product, and time dimensions; and
analyzing the dataset following the cleansing and aggregating steps within a model to predict attributes of subsequent sales transactions.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10.
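The aggregating step of claim 1 can be illustrated by rolling daily rows up to one record per store, product, and time period. This is only a sketch: the row layout, the seven-day period, and the summed measures are assumptions chosen for illustration, since the claim names the dimensions but not the granularity or the measures.

```python
from collections import defaultdict


def aggregate(rows, days_per_period=7):
    """Sum units and revenue per (store, product, period).

    Each row is a hypothetical tuple: (store, product, day, units, price).
    The period index is the day number divided by `days_per_period`.
    """
    totals = defaultdict(lambda: {"units": 0, "revenue": 0.0})
    for store, product, day, units, price in rows:
        key = (store, product, day // days_per_period)
        totals[key]["units"] += units
        totals[key]["revenue"] += units * price
    return dict(totals)


# Hypothetical cleansed rows spanning two weeks and two stores.
weekly = aggregate([
    ("S1", "P1", 0, 2, 1.5),
    ("S1", "P1", 6, 3, 1.5),
    ("S1", "P1", 7, 1, 1.5),
    ("S2", "P1", 0, 4, 2.0),
])
```

Keying on a tuple keeps the three dimensions explicit, so a coarser roll-up (for example, across all stores) only requires dropping one element of the key.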
11. A method of preparing data for modeling, comprising:
retrieving a dataset from a database, wherein the dataset includes data from customer sales transactions;
cleansing the dataset to remove erroneous and anomalous entries;
aggregating the dataset over a plurality of dimensions of the transactional space including store, product, and time dimensions; and
transmitting the dataset to a modeling system, wherein the modeling system predicts attributes of subsequent sales transactions.
Dependent claims: 12, 13, 14, 15, 16, 17.
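Claim 11 does not specify a format or transport for transmitting the dataset to the modeling system. As one possible sketch, the aggregated records could be serialized to JSON before handoff. The payload shape and the function name `to_payload` are hypothetical, and the actual transmission is left out.

```python
import json


def to_payload(aggregated):
    """Serialize {(store, product, period): units} into a JSON string.

    The record layout is an assumption made for illustration.
    """
    records = [
        {"store": store, "product": product, "period": period, "units": units}
        for (store, product, period), units in sorted(aggregated.items())
    ]
    return json.dumps({"records": records})


# Hypothetical aggregated dataset keyed by (store, product, period).
payload = to_payload({("S2", "P1", 0): 4, ("S1", "P1", 0): 5})
decoded = json.loads(payload)
```

Sorting the keys before serialization makes the payload deterministic, which simplifies comparing successive exports.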
18. A computer program product usable with a programmable computer processor having a computer readable program code embodied therein, comprising:
computer readable program code which stores data from customer sales transactions in a database;
computer readable program code which retrieves a dataset of the data from the database;
computer readable program code which cleanses the dataset to remove erroneous and anomalous entries;
computer readable program code which aggregates the dataset over a plurality of dimensions of the transactional space including store, product, and time dimensions; and
computer readable program code which analyzes the dataset within a model to predict attributes of subsequent sales transactions.
Dependent claims: 19, 20, 21.
22. A computer system for manipulating data, comprising:
means for retrieving a dataset from a database, wherein the dataset includes data from customer sales transactions;
means for cleansing the dataset to remove erroneous and anomalous entries;
means for aggregating the dataset over a plurality of dimensions of the transactional space including store, product, and time dimensions; and
means for transmitting the dataset to a modeling system, wherein the modeling system predicts attributes of subsequent sales transactions.
Dependent claims: 23, 24, 25.
Specification