Usage-based optimization of network traffic and data warehouse size
First Claim
1. A non-transitory computer-readable medium containing a program which, when executed by a processor, performs operations for maintaining a data warehouse having a plurality of fields for storing data from one or more data sources, comprising:
- monitoring queries issued against the data warehouse;
updating, based on the monitored queries, usage characteristics of one or more of the plurality of fields in the data warehouse indicative of when the one or more fields were last involved in a query;
issuing, by a management component, a fallout request to one or more of the data sources indicating fields determined not to be used according to the usage characteristics; and
updating, by the management component, the data warehouse with data, from one or more of the data sources, for a limited subset of fields involved in the monitored queries within a first predetermined time period, as indicated by the fields contained in the fallout request.
0 Assignments
0 Petitions
Accused Products
Abstract
The present invention generally provides systems, methods, and articles of manufacture for maintaining a data warehouse having a plurality of fields updated with data from one or more data sources. Rather than automatically update every field of data available in the warehouse, a limited subset of fields identified through their involvement in queries issued against the warehouse are updated. By limiting the fields that are updated, the network bandwidth required to transmit the updates to the data warehouse may be reduced. Further, by removing fields from the data warehouse that are not in use, the size of the data warehouse may be reduced.
-
Citations
7 Claims
-
1. A non-transitory computer-readable medium containing a program which, when executed by a processor, performs operations for maintaining a data warehouse having a plurality of fields for storing data from one or more data sources, comprising:
-
monitoring queries issued against the data warehouse; updating, based on the monitored queries, usage characteristics of one or more of the plurality of fields in the data warehouse indicative of when the one or more fields were last involved in a query; issuing, by a management component, a fallout request to one or more of the data sources indicating fields determined not to be used according to the usage characteristics; and updating, by the management component, the data warehouse with data, from one or more of the data sources, for a limited subset of fields involved in the monitored queries within a first predetermined time period, as indicated by the fields contained in the fallout request. - View Dependent Claims (2, 3)
-
-
4. A database system comprising:
-
a data warehouse comprising fields of data containing data originating from one or more data sources; an interface adapted to receive queries issued against the data warehouse; and a warehouse manager configured to; monitor the received queries; update, based on the monitored queries, usage characteristics of one or more of the plurality of fields in the data warehouse indicative of when the one or more fields were last involved in a query; issue a fallout request to one or more of the data sources indicating fields determined not to be used according to the usage characteristics; and update the data warehouse with data, from one or more of the data sources, for a limited subset of fields involved in the monitored queries within a first predetermined time period, as indicated by the fields contained in the fallout request. - View Dependent Claims (5, 6, 7)
-
Specification