USAGE-BASED OPTIMIZATION OF NETWORK TRAFFIC AND DATA WAREHOUSE SIZE
Abstract
The present invention generally provides systems, methods, and articles of manufacture for maintaining a data warehouse having a plurality of fields updated with data from one or more data sources. Rather than automatically updating every field of data available in the warehouse, only a limited subset of fields, identified through their involvement in queries issued against the warehouse, is updated. By limiting the fields that are updated, the network bandwidth required to transmit the updates to the data warehouse may be reduced. Further, by removing fields from the data warehouse that are not in use, the size of the data warehouse may be reduced.
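The usage tracking described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the class name `FieldUsageTracker`, its methods, the sample field names, and the 30-day window are all assumptions introduced here for the example.

```python
from datetime import datetime, timedelta


class FieldUsageTracker:
    """Hypothetical tracker: remembers when each warehouse field was last queried."""

    def __init__(self, fields):
        # None means the field has never been involved in a monitored query.
        self.last_used = {name: None for name in fields}

    def record_query(self, fields_in_query, when):
        # Update the last-used timestamp for every known field the query touched.
        for name in fields_in_query:
            if name in self.last_used:
                self.last_used[name] = when

    def active_fields(self, now, window):
        # Fields involved in a query within the given time window.
        return {
            name
            for name, ts in self.last_used.items()
            if ts is not None and now - ts <= window
        }

    def unused_fields(self, now, window):
        # Everything else: candidates for skipping updates or for removal.
        return set(self.last_used) - self.active_fields(now, window)


# Example with made-up field names and an assumed 30-day window.
now = datetime(2024, 1, 31)
window = timedelta(days=30)
tracker = FieldUsageTracker(["customer_id", "region", "fax_number"])
tracker.record_query(["customer_id", "region"], now - timedelta(days=2))
tracker.record_query(["fax_number"], now - timedelta(days=90))

active = tracker.active_fields(now, window)
unused = tracker.unused_fields(now, window)
```

In this sketch only the fields in `active` would be refreshed from the data sources, while those in `unused` are the ones that could be dropped to shrink the warehouse.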
2 Claims
1. A method for maintaining a data warehouse having a plurality of fields for storing data from one or more data sources, comprising:
- monitoring queries issued against the data warehouse;
- updating, based on the monitored queries, usage characteristics of one or more of the plurality of fields in the data warehouse indicative of when the one or more fields were last involved in a query;
- issuing, by a management component, a fallout request to one or more of the data sources indicating fields determined not to be used according to the usage characteristics; and
- updating, by the management component, the data warehouse with data, from one or more of the data sources, for a limited subset of fields involved in the monitored queries within a first predetermined time period, as indicated by the fields contained in the fallout request.
2. A method for maintaining a data warehouse having a plurality of fields for storing data from one or more data sources, comprising:
- configuring a management component to manage updating the data warehouse on the basis of usage characteristics indicating which of the plurality of fields are in use, wherein the management component is configured to perform an operation, comprising:
  - monitoring queries issued against the data warehouse;
  - updating, based on the monitored queries, the usage characteristics of one or more of the plurality of fields indicative of when the one or more fields were last involved in a query;
  - receiving updates from the one or more data sources for the plurality of fields;
  - identifying, from the received updates, a limited subset of fields involved in the monitored queries as distinguished from fields not involved in the monitored queries; wherein the identifying is done on the basis of the usage characteristics; and
  - selectively updating the data warehouse with data, from one or more of the data sources, for the identified limited subset of fields involved in the monitored queries.
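The two claimed variants can be contrasted in a short sketch: building a fallout request that names unused fields (claim 1), and filtering updates already received down to the fields in use (claim 2). The function names, the dictionary-based warehouse, the sample fields, and the 30-day period are assumptions made for illustration only; the claims do not specify a data format or API.

```python
from datetime import datetime, timedelta


def in_use(last_used, field, now, window):
    # A field counts as in use if it was queried within the window.
    ts = last_used.get(field)
    return ts is not None and now - ts <= window


def build_fallout_request(last_used, now, window):
    # Claim 1 style: list the fields determined not to be used, so that
    # data sources could stop sending them (hypothetical payload format).
    return sorted(f for f in last_used if not in_use(last_used, f, now, window))


def select_updates(received, last_used, now, window):
    # Claim 2 style: keep only received updates for fields that are in use.
    return {f: v for f, v in received.items() if in_use(last_used, f, now, window)}


# Assumed usage characteristics: field name -> time last involved in a query.
now = datetime(2024, 1, 31)
window = timedelta(days=30)
last_used = {
    "customer_id": now - timedelta(days=1),
    "region": now - timedelta(days=10),
    "fax_number": now - timedelta(days=90),
    "legacy_code": None,  # never queried
}

fallout = build_fallout_request(last_used, now, window)

# Updates arriving from a data source, and the current warehouse contents.
received = {"customer_id": 42, "region": "EMEA", "fax_number": "555-0100"}
warehouse = {"customer_id": 41, "region": "APAC", "fax_number": "555-0000"}
warehouse.update(select_updates(received, last_used, now, window))
```

Under these assumptions the fallout request reduces traffic at the source, whereas the selective update discards unneeded data only after it has been received, so only the first variant saves network bandwidth.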
Specification