Automated abstract database generation through existing application statement analysis
Abstract
The present invention generally is directed to methods, articles of manufacture and systems for automatically generating abstract representations of a limited subset of data repository fields accessed by a particular application. The limited subset of fields may be identified by monitoring queries issued against the data repository by the application. The abstract representations may be grouped in a data repository abstraction (DRA) component that serves as a data definition framework allowing data from the fields to be accessed independent of the particular manner in which the data is physically represented in the data repository. By limiting the fields exposed to the application to the limited subset of fields actually accessed by the application, the DRA component, in effect, provides the application with a custom database tailored to its particular needs.
31 Claims
1. A computer implemented method for automatically generating a data repository abstraction component describing, and used to access, data in a data repository, the method comprising:
monitoring queries issued against the data repository by an application;
collecting, from the monitored queries, information regarding usage of fields of the data repository;
identifying, from the collected usage information, a limited subset of one or more fields of the data repository accessed by the application; and
generating, for use in the data repository abstraction component, abstract representations for the limited subset of fields;
wherein the abstract representations logically model the respective limited subset of fields;
whereby the data repository abstraction component is customized on the basis of usage of the fields of the data repository;
wherein the generated abstract representations model the respective fields of the limited subset of fields and expose the data to users in a manner that allows the users to compose queries on the basis of the generated abstract representations without knowledge of or regard for a particular schema of the data repository;
wherein each of the abstract representations comprises:
a logical field definition comprising a logical field name;
at least one location attribute identifying a location of data in the data repository corresponding to the logical field name; and
a reference to an access method invoked to access the data in response to receiving an abstract query containing the logical field name.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10.
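
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the regular expression, the `AbstractRepresentation` class, the `min_count` threshold, and the sample table and column names are all assumptions made for illustration, and the sketch only recognizes fields written in qualified `table.column` form.

```python
import re
from collections import Counter
from dataclasses import dataclass

# Hypothetical pattern: matches qualified "table.column" references only.
FIELD_REF = re.compile(r"\b([A-Za-z_]\w*)\.([A-Za-z_]\w*)\b")


@dataclass(frozen=True)
class AbstractRepresentation:
    logical_name: str    # logical field name shown to users
    table: str           # location attribute: table holding the data
    column: str          # location attribute: column holding the data
    access_method: str   # reference to an access method, by name


def collect_usage(queries):
    """Count how often each (table, column) pair appears in the monitored queries."""
    usage = Counter()
    for sql in queries:
        usage.update(FIELD_REF.findall(sql))
    return usage


def generate_representations(usage, min_count=1):
    """Build one abstract representation per field used at least min_count times."""
    reps = []
    for (table, column), count in sorted(usage.items()):
        if count >= min_count:
            logical = column.replace("_", " ").title()
            reps.append(AbstractRepresentation(logical, table, column, "simple"))
    return reps


# Hypothetical queries captured from an application.
monitored = [
    "SELECT patients.last_name, patients.birth_date FROM patients",
    "SELECT patients.last_name FROM patients WHERE patients.id = 7",
]
usage = collect_usage(monitored)
dra_component = generate_representations(usage)
```

Here `dra_component` holds representations only for the three fields the sample queries touch, so any other column of the hypothetical `patients` table stays hidden from users of the abstraction.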
11. A computer implemented method for automatically generating abstract representations of one or more fields of a data source, the method comprising:
monitoring queries issued against the data source;
collecting, from the monitored queries, information regarding usage of the fields of the data source; and
based on the collected usage information, generating the abstract representations for select fields of the data source, wherein the generated abstract representations model the respective select fields and expose the data to users in a manner that allows the users to compose queries on the basis of the generated abstract representations without knowledge of or regard for a particular schema of the data source;
wherein each of the abstract representations comprises a logical field definition comprising a logical field name, at least one location attribute identifying a location of physical data corresponding to the logical field name, and a reference to an access method invoked to access the physical data from the data source in response to receiving an abstract query containing the logical field name.
Dependent claims: 12, 13, 14, 15, 16.
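
To make the role of the access method reference concrete, the following sketch shows one way an abstract query naming only logical fields might be resolved into a physical query. The `LogicalField` class, the `simple` and `upper` access methods, the `resolve` function, and the sample catalog are hypothetical; the claims do not prescribe any of them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LogicalField:
    name: str            # logical field name
    table: str           # location attribute: table
    column: str          # location attribute: column
    access_method: str   # key into ACCESS_METHODS below


def simple_access(field):
    """Access method that reads the column directly."""
    return f"{field.table}.{field.column}"


def upper_access(field):
    """Access method that exposes the column through an expression (illustrative)."""
    return f"UPPER({field.table}.{field.column})"


# Hypothetical registry mapping access method names to implementations.
ACCESS_METHODS = {"simple": simple_access, "upper": upper_access}


def resolve(abstract_fields, catalog):
    """Translate a list of logical field names into a physical SELECT statement."""
    fields = [catalog[name] for name in abstract_fields]
    exprs = [ACCESS_METHODS[f.access_method](f) for f in fields]
    tables = sorted({f.table for f in fields})
    return f"SELECT {', '.join(exprs)} FROM {', '.join(tables)}"


catalog = {
    "Last Name": LogicalField("Last Name", "patients", "last_name", "upper"),
    "Birth Date": LogicalField("Birth Date", "patients", "birth_date", "simple"),
}
sql = resolve(["Last Name", "Birth Date"], catalog)
print(sql)  # SELECT UPPER(patients.last_name), patients.birth_date FROM patients
```

The caller never mentions `patients` or its columns, which is the sense in which the query is composed without regard for the schema. The sketch omits join conditions, so it is only meaningful for fields drawn from a single table.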
17. A tangible computer readable storage medium containing a program which, when executed, performs operations for automatically generating a data repository abstraction component describing, and used to access, data in a data repository, the operations comprising:
monitoring queries issued against the data repository;
collecting usage information from the monitored queries, the usage information pertaining to how the fields of the data source are being used; and
generating, based on the collected usage information, a set of logical field specifications for a limited subset of one or more fields of the data repository accessed by the monitored queries for use in the data repository abstraction component;
wherein the generated logical field specifications model the respective select fields and expose the data to users in a manner that allows the users to compose abstract queries on the basis of the data repository abstraction component without knowledge of or regard for a particular schema of the data repository;
wherein each of the logical field specifications comprises a logical field definition comprising a logical field name, at least one location attribute identifying a location of physical data corresponding to the logical field name, and a reference to an access method invoked to access the physical data from the data source in response to receiving an abstract query containing the logical field name.
Dependent claims: 18, 19, 20, 21, 22, 23, 24.
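
Claim 17 refers to usage information about how fields are being used, not merely whether they are used. One possible reading, sketched below, is to record whether a field appears in the output list of a query or in its selection condition. The `output` and `condition` role names, the regular expressions, and the sample queries are assumptions for illustration, and the parser only handles simple single-table statements of the form `SELECT ... FROM ... [WHERE ...]`.

```python
import re
from collections import defaultdict

# Assumes simple single-table statements: SELECT ... FROM table [WHERE ...].
STATEMENT = re.compile(
    r"SELECT\s+(?P<select>.+?)\s+FROM\s+\w+(?:\s+WHERE\s+(?P<where>.+))?$",
    re.IGNORECASE | re.DOTALL,
)
IDENT = re.compile(r"[A-Za-z_]\w*")
STRING_LITERAL = re.compile(r"'[^']*'")
KEYWORDS = {"and", "or", "not", "like", "in", "is", "null"}


def classify_usage(queries):
    """Map each field name to the set of roles it plays in the monitored queries."""
    roles = defaultdict(set)
    for sql in queries:
        match = STATEMENT.match(sql.strip())
        if not match:
            continue  # statement shape not covered by this sketch
        for name in IDENT.findall(match.group("select")):
            roles[name.lower()].add("output")
        # Drop quoted literals so their contents are not mistaken for field names.
        where = STRING_LITERAL.sub("", match.group("where") or "")
        for name in IDENT.findall(where):
            if name.lower() not in KEYWORDS:
                roles[name.lower()].add("condition")
    return dict(roles)


# Hypothetical monitored queries.
roles = classify_usage([
    "SELECT last_name, birth_date FROM patients WHERE id = 7",
    "SELECT last_name FROM patients WHERE birth_date > '1980-01-01' AND id = 3",
])
```

A generator could use such roles to decide, for example, which logical fields to offer as search criteria and which only as result columns, although the claim itself does not say so.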
25. A data processing system, comprising:
a processor;
a data repository containing data;
an application suitable for querying the data repository; and
a data repository abstraction generator configured to generate a data repository abstraction component describing, and used to access, data in the data repository based on usage information collected by monitoring and parsing queries issued against the data repository by the application;
wherein the data repository abstraction generator is configured to generate a plurality of logical field specifications for a limited subset of fields of the data repository accessed by the application, as indicated by the monitored queries;
whereby the data repository abstraction is customized on the basis of how the application queries the data repository;
wherein the generated logical field specifications model the respective fields of the limited subset of fields and expose the physical data to users in a manner that allows the users to compose queries on the basis of the generated logical field specifications without knowledge of or regard for a particular schema of the data repository;
wherein each of the logical field specifications comprises:
a logical field definition comprising a logical field name;
at least one location attribute identifying a location of data in the data repository corresponding to the logical field name; and
a reference to an access method invoked to access the data in response to receiving an abstract query containing the logical field name.
Dependent claims: 26, 27, 28, 29, 30, 31.
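
As a rough illustration of the monitoring element of claim 25, the sketch below uses SQLite from the Python standard library as a stand-in data repository and records every statement an application issues through `Connection.set_trace_callback`. The choice of SQLite, the `patients` table, and the word-matching heuristic for deciding which columns were queried are assumptions, not features taken from the patent.

```python
import re
import sqlite3

observed = []  # SQL text of every statement run on the connection

conn = sqlite3.connect(":memory:")
conn.set_trace_callback(observed.append)  # start monitoring

# Stand-in for the application's activity against the repository.
conn.execute("CREATE TABLE patients (id INTEGER, last_name TEXT, notes TEXT)")
conn.execute("INSERT INTO patients VALUES (1, 'Smith', 'n/a')")
conn.execute("SELECT last_name FROM patients WHERE id = 1").fetchall()

conn.set_trace_callback(None)  # stop monitoring before inspecting the schema

# Only SELECT statements count as queries for this sketch.
selects = [s for s in observed if s.lstrip().upper().startswith("SELECT")]

# All columns physically present in the table, in declaration order.
columns = [row[1] for row in conn.execute("PRAGMA table_info(patients)")]

# Limited subset: columns whose names appear as whole words in some query.
used = [
    c for c in columns
    if any(re.search(rf"\b{re.escape(c)}\b", q) for q in selects)
]
print(used)  # ['id', 'last_name']
```

The `notes` column exists in the repository but never appears in a monitored query, so a generator working from `used` would not create a logical field specification for it.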
Specification