System and method for planning and generating queries for multi-dimensional analysis using domain models and data federation
First Claim
Patent Images
1. A system for planning and generating database queries using a multitude of data sources in various formats and media, which enables multi-dimensional analyses without physically building a data warehouse, the system comprising:
- a domain model which provides a formal description of a domain of interest, comprising classes which are concepts in a domain of discourse, properties of each class describing its attributes including its relations with other classes and constraints on properties in the classes;
a mapping server which generates a domain model specific to a set of data sources by specifying mapping between entities in said domain model and entities in the data sources;
an ontology query generator which translates one or more queries in a natural language format to generate one or more ontology queries which can be understood in terms of said domain model;
an ontology query server which infers one or more answers to ontology queries by reasoning with said domain model, where said ontology query server comprises one or more ontology query languages and programming interfaces;
one or more ontology import and loading components;
one or more ontology storage components; and
one or more reasoning components which apply one or more reasoning rules to one or more facts to infer one or more implied facts and find one or more answers to ontology queries;
a SQL query plan generator which composes one or more query plans using a standard query language for relational databases for multi-dimensional analysis including one or more of data grouping, roll-up and drill-down, pivoting, and changing the focused class of the current multi-dimensional analysis;
a data federation system which builds one or more adaptors for individual data sources which make data sources appear as relational databases and allow them to be accessed by using a standard query language for relational databases and which also optimizes query plans for performance and processes queries to retrieve data from one or more data sources; and
a report generator which provides data retrieved from data sources for multi-dimensional analysis.
4 Assignments
0 Petitions
Accused Products
Abstract
Data integration and data analysis using computing equipment, software as well as hardware, includes a system and method for integrating data from various data sources, structured and unstructured, without physically creating a data warehouse and automatically generating queries for analysis of the integrated data from a multitude of different views.
38 Citations
12 Claims
-
1. A system for planning and generating database queries using a multitude of data sources in various formats and media, which enables multi-dimensional analyses without physically building a data warehouse, the system comprising:
-
a domain model which provides a formal description of a domain of interest, comprising classes which are concepts in a domain of discourse, properties of each class describing its attributes including its relations with other classes and constraints on properties in the classes; a mapping server which generates a domain model specific to a set of data sources by specifying mapping between entities in said domain model and entities in the data sources; an ontology query generator which translates one or more queries in a natural language format to generate one or more ontology queries which can be understood in terms of said domain model; an ontology query server which infers one or more answers to ontology queries by reasoning with said domain model, where said ontology query server comprises one or more ontology query languages and programming interfaces;
one or more ontology import and loading components;
one or more ontology storage components; and
one or more reasoning components which apply one or more reasoning rules to one or more facts to infer one or more implied facts and find one or more answers to ontology queries;a SQL query plan generator which composes one or more query plans using a standard query language for relational databases for multi-dimensional analysis including one or more of data grouping, roll-up and drill-down, pivoting, and changing the focused class of the current multi-dimensional analysis; a data federation system which builds one or more adaptors for individual data sources which make data sources appear as relational databases and allow them to be accessed by using a standard query language for relational databases and which also optimizes query plans for performance and processes queries to retrieve data from one or more data sources; and a report generator which provides data retrieved from data sources for multi-dimensional analysis. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for planning and generating database queries for multi-dimensional analysis using domain models and data federation comprising the steps of:
-
creating a domain model for a domain of discourse to support multi-dimensional analysis; mapping between entities in the domain model and entities in data sources; building one or more adaptors for individual data sources which make data sources appears as relational databases and allow the data sources to be accessed by using a standard query language of relational databases; composing one or more queries for multi-dimensional analysis; translating the one or more queries in a natural language and generating one or more ontology queries which can be understood in terms of the domain model; inferring one or more answers to the ontology queries by using a reasoning component and the domain model, where inferring comprises use of an ontology query server that provides one or more ontology query languages and programming interfaces;
one or more ontology import and loading components;
one or more ontology storage components; and
one or more reasoning components which apply one or more reasoning rules to one or more facts to infer one or more implied facts and find one or more answers to ontology queries;composing one or more SQL query plans for multi-dimensional analysis including one or more data grouping, roll-up and drill-down, pivoting, and changing the focused class of the current multi-dimensional analysis; submitting the SQL query plans to a data federation system for processing and retrieving data from one or more data sources; and providing the retrieved data for multi-dimensional analysis. - View Dependent Claims (7, 8, 9)
-
-
10. A computer readable medium storing a program of machine-readable instructions executable by a digital processing apparatus to perform operations for planning and generating database queries for multi-dimensional analysis using domain models and data federation, the operations comprising:
-
creating a domain model for a domain of discourse to support multi-dimensional analysis; mapping between entities in the domain model and entities in data sources; building one or more adaptors for individual data sources which make data sources appears as relational databases and allow the data sources to be accessed by using a standard query language of relational databases; composing one or more queries for multi-dimensional analysis; translating the one or more queries in a natural language and generating one or more ontology queries which can be understood in terms of the domain model; inferring one or more answers to the ontology queries by using a reasoning component and the domain model, where inferring comprises use of an ontology query server that provides one or more ontology query languages and programming interfaces;
one or more ontology import and loading components;
one or more ontology storage components; and
one or more reasoning components which apply one or more reasoning rules to one or more facts to infer one or more implied facts and find one or more answers to ontology queries;composing one or more SQL query plans for multi-dimensional analysis including one or more data grouping, roll-up and drill-down, pivoting, and changing the focused class of the current multi-dimensional analysis; submitting the SQL query plans to a data federation system for processing and retrieving data from one or more data sources; and providing the retrieved data for multi-dimensional analysis. - View Dependent Claims (11, 12)
-
Specification