System and method for planning and generating queries for multi-dimensional analysis using domain models and data federation
First Claim
Patent Images
1. A system for planning and generating database queries using a multitude of data sources in various formats and media, which enables multi-dimensional analyses without physically building a data warehouse, the system comprising:
- a memory;
a domain model which provides a formal description of a domain of interest, comprising classes which are concepts in a domain of discourse, properties of each class describing its attributes including its relations with other classes and constraints on properties in the classes;
a mapping server which generates a domain model specific to a set of data sources by specifying mapping between entities in said domain model and entities in the data sources;
an ontology query generator which translates one or more queries in a natural language format to generate one or more ontology queries which can be understood in terms of said domain model;
an ontology query server which infers one or more answers to ontology queries by reasoning with said domain model;
a SQL query plan generator which composes one or more query plans using a standard query language for relational databases for multi-dimensional analysis including one or more of data grouping, roll-up and drill-down, pivoting, and changing the focused class of the current multi-dimensional analysis;
a data federation system which builds one or more adaptors for individual data sources which make data sources appear as relational databases and allow them to be accessed by using a standard query language for relational databases and which also optimizes query plans for performance and processes queries to retrieve data from one or more data sources; and
a report generator which provides data retrieved from data sources for multi-dimensional analysis.
3 Assignments
0 Petitions
Accused Products
Abstract
Data integration and data analysis using computing equipment, software as well as hardware, includes a system and method for integrating data from various data sources, structured and unstructured, without physically creating a data warehouse and automatically generating queries for analysis of the integrated data from a multitude of different views.
-
Citations
12 Claims
-
1. A system for planning and generating database queries using a multitude of data sources in various formats and media, which enables multi-dimensional analyses without physically building a data warehouse, the system comprising:
-
a memory; a domain model which provides a formal description of a domain of interest, comprising classes which are concepts in a domain of discourse, properties of each class describing its attributes including its relations with other classes and constraints on properties in the classes; a mapping server which generates a domain model specific to a set of data sources by specifying mapping between entities in said domain model and entities in the data sources; an ontology query generator which translates one or more queries in a natural language format to generate one or more ontology queries which can be understood in terms of said domain model; an ontology query server which infers one or more answers to ontology queries by reasoning with said domain model; a SQL query plan generator which composes one or more query plans using a standard query language for relational databases for multi-dimensional analysis including one or more of data grouping, roll-up and drill-down, pivoting, and changing the focused class of the current multi-dimensional analysis; a data federation system which builds one or more adaptors for individual data sources which make data sources appear as relational databases and allow them to be accessed by using a standard query language for relational databases and which also optimizes query plans for performance and processes queries to retrieve data from one or more data sources; and a report generator which provides data retrieved from data sources for multi-dimensional analysis. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method for planning and generating database queries for multi-dimensional analysis using domain models and data federation comprising the steps of:
-
creating a domain model for a domain of discourse to support multi-dimensional analysis; mapping between entities in the domain model and entities in data sources; building one or more adaptors for individual data sources which make data sources appears as relational databases and allow the data sources to be accessed by using a standard query language of relational databases; composing one or more queries for multi-dimensional analysis; translating the one or more queries in a natural language and generating one or more ontology queries which can be understood in terms of the domain model; inferring one or more answers to the ontology queries by using a reasoning component and the domain model; composing one or more SQL query plans for multi-dimensional analysis including one or more data grouping, roll-up and drill-down, pivoting, and changing the focused class of the current multi-dimensional analysis; submitting the SQL query plans to a data federation system for processing and retrieving data from one or more data sources; and providing the retrieved data for multi-dimensional analysis. - View Dependent Claims (7, 8, 9)
-
-
10. A memory storing a program of machine-readable instructions executable by a digital processing apparatus to perform operations for planning and generating database queries for multi-dimensional analysis using domain models and data federation, the operations comprising:
-
creating a domain model for a domain of discourse to support multi-dimensional analysis; mapping between entities in the domain model and entities in data sources; building one or more adaptors for individual data sources which make data sources appears as relational databases and allow the data sources to be accessed by using a standard query language of relational databases; composing one or more queries for multi-dimensional analysis; translating the one or more queries in a natural language and generating one or more ontology queries which can be understood in terms of the domain model; inferring one or more answers to the ontology queries by using a reasoning component and the domain model; composing one or more SQL query plans for multi-dimensional analysis including one or more data grouping, roll-up and drill-down, pivoting, and changing the focused class of the current multi-dimensional analysis; submitting the SQL query plans to a data federation system for processing and retrieving data from one or more data sources; and providing the retrieved data for multi-dimensional analysis. - View Dependent Claims (11, 12)
-
Specification