Method and system for creating and loading data warehouse from semi-structured document
First Claim
Patent Images
1. A computer-implemented method for creating a data warehouse from at least one semi-structured file, comprising:
- analyzing said at least one file to generate at least one generic schema;
creating at least one database table using the schema;
loading data in said at least one file into corresponding columns in the table; and
defining dimensions for a data warehouse using columns of the table.
2 Assignments
0 Petitions
Accused Products
Abstract
A data warehouse is created using an input file that can contain sub-documents of different formats. A root document model including path names to all nodes among the sub-documents is generated, and a table is generated with columns being derived from the path names of the root model. The sub-documents are shredded to populate the table. Then, the dimensions of the data warehouse are defined by selecting respective columns. A routine such as a DDL may then be generated to populate the data warehouse with data.
22 Citations
36 Claims
-
1. A computer-implemented method for creating a data warehouse from at least one semi-structured file, comprising:
-
analyzing said at least one file to generate at least one generic schema;
creating at least one database table using the schema;
loading data in said at least one file into corresponding columns in the table; and
defining dimensions for a data warehouse using columns of the table. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A service for creating a data warehouse for a customer, comprising:
-
generating a root model of at least one sub-document in at least one input file;
using the root model to generate at least one table;
shredding sub-documents to populate the table;
using columns in the table to define dimensions of the data warehouse; and
providing at least one of;
the data warehouse, and a tool to populate the data warehouse with data from the input file, to a customer. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A computer executing logic comprising:
-
generating a root model of plural documents, the root model including at least path names to all nodes among the documents;
generating a table having one of columns, or rows, derived from the path names of the root model; and
defining plural dimensions of a data warehouse by selecting one of;
respective columns, and respective rows, of the table. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A computer program product having a program executable by a digital processing apparatus to create a data warehouse for storing data in an input file, comprising:
-
means for generating a root document model containing path names to all nodes in the input file; and
means for using the path names to establish dimensions in the data warehouse. - View Dependent Claims (28, 29, 30, 31, 32, 33, 34, 35, 36)
-
Specification