OLAP Query Processing Method Oriented to Database and HADOOP Hybrid Platform
First Claim
1. An on-line analytical processing (OLAP) query processing method oriented to a database and Hadoop hybrid platform, wherein:
- on the basis of a multi-copy fault-tolerance mechanism of the Hadoop, a fact table is stored in a database cluster, a main working copy and at least one fault-tolerant copy of the fact table are set, the main working copy is imported into a local database of a working node, and a table corresponding to the main working copy is named according to a unified naming rule;
the main working copy in a Hadoop distributed file system is deleted, and meta-information of the main working copy in namenode is updated into a JDBC connection of the local database and a name of the table corresponding to the main working copy;
when OLAP query processing is performed, the processing is executed first on the main working copy, and a query processing result is recorded in an aggregate result table of the local database; and
when the working node is faulty, node information of the fault-tolerant copy corresponding to the main working copy is searched for through the namenode, and a MapReduce task is invoked to complete the OLAP query processing task on the fault-tolerant copy.
1 Assignment
0 Petitions
Accused Products
Abstract
An OLAP query processing method oriented to a database and Hadoop hybrid platform is described. When OLAP query processing is performed, the processing is executed first on a main working copy, and a query processing result is recorded in an aggregate result table of a local database; when a working node is faulty, node information of a fault-tolerant copy corresponding to the main working copy is searched for through namenode, and a MapReduce task is invoked to complete the OLAP query processing task on the fault-tolerant copy. The database technology and the Hadoop technology are combined, and the storage performance of the database and the high expandability and high availability of the Hadoop are combined; the database query processing and the MapReduce query processing are integrated in a loosely-coupled mode, thereby ensuring the high query processing performance, and ensuring the high fault-tolerance performance.
-
Citations
10 Claims
-
1. An on-line analytical processing (OLAP) query processing method oriented to a database and Hadoop hybrid platform, wherein:
-
on the basis of a multi-copy fault-tolerance mechanism of the Hadoop, a fact table is stored in a database cluster, a main working copy and at least one fault-tolerant copy of the fact table are set, the main working copy is imported into a local database of a working node, and a table corresponding to the main working copy is named according to a unified naming rule;
the main working copy in a Hadoop distributed file system is deleted, and meta-information of the main working copy in namenode is updated into a JDBC connection of the local database and a name of the table corresponding to the main working copy;when OLAP query processing is performed, the processing is executed first on the main working copy, and a query processing result is recorded in an aggregate result table of the local database; and when the working node is faulty, node information of the fault-tolerant copy corresponding to the main working copy is searched for through the namenode, and a MapReduce task is invoked to complete the OLAP query processing task on the fault-tolerant copy. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
Specification