Apparatus and Method for Securing Preliminary Information About Database Fragments for Utilization in Mapreduce Processing
First Claim
1. A method of processing a query in a distributed database implemented across a plurality of nodes, comprising:
- receiving a query;
dividing the query into split characterization queries;
distributing the split characterization queries to a plurality of worker nodes, wherein each worker node stores a partition of the distributed database as a database with encoded textual objects and pre-defined indices characterizing encoded textual object fragments; and
executing the split characterization queries at the plurality of worker nodes to obtain preliminary information about query results, wherein executing the split characterization queries includes matching query fragments associated with the split characterization queries with encoded textual object fragments to produce fragment matches representative of the size of the query results, wherein for each split characterization query the preliminary information about query results includes a fragment count, a database partition identification, and a database host name.
4 Assignments
0 Petitions
Accused Products
Abstract
A method of processing a query in a distributed database implemented across a set of nodes includes receiving a query. The query is divided into split characterization queries. The split characterization queries are distributed to worker nodes. Each worker node stores a partition of the distributed database with encoded textual objects and pre-defined indices characterizing encoded textual object fragments. The split characterization queries are executed at the worker nodes to obtain preliminary information about query results. Executing the split characterization queries includes matching query fragments associated with the split characterization queries with encoded textual object fragments of the pre-defined indices to produce fragment matches representative of the size of the query results. For each split characterization query the preliminary information about query results includes a fragment count, a database partition identification, and a database host name.
-
Citations
16 Claims
-
1. A method of processing a query in a distributed database implemented across a plurality of nodes, comprising:
-
receiving a query; dividing the query into split characterization queries; distributing the split characterization queries to a plurality of worker nodes, wherein each worker node stores a partition of the distributed database as a database with encoded textual objects and pre-defined indices characterizing encoded textual object fragments; and executing the split characterization queries at the plurality of worker nodes to obtain preliminary information about query results, wherein executing the split characterization queries includes matching query fragments associated with the split characterization queries with encoded textual object fragments to produce fragment matches representative of the size of the query results, wherein for each split characterization query the preliminary information about query results includes a fragment count, a database partition identification, and a database host name. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A non-transitory computer readable storage medium, comprising executable instructions to:
-
execute a split characterization query against a partition of a distributed database with encoded textual objects and pre-defined indices characterizing encoded textual object fragments, wherein the executable instructions to execute include executable instructions to match query fragments associated with the split characterization query with encoded textual object fragments of the pre-defined indices to produce fragment matches representative of the size of query results; and generate preliminary information about the query results, wherein the preliminary information includes a fragment count, a database partition identification, and a database host name. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
Specification