Query-level access to external petabyte-scale distributed file systems
First Claim
1. A computer-implemented method for creating query-level access by a database engine to an external distributed file system, the method comprising:
- identifying a location of external data residing on the external distributed file system;
creating a query specifying an external table and metadata within the database engine, the external table associated with one or more location files that specify code to execute operational directives against the external data in the external distributed file system, the operational directives producing one or more results at the external distributed system; and
defining the metadata comprising;
generating the external table via an external table declarative statement, andspecifying a location of the one or more location files for the external table, wherein the code specified by the one or more location files, when executed, streams data from the one or more results to a user application without copying the data to table files on the database engine.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system to creating query-level access to an external distributed file system by identifying a location of one or more external data residing on the external distributed file system, creating a query specifying an external table within a database engine having one or more location files, wherein the location files identify metadata operations for accessing and processing the one or more external data, defining metadata operations for accessing and processing the one or more external data, wherein the processing that produces one or more result files occurs at the external distributed file system, and executing the query at the database engine to create the external table, the external table comprising the one or more location files identifying the metadata directives for processing query-level requests on the one or more external data stored on the external distributed file system.
-
Citations
21 Claims
-
1. A computer-implemented method for creating query-level access by a database engine to an external distributed file system, the method comprising:
-
identifying a location of external data residing on the external distributed file system; creating a query specifying an external table and metadata within the database engine, the external table associated with one or more location files that specify code to execute operational directives against the external data in the external distributed file system, the operational directives producing one or more results at the external distributed system; and defining the metadata comprising; generating the external table via an external table declarative statement, and specifying a location of the one or more location files for the external table, wherein the code specified by the one or more location files, when executed, streams data from the one or more results to a user application without copying the data to table files on the database engine. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer system for creating query-level access by a database engine to an external distributed file system, the system comprising:
-
A computer processor to execute a set of program code instructions; and a memory to hold the program code instructions, in which the program code instructions comprises program code to perform; identifying a location of external data residing on the external distributed file system; creating a query specifying an external table and metadata within the database engine, the external table associated with one or more location files that specify code to execute operational directives against the external data in the external distributed file system, the operational directives producing one or more results at the external distributed system; and defining the metadata comprising; generating the external table via an external table declarative statement, and specifying a location of the one or more location files for the external table, wherein the code specified by the one or more location files, when executed, stream data from the one or more results to a user application without copying the data to table files on the database engine. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer program product embodied in a non-transitory computer readable medium, the computer readable medium having stored thereon a sequence of instructions which, when executed by a processor causes the processor to execute a process to create a query-level access by a database engine to an external distributed file system, the process comprising:
-
identifying a location of external data residing on the external distributed file system; creating a query specifying an external table and metadata within the database engine, the external table associated with one or more location files that specify code to execute operational directives against the external data in the external distributed file system, the operational directives producing one or more results at the external distributed system; and defining the metadata comprising; generating the external table via an external table declarative statement, and specifying a location of the one or more location files for the external table, wherein the code specified by the one or more location files, when executed, stream data from the one or more results to a user application without copying the data to table files on the database engine. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification