Caching Systems And Methods
Abstract
Example caching systems and methods are described. In one implementation, a method identifies multiple files used to process a query and distributes each of the multiple files to a particular execution node to execute the query. Each execution node determines whether the distributed file is stored in the execution node's cache. If the execution node determines that the file is stored in the cache, it processes the query using the cached file. If the file is not stored in the cache, the execution node retrieves the file from a remote storage device, stores the file in the execution node's cache, and processes the query using the file.
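The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `ExecutionNode` class, the dictionary standing in for the remote storage device, the round-robin distribution, and the toy row-counting "query" are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class ExecutionNode:
    """Hypothetical execution node with a local file cache."""
    name: str
    remote: dict  # stands in for the remote storage device (assumption)
    cache: dict = field(default_factory=dict)
    remote_reads: int = 0

    def load(self, file_name: str) -> bytes:
        # Cache hit: use the locally cached copy.
        if file_name in self.cache:
            return self.cache[file_name]
        # Cache miss: retrieve from remote storage, then store in the cache.
        data = self.remote[file_name]
        self.remote_reads += 1
        self.cache[file_name] = data
        return data

    def process(self, file_name: str) -> int:
        # Toy "query": count the rows (lines) in the file.
        return len(self.load(file_name).splitlines())


def run_query(files, nodes):
    """Distribute each file to one node (round-robin, an assumption) and sum the results."""
    total = 0
    for i, file_name in enumerate(files):
        node = nodes[i % len(nodes)]
        total += node.process(file_name)
    return total


remote_storage = {"a.dat": b"1\n2\n3", "b.dat": b"4\n5", "c.dat": b"6"}
nodes = [ExecutionNode("n1", remote_storage), ExecutionNode("n2", remote_storage)]
files = ["a.dat", "b.dat", "c.dat"]

first = run_query(files, nodes)   # every file is a cache miss
second = run_query(files, nodes)  # every file is now a cache hit
```

In this sketch the second run returns the same result as the first without touching remote storage, because each node already holds the files it was assigned.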
20 Claims
1. A method comprising:
identifying a plurality of files used to process a query;
distributing each of the plurality of files to a particular execution node, wherein each execution node is configured to execute the query with the files distributed to that execution node;
determining, by each execution node, whether the distributed file is stored in the execution node's cache;
responsive to determining that the distributed file is stored in the execution node's cache, processing, using one or more processors, the query using the file stored in the cache; and
responsive to determining that the distributed file is not stored in the execution node's cache;
retrieving the file from a remote storage device;
storing the file in the execution node's cache; and
processing, using the one or more processors, the query using the file.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. An apparatus comprising:
a resource manager configured to identify a plurality of files used to process a query; and
an execution platform coupled to the resource manager, the execution platform including a plurality of execution nodes configured to process the query using the plurality of files, each execution node including a cache configured to store data retrieved from a plurality of remote storage devices, the execution platform further configured to distribute the plurality of files to at least one of the plurality of execution nodes based on metadata associated with the plurality of execution nodes, each execution node further including a processor configured to process the query using at least one of the plurality of files;
wherein the execution platform is further configured to determine whether the files are stored in an execution node's cache, responsive to determining that a particular file is not stored in the execution node's cache, the execution platform is configured to instruct the execution node to retrieve the file from the remote storage device, cache the retrieved file in the execution node's cache, and process the query using the retrieved file.
Dependent claims: 15, 16, 17, 18
19. An apparatus comprising:
a resource manager configured to identify a plurality of files used to process a query; and
means for executing data processing tasks, the means for executing including a plurality of execution nodes configured to process the query using the plurality of files, each execution node including a cache configured to store data retrieved from a plurality of remote storage devices, the means for executing distributing the plurality of files to at least one of the plurality of execution nodes based on metadata associated with the plurality of execution nodes, each execution node further including a processor configured to process the query using at least one of the plurality of files;
the means for executing further determining whether the files are stored in an execution node's cache, responsive to determining that a particular file is not stored in the execution node's cache, the means for executing instructs the execution node to retrieve the file from the remote storage device, cache the retrieved file in the execution node's cache, and process the query using the retrieved file.
Dependent claim: 20
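Claims 14 and 19 recite distributing files to execution nodes "based on metadata associated with the plurality of execution nodes" without stating, in the text above, what that metadata contains. The following Python sketch assumes one plausible form: a record of which files each node is believed to have cached. The function name `assign_files`, the metadata layout, and the tie-breaking rule are hypothetical and are not taken from the patent.

```python
def assign_files(files, node_metadata):
    """Map each file name to a node name.

    node_metadata: {node_name: set of file names believed cached on that node}
    (an assumed shape for the "metadata" named in the claims).
    """
    load = {name: 0 for name in node_metadata}
    assignment = {}
    for file_name in files:
        # Prefer nodes that already cache the file; otherwise consider all nodes.
        holders = [n for n, cached in node_metadata.items() if file_name in cached]
        candidates = holders or list(node_metadata)
        # Pick the least-loaded candidate; break ties by name for determinism.
        chosen = min(candidates, key=lambda n: (load[n], n))
        assignment[file_name] = chosen
        load[chosen] += 1
    return assignment


metadata = {"n1": {"a.dat"}, "n2": {"b.dat"}, "n3": set()}
plan = assign_files(["a.dat", "b.dat", "c.dat", "d.dat"], metadata)
```

Under these assumptions, files already cached on a node are routed back to that node, which avoids a retrieval from remote storage, while uncached files go to whichever node currently has the fewest assignments.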
Specification