External data access with split index
First Claim
Patent Images
1. A method, comprising:
- creating, by a processor, an index locally in a relational data storage system for data stored external to the relational data storage system in a non-relational data storage system;
receiving a query, by the relational data storage system, for at least a portion of the data in the non-relational data storage system;
determining that the index is current with respect to an indexed portion of the data stored external to the relational data storage system, and stale with respect to an updated portion of the data stored external to the relational data storage system;
identifying, as a first selected data subset, data of the indexed portion that satisfies the query based on the index;
scanning the updated portion of the data stored external to the relational data storage system to identify a second selected data subset matching the query; and
returning the first selected data subset and the second selected data subset in response to the query.
2 Assignments
0 Petitions
Accused Products
Abstract
A split-index can be employed for access to external data. The index can be created on a primary data storage system for data stored externally on a secondary data storage system. After creation, the index can be utilized to expedite at least query execution over the externally stored data. The index can be updated upon detection of changes to data. Further, even when the index is not completely up to date, the index can be exploited for query execution. Furthermore, hybrid execution is enabled with the index and without the index.
-
Citations
19 Claims
-
1. A method, comprising:
-
creating, by a processor, an index locally in a relational data storage system for data stored external to the relational data storage system in a non-relational data storage system; receiving a query, by the relational data storage system, for at least a portion of the data in the non-relational data storage system; determining that the index is current with respect to an indexed portion of the data stored external to the relational data storage system, and stale with respect to an updated portion of the data stored external to the relational data storage system; identifying, as a first selected data subset, data of the indexed portion that satisfies the query based on the index; scanning the updated portion of the data stored external to the relational data storage system to identify a second selected data subset matching the query; and returning the first selected data subset and the second selected data subset in response to the query. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system, comprising:
-
a processor; and a memory storing instructions, wherein execution of the instructions by the processor causes a device to; create an index locally in a relational data storage system for data stored external to the relational data storage system in a non-relational data storage system; receive a query, by the relational data storage system, for at least a portion of the data in the non-relational data storage system; determine that the index is current with respect to an indexed portion of the data stored external to the relational data storage system, and stale with respect to an updated portion of the data stored external to the relational data storage system; identify, as a first selected data subset, data of the indexed portion that satisfies the query based on the index; scanning the updated portion of the data stored external to the relational data storage system to identify a second selected data subset matching the query; and return the first selected data subset and the second selected data subset in response to the query. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. A computer-readable storage medium having instructions stored thereon, wherein execution of the instructions by a processor of a device causes the device to:
-
create an index locally in a relational data warehouse system for data stored external to the relational data warehouse system in a non-relational distributed file system; receive a query, by the relational data warehouse system, for at least a portion of the data stored in the non-relational distributed file system; determine that the index is current with respect to an indexed portion of the data stored external to the relational data warehouse, and stale with respect to an updated portion of the data stored external to the relational data warehouse; identify, as a first selected data subset, data of the indexed portion that satisfies the query based on the index; scan the updated portion of the data stored external to the relational data storage system to identify a second selected data subset matching the query; return the first selected data subset and the second selected data subset in response to the query. - View Dependent Claims (17, 18, 19)
-
Specification