Determining data location in a distributed data store
First Claim
Patent Images
1. A method comprising:
- identifying, by a processing device of server computing device, a first possible location of a data item in a distributed database, wherein the first possible location of the data item is on a second server in the distributed database;
generating, by the processing device, a first probabilistic data structure that comprises information to identify the first possible location of the data item in the distributed database;
providing the first probabilistic data structure to a client computing device before the client computing device sends a request for the data item, wherein the client computing device is not involved with an operation to generate the first probabilistic data structure;
receiving, by the processing device, a query for the data item in response to the client computing device failing to locate the data item in view of the first probabilistic data structure;
determining a second possible location of the data item by querying a name server comprising an index to identify locations of data items stored in the distributed database;
generating a second probabilistic data structure to identify the second possible location of the data item; and
providing the second probabilistic data structure to the client computing device.
1 Assignment
0 Petitions
Accused Products
Abstract
A computing device identifies a location of a data item in a distributed data store. The computing device generates at least one probabilistic data structure that contains information to identify a probable location of the data item. The computing device transmits the at least one probabilistic data structure to one or more client computing devices, wherein the at least one probabilistic data structure is usable by the one or more client computing devices to determine the probable location of the data item.
-
Citations
18 Claims
-
1. A method comprising:
-
identifying, by a processing device of server computing device, a first possible location of a data item in a distributed database, wherein the first possible location of the data item is on a second server in the distributed database; generating, by the processing device, a first probabilistic data structure that comprises information to identify the first possible location of the data item in the distributed database; providing the first probabilistic data structure to a client computing device before the client computing device sends a request for the data item, wherein the client computing device is not involved with an operation to generate the first probabilistic data structure; receiving, by the processing device, a query for the data item in response to the client computing device failing to locate the data item in view of the first probabilistic data structure; determining a second possible location of the data item by querying a name server comprising an index to identify locations of data items stored in the distributed database; generating a second probabilistic data structure to identify the second possible location of the data item; and providing the second probabilistic data structure to the client computing device. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method comprising:
-
receiving, by a processing device of a client computing device, a probabilistic data structure to probabilistically identify a first possible location of a data item in a distributed database, wherein the client computing device is not involved with an operation to generate the probabilistic data structure; receiving, by the processing device, a query for the data item; using, by the processing device, the probabilistic data structure to identify the first possible location of the data item in the distributed database, wherein the first possible location of the data item is on a second server in the distributed database; attempting to access, in view of the probabilistic data structure, the data item from the first possible location at the second server without querying a name server; receiving a response, via the second server, that the data item is not present at the second server; querying the name server for an exact location of the data item, wherein the name server comprises an index to identify locations of data items stored in the distributed database; receiving a response that comprises the exact location of the data item; and accessing the data item at the exact location. - View Dependent Claims (7, 8, 9)
-
-
10. A non-transitory computer readable storage medium having instructions that, when executed by a processing device, cause the processing device to perform operations comprising:
-
identify, by the processing device, a first possible location of a data item in a distributed database; generate, by the processing device, a first probabilistic data structure that comprises information to identify the first possible location of the data item in the distributed database, wherein the first possible location of the data item is on a second server in the distributed database; provide the first probabilistic data structure to a client computing device, wherein the client computing device is not involved with an operation to generate the probabilistic data structure; receive, by the processing device, a query for the data item in response to the client computing device failing to locate the data item in view of the first probabilistic data structure; determine a second possible location of the data item by querying a name server comprising an index to identify locations of data items stored in the distributed database; generate a second probabilistic data structure to identify the second possible location of the data item; and provide the second probabilistic data structure to the client computing device. - View Dependent Claims (11, 12, 13, 14)
-
-
15. A non-transitory computer readable storage medium having instructions that, when executed by a processing device of a client device, cause the processing device to perform operations comprising:
-
receive, by the processing device, a probabilistic data structure to probabilistically identify a plurality of possible locations of a data item in a distributed database, wherein the client computing device is not involved with an operation to generate the probabilistic data structure; receive a query for the data item; use, by the processing device, the probabilistic data structure to identify a first possible location of the data item in the distributed database, wherein the first possible location of the data item is on a second server in the distributed database; access, in view of the probabilistic data structure, the second server for the data item without querying a name server; receive a response, via the second server, that the data item is not present at the second server; query, by the processing device, the name server for an exact location of the data item, wherein the name server comprises an index to identify locations of data items stored in the distributed database and the exact location is on a third server in the distributed database; and accessing the data item at the third server. - View Dependent Claims (16)
-
-
17. A system comprising:
a server computing device comprising a memory and a processing device coupled to the memory, wherein the processing device is to; identify a first possible location of a data item in a distributed database; generate a probabilistic data structure that comprises information to identify the first possible location of the data item in the distributed database, wherein the first possible location of the data item is on a second server in the distributed database; and provide the probabilistic data structure to a client computing before the client computing device sends a request for the data item, wherein the client computing device is not involved with an operation to generate the probabilistic data structure receive a query for the data item in response to the client computing device failing to locate the data item in view of the probabilistic data structure; determine an exact location of the data item by querying a name server comprising an index to identify locations of data items stored in the distributed database; and provide the exact location of the data item to the client computing device. - View Dependent Claims (18)
Specification