Method for updating data in a distributed data storage system
First Claim
1. A method for updating data stored in a data storage system, the data storage system including a plurality of storage nodes, the method comprising:
- determining status information regarding a first data item, wherein the status information comprises an indication of identities of one or more data storage nodes that store the first data item, the first data item comprises a collection object that references one or more other data items in the data storage system, and the one or more other data items comprise at least one other collection object;
determining an identifier for a second data item stored on a second storage node, wherein the identifier uniquely identifies the second data item within the data storage system;
sending an update message for the first data item to a first storage node based on the status information indicating that the first storage node is one of the one or more data storage nodes that store the first data item, wherein the update message indicates that the identifier for the second data item is to be included in a reference to the second data item within in the collection object of the first data item;
determining that at least one data storage node that stores a version of the first data item is storing a version of the first data item with different contents than are included in a version of the first data item stored at the first data storage node; and
determining which version of the first data item should be stored in the data storage system by selecting an instance of the first data item that is stored at a largest number of data storage nodes as the version of the first data item that is to be distributed in the data storage system.
3 Assignments
0 Petitions
Accused Products
Abstract
There is provided a method implemented in a data storage system including a plurality of storage nodes connected to each other via a communications network. At least one storage node of the plurality of storage nodes may store a first data item. The data item may be arranged to include a reference to a second data item stored in the data storage system. The method may include providing an identifier of the second data item. The method may also include providing status information about the first data item. The identifier may be used to update the first data item, for example based on the status information. The first data item may be stored at the at least one storage node.
-
Citations
18 Claims
-
1. A method for updating data stored in a data storage system, the data storage system including a plurality of storage nodes, the method comprising:
-
determining status information regarding a first data item, wherein the status information comprises an indication of identities of one or more data storage nodes that store the first data item, the first data item comprises a collection object that references one or more other data items in the data storage system, and the one or more other data items comprise at least one other collection object; determining an identifier for a second data item stored on a second storage node, wherein the identifier uniquely identifies the second data item within the data storage system; sending an update message for the first data item to a first storage node based on the status information indicating that the first storage node is one of the one or more data storage nodes that store the first data item, wherein the update message indicates that the identifier for the second data item is to be included in a reference to the second data item within in the collection object of the first data item; determining that at least one data storage node that stores a version of the first data item is storing a version of the first data item with different contents than are included in a version of the first data item stored at the first data storage node; and determining which version of the first data item should be stored in the data storage system by selecting an instance of the first data item that is stored at a largest number of data storage nodes as the version of the first data item that is to be distributed in the data storage system. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A server comprising a processor and memory, the processor being configured to:
-
determine status information regarding a first data item, wherein the status information comprises an indication of identities of one or more data storage nodes that store the first data item, the first data item comprises a collection object that references one or more other data items in the data storage system, and the one or more other data items comprise at least one other collection object; determine an identifier for a second data item stored on a second storage node, wherein the identifier uniquely identifies the second data item within the data storage system; send an update message for the first data item to a first storage node based on the status information indicating that the first storage node is one of the one or more data storage nodes that store the first data item, wherein the update message indicates that the identifier for the second data item is to be included in a reference to the second data item within in the collection object of the first data item; determine that at least one data storage node that stores a version of the first data item is storing a version of the first data item with different contents than are included in a version of the first data item stored at the first data storage node; determine which version of the first data item should be stored in the data storage system by selecting an instance of the first data item that is stored at a largest number of data storage nodes as the version of the first data item that should be stored in the data storage system; and send a message indicating the version of the first data item to be stored to at least one storage node that stores a different version of the first data item. - View Dependent Claims (17)
-
-
18. A storage node comprising a processor and memory, the processor being configured to:
-
send status information to a server regarding a first data item, wherein the status information comprises an indication of identities of one or more data storage nodes that store the first data item, the status information indicates that the storage node is one of the one or more storage nodes that store the first data item, the first data item comprises a collection object that references one or more other data items in the data storage system, and the one or more other data items comprise at least one other collection object; receive an update message for the first data item from the server, wherein the update message indicates an identifier for a second data item stored on a second data storage node and the identifier uniquely identifies the second data item within the data storage system; update the first data item to include a reference to the second data item based on the received update message, the reference comprising the identifier; and send a local version of the data of the first data item to another data storage node that stores a different version of the first data item, wherein the local version of the first data item is sent to the another storage node based on the local version of the first data item being stored on a greater number of data storage nodes than store the different version of the first data item.
-
Specification