Distributed processing system including a name node and a plurality of data nodes, and method of operating the same
First Claim
1. A method of operating a distributed processing system including a name node and a plurality of data nodes, comprising:
- transmitting an address request packet from the name node to each of the plurality of data nodes;
transmitting an address packet from each of the plurality of data nodes to the name node upon receiving the address request packet at each of the plurality of data nodes and upon each of the plurality of data nodes storing its own media access control (MAC) address in a payload of the address packet as a data node MAC address;
determining, by the name node, a node configuration of the distributed processing system based on the address packet received from each of the plurality of data nodes; and
processing data using a selected data node from among the plurality of data nodes based on the node configuration,wherein determining the node configuration comprises;
reading a source name node MAC address and a name node MAC address from a payload of an address return packet and a source internet protocol (IP) address from a header of the address return packet; and
determining whether the name node operates on a virtual machine by comparing the source name node MAC address and the name node MAC address.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of operating a distributed processing system including a name node and a plurality of data nodes includes transmitting an address request packet from the name node to each of the plurality of data nodes, transmitting an address packet from each of the plurality of data nodes to the name node upon receiving the address request packet at each of the plurality of data nodes and upon each of the plurality of data nodes storing its own media access control (MAC) address in a payload of the address packet as a data node MAC address, determining, by the name node, a node configuration of the distributed processing system based on the address packet received from each of the plurality of data nodes, and processing data using a selected data node from among the plurality of data nodes based on the node configuration.
53 Citations
19 Claims
-
1. A method of operating a distributed processing system including a name node and a plurality of data nodes, comprising:
-
transmitting an address request packet from the name node to each of the plurality of data nodes; transmitting an address packet from each of the plurality of data nodes to the name node upon receiving the address request packet at each of the plurality of data nodes and upon each of the plurality of data nodes storing its own media access control (MAC) address in a payload of the address packet as a data node MAC address; determining, by the name node, a node configuration of the distributed processing system based on the address packet received from each of the plurality of data nodes; and processing data using a selected data node from among the plurality of data nodes based on the node configuration, wherein determining the node configuration comprises; reading a source name node MAC address and a name node MAC address from a payload of an address return packet and a source internet protocol (IP) address from a header of the address return packet; and determining whether the name node operates on a virtual machine by comparing the source name node MAC address and the name node MAC address. - View Dependent Claims (2, 3, 4, 9, 10, 11, 12, 13, 14)
-
-
5. A method of operating a distributed processing system including a name node and a plurality of data nodes, comprising:
-
transmitting an address request packet from the name node to each of the plurality of data nodes; transmitting an address packet from each of the plurality of data nodes to the name node upon receiving the address request packet at each of the plurality of data nodes and upon each of the plurality of data nodes storing its own media access control (MAC) address in a payload of the address packet as a data node MAC address; determining, by the name node, a node configuration of the distributed processing system based on the address packet received from each of the plurality of data nodes; and processing data using a selected data node from among the plurality of data nodes based on the node configuration, wherein determining the node configuration comprises; reading a source MAC address and a source internet protocol (IP) address from a header of the address packet, and the data node MAC address from the payload of the address packet; and determining whether a data node from among the plurality of data nodes corresponding to the source IP address operates on a virtual machine by comparing the source MAC address and the data node MAC address. - View Dependent Claims (6, 7, 8)
-
-
15. A method of operating a distributed processing system including a name node and a plurality of data nodes, comprising:
-
transmitting a multiple address return request packet from the name node to each of the plurality of data nodes upon the name node storing its own media access control (MAC) address in a payload of the multiple address return request packet as a name node MAC address; transmitting a multiple address return packet from each of the plurality of data nodes to the name node upon receiving the multiple address return request packet at each of the plurality of data nodes, upon each of the plurality of data nodes storing a source MAC address in a payload of the multiple address return packet as a source name node MAC address, upon each of the plurality of data nodes storing the name node MAC address in the payload of the multiple address return packet, and upon each of the plurality of data nodes storing its own MAC address in the payload of the multiple address return packet as a data node MAC address, wherein the source MAC address is included in a header of the multiple address return request packet and the name node MAC address is included in the payload of the multiple address return request packet; determining, by the name node, a node configuration of the distributed processing system based on the multiple address return packet received from each of the plurality of data nodes; and processing data using a selected data node selected from among the plurality of data nodes based on the node configuration. - View Dependent Claims (16, 17, 18, 19)
-
Specification