Data processing method and computer system
First Claim
1. A data processing method of a computer system, the computer system including a data entry server, a plurality of nodes connected to the data entry server via a first network, and a storage device connected to the data entry server and the plurality of nodes via a second network, the storage device including a logical volume having stored therein at least data as data sources that can be read by each of the plurality of nodes, the method comprising:
- causing the data entry server to estimate a processing time required to execute each of first and second data entry methods, and to select one of the first and second data entry methods based on the estimated processing time; and
causing the data entry server or the plurality of nodes to execute data entry processing based on the selected data entry method,wherein the first data entry method is a method in which the data entry server reads data from the logical volume, divides the data into a plurality of blocks, and sends the data blocks to the plurality of nodes with no overlaps,wherein the second data entry method is a method in which the plurality of nodes, in response to an instruction from the data entry server, reads the data from the logical volume with no overlaps by dividing the data into a plurality of blocks, andwherein the storage device storing the logical volume is separated from the plurality of nodes so that the logical volume does not constitute a distributed file system which is composed of the plurality of nodes.
1 Assignment
0 Petitions
Accused Products
Abstract
A technique for increasing the speed of data entry into a distributed processing platform is provided. According to a computer system of the present invention, when data is entered into each node in a distributed manner, the most efficient entry method (a method with the highest processing speed) is selected from among a plurality of entry methods, so that the data is entered into each node with no overlaps in accordance with the selected method.
14 Citations
15 Claims
-
1. A data processing method of a computer system, the computer system including a data entry server, a plurality of nodes connected to the data entry server via a first network, and a storage device connected to the data entry server and the plurality of nodes via a second network, the storage device including a logical volume having stored therein at least data as data sources that can be read by each of the plurality of nodes, the method comprising:
-
causing the data entry server to estimate a processing time required to execute each of first and second data entry methods, and to select one of the first and second data entry methods based on the estimated processing time; and causing the data entry server or the plurality of nodes to execute data entry processing based on the selected data entry method, wherein the first data entry method is a method in which the data entry server reads data from the logical volume, divides the data into a plurality of blocks, and sends the data blocks to the plurality of nodes with no overlaps, wherein the second data entry method is a method in which the plurality of nodes, in response to an instruction from the data entry server, reads the data from the logical volume with no overlaps by dividing the data into a plurality of blocks, and wherein the storage device storing the logical volume is separated from the plurality of nodes so that the logical volume does not constitute a distributed file system which is composed of the plurality of nodes. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer system comprising:
-
a data entry server; a plurality of nodes connected to the data entry server via a first network; and a storage device connected to the data entry server and the plurality of nodes via a second network, the storage device including a logical volume having stored therein at least data as data sources that can be read by each of the plurality of nodes, and the storage device being a non-transitory, tangible computer-readable storage medium, wherein the storage device storing the logical volume is separated from the plurality of nodes so that the logical volume does not constitute a distributed file system which is composed of the plurality of nodes, wherein the data entry server estimates a processing time required to execute each of first and second data entry methods, and selects one of the first and second data entry methods based on the estimated processing time, wherein the data entry server or the plurality of nodes executes data entry processing based on the selected data entry method, wherein the first data entry method is a method in which the data entry server reads data from the logical volume, divides the data into a plurality of blocks, and sends the data blocks to the plurality of nodes with no overlaps, and wherein the second data entry method is a method in which the plurality of nodes, in response to an instruction from the data entry server, reads the data from the logical volume with no overlaps by dividing the data into a plurality of blocks. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
Specification