Method and distributed database file system for implementing self-describing distributed file objects
First Claim
1. A method for implementing self-describing file objects comprising the steps of:
- creating a node group to define multiple computer systems for storing the file object;
identifying a hash algorithm for applying to data records;
identifying a partition distribution map for distributing data to each of said multiple computer systems utilizing a set of predetermined hash algorithm results;
identifying remote system information for each of said multiple computer systems; and
creating a file object in each of said multiple computer systems;
each said file object including said hash algorithm, said partition distribution map, and said remote system information.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus are provided for implementing self-describing file objects. A node group is created for defining multiple computer systems for storing data. A hash algorithm for applying to data records is identified. A partition distribution map for distributing data to each of the multiple computer systems utilizing a set of predetermined hash algorithm results and remote system information for each of the multiple computer systems are identified. A file object is created in each of the multiple computer systems. Each the file objects includes the hash algorithm, the partition distribution map, and the remote system information. A data record is inserted into one of the distributed file objects by receiving the data record, applying the hash algorithm to the received data record, comparing the hash algorithm result with the partition distribution map to identify the particular computer system for the data record, utilizing the system information to establish connection to that system. The file objects are fully self-describing, eliminating the need for additional objects to be addressed, opened, paged into memory or the like.
-
Citations
16 Claims
-
1. A method for implementing self-describing file objects comprising the steps of:
-
creating a node group to define multiple computer systems for storing the file object; identifying a hash algorithm for applying to data records; identifying a partition distribution map for distributing data to each of said multiple computer systems utilizing a set of predetermined hash algorithm results; identifying remote system information for each of said multiple computer systems; and creating a file object in each of said multiple computer systems;
each said file object including said hash algorithm, said partition distribution map, and said remote system information. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. Apparatus for implementing self-describing file objects comprising:
-
means for creating a node group to define multiple computer systems for storing the file object; identifying a hash algorithm for applying to data records; means for identifying a partition distribution map for distributing data to each of said multiple computer systems utilizing a set of predetermined hash algorithm results; means for identifying remote system information for each of said multiple computer systems; and means for creating a file object in each of said multiple computer systems;
each said file object including said hash algorithm, said partition distribution map, and said remote system information. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
14. A computer program product for use in a multiple computer system, each computer system having a processor, a memory, and a network adapter, the computer program product comprising:
-
a record medium; means, recorded on said recording medium, for creating a node group to define multiple computer systems for storing the file object; means, recorded on said recording medium, for identifying a hash algorithm for applying to data records; means, recorded on said recording medium, for identifying a partition distribution map for distributing data to each of said multiple computer systems utilizing a set of predetermined hash algorithm results; means, recorded on said recording medium, for identifying remote system information for each of said multiple computer systems; and means, recorded on said recording medium, for creating a file object in each of said multiple computer systems;
each said file object including said hash algorithm, said partition distribution map, and said remote system information. - View Dependent Claims (15, 16)
-
Specification