Computer system and process for transferring multiple high bandwidth streams of data between multiple storage units and multiple applications in a scalable and reliable manner
First Claim
1. A file system for allowing one or more client systems to access data, comprising:
- storage for storing the data in files; and
a processor connected to the storage;
wherein the processor receives requests for data to be read from a file on the storage, wherein the requests include an estimate of time by which the request should be serviced by the storage; and
wherein the processor, in response to a request received, retrieves the requested data from the storage;
wherein the processor returns the retrieved data with a priority according to the estimate of time included in the request.
8 Assignments
0 Petitions
Accused Products
Abstract
Multiple applications request data from multiple storage units over a computer network. The data is divided into segments and each segment is distributed randomly on one of several storage units, independent of the storage units on which other segments of the media data are stored. At least one additional copy of each segment also is distributed randomly over the storage units, such that each segment is stored on at least two storage units. This random distribution of multiple copies of segments of data improves both scalability and reliability. When an application requests a selected segment of data, the request is processed by the storage unit with the shortest queue of requests. Random fluctuations in the load applied by multiple applications on multiple storage units are balanced nearly equally over all of the storage units. This combination of techniques results in a system which can transfer multiple, independent high-bandwidth streams of data in a scalable manner in both directions between multiple applications and multiple storage units.
-
Citations
21 Claims
-
1. A file system for allowing one or more client systems to access data, comprising:
-
storage for storing the data in files; and a processor connected to the storage; wherein the processor receives requests for data to be read from a file on the storage, wherein the requests include an estimate of time by which the request should be serviced by the storage; and wherein the processor, in response to a request received, retrieves the requested data from the storage; wherein the processor returns the retrieved data with a priority according to the estimate of time included in the request. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A data storage system for allowing one or more client systems to access data over a computer network, comprising:
-
a plurality of storage units for storing the data and interconnected by the computer network, wherein the data is stored on the plurality of storage units in files; wherein each client system comprises; a network interface connected to the computer network for receiving and sending data over the computer network; and a processor connected to the network interface; wherein the processor instructs the network interface to send a request for data to be read from a file to one or more of the plurality of storage units, wherein the request includes an estimate of time by which the request should be serviced by the selected storage unit; and wherein each storage unit comprises; storage for storing the data; a network interface connected to the computer network for receiving and sending data over the computer network; and a processor connected to the network interface and the storage; wherein the processor, in response to a request received over the computer network from one of the client systems for data from a file, retrieves the requested data from the storage; wherein the processor instructs the network interface to send the retrieved data to the client system; and wherein the processor prioritizes sending of data according to the estimate of time included in the request. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A distributed data storage system for allowing one or more client systems to access data over a computer network, comprising:
-
a plurality of independent storage units for storing the data and interconnected by the computer network; wherein the data is stored on the plurality of storage units in files, wherein each file includes segments of data, wherein each segment has an identifier, and wherein the segments of data are distributed among the plurality of storage units; computer readable storage including data that associates, for each segment of a file, the identifier of the segment with an indication of the storage unit on which the segment is stored; wherein each client system comprises; a network interface connected to the computer network for receiving and sending data over the computer network; and a processor connected to the network interface; wherein the processor accesses the data that associates, for each segment of a file, the identifier of the segment with an indication of the storage unit on which the segment is stored, to select a storage unit for each segment to be read from the file; wherein the processor instructs the network interface to send a request, for each segment to be read from the file, to the selected storage unit for the segment, wherein the request includes the identifier of the requested segment of the file and an estimate of time by which the request should be serviced by the selected storage unit; and wherein each storage unit comprises; storage for storing the data; a memory containing data defining information that associates, for each segment stored on the storage unit, the identifier of the segment with the location of the segment in the storage; a network interface connected to the computer network for receiving and sending data over the computer network; and a processor connected to the network interface, the memory and the storage; wherein the processor, in response to a request received over the computer network from one of the client systems for a segment of a file, determines the location of the segment in the storage using the information that associates the identifier of the segment with the location of the segment in the storage, and retrieves the requested segment from the storage; wherein the processor instructs the network interface to send the retrieved segment to the client system; and wherein the processor prioritizes sending of segments according to the estimate of time included in the request. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification