×

Analyzing found data in a distributed storage and task network

  • US 9,304,858 B2
  • Filed: 12/06/2012
  • Issued: 04/05/2016
  • Est. Priority Date: 12/12/2011
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for execution by a computing device in a distributed storage and computing network that stores a large volume of data objects as pluralities of sets of encoded data slices in a plurality of distributed storage and task execution (DST) units, the method comprises:

  • establishing data identifying criteria for searching for like data objects of the large volume of data objects;

    establishing data analyzing criteria for analyzing found data objects;

    establishing distributed computing criteria based on the data identifying criteria, the data analyzing criteria, and a slice grouping storage type indication, wherein, for at least one of the found data objects, a DST unit of a set of DST units of the plurality of DST units stores a contiguous data chunk or error code data partition, and wherein the slice grouping storage type indication indicates processing of a contiguous data chunk or processing of the error coded data partition;

    distributing the data identifying criteria and the data analyzing criteria to a decode threshold number of DST units in accordance with the distributed computing criteria, wherein the decode threshold number corresponds to a minimum number of encoded data slices of a set of encoded data slices of the pluralities of sets of encoded data slices that is needed to recover a data segment of a corresponding one of the data objects, and wherein the decode threshold number of DST units is less than a number of the DST units storing the pluralities of sets of encoded data slices;

    receiving a set of network data partial resultants from the decode threshold number of DST units, wherein the decode threshold number of DST units generates the set of network data partial resultants based on searching at least some of the large volume of data objects in accordance with the data identifying criteria, the data analyzing criteria, and the distributed computing criteria; and

    processing the set of network data partial resultants to produce a network data resultant regarding the data on the network.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×