Resource allocation in distributed processing systems
First Claim
1. A distributed processing system, the system comprising:
- a source device configured to provide pieces of data for evaluation, wherein each of the pieces of data is associated with one or several user authors, wherein the pieces of data together comprise a processing task;
a plurality of independent processing units configured to receive a portion of the processing task, wherein the portion of the processing task comprises one or several of the pieces of data, and wherein the independent processing units are configured to characterize one or several aspects of the one or several of the pieces of data; and
a server communicatively connected to the source device and the plurality of independent processing units via a network, wherein the server is configured to;
receive a signal encoding the processing task;
identify a subset comprising some of the pieces of data in the processing task;
identify a plurality of features in each of the pieces of data;
generate an attribute vector for each of the pieces of data in the processing task, and wherein each of the plurality of attribute vectors comprises a dimension relating to the plurality of features of the corresponding piece of data for which the corresponding attribute vector is generated;
select pairs of attribute vectors from the plurality of attribute vectors;
determine a distance between ends of each of the pairs of attribute vectors;
identify a pair of the pairs of attribute vectors having ends separated by a greatest distance;
add the pieces of data associated with each of the attribute vectors having ends separated by a greatest distance to the subset of pieces of data; and
provide the subset of pieces of data to the plurality of independent processing units.
1 Assignment
0 Petitions
Accused Products
Abstract
A distributed processing system is disclosed herein. The distributed processing system includes a server, a database server, and an application server that are interconnected via a network, and connected via the network to a plurality of independent processing units. The independent processing units can include an analysis engine that is machine-learning-capable, and thus uniquely completes its processing tasks. The server can provide one or several pieces of data to one or several of the independent processing units, can receive analysis results from these one or several independent processing units, and can update the result based on a value characterizing the machine learning of the independent processing unit.
-
Citations
18 Claims
-
1. A distributed processing system, the system comprising:
-
a source device configured to provide pieces of data for evaluation, wherein each of the pieces of data is associated with one or several user authors, wherein the pieces of data together comprise a processing task; a plurality of independent processing units configured to receive a portion of the processing task, wherein the portion of the processing task comprises one or several of the pieces of data, and wherein the independent processing units are configured to characterize one or several aspects of the one or several of the pieces of data; and a server communicatively connected to the source device and the plurality of independent processing units via a network, wherein the server is configured to; receive a signal encoding the processing task; identify a subset comprising some of the pieces of data in the processing task; identify a plurality of features in each of the pieces of data; generate an attribute vector for each of the pieces of data in the processing task, and wherein each of the plurality of attribute vectors comprises a dimension relating to the plurality of features of the corresponding piece of data for which the corresponding attribute vector is generated; select pairs of attribute vectors from the plurality of attribute vectors; determine a distance between ends of each of the pairs of attribute vectors; identify a pair of the pairs of attribute vectors having ends separated by a greatest distance; add the pieces of data associated with each of the attribute vectors having ends separated by a greatest distance to the subset of pieces of data; and provide the subset of pieces of data to the plurality of independent processing units. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for automatically providing a final subset of pieces of data to an independent processor, the method comprising:
-
receiving a signal encoding a processing task comprising a plurality of pieces of dada; identifying a subset comprising some of the pieces of data in the processing task; identifying a plurality of features in each of the pieces of data; generating an attribute vector for each of the subset of the pieces of data in the processing task, wherein each of the plurality of attribute vectors comprises a dimension relating to the plurality of features of the corresponding piece of data for which the corresponding attribute vector is generated; selecting pairs of attribute vectors from the plurality of attribute vectors; determining a distance between ends of the attribute vectors in each of the pairs of selected attribute vectors; identifying at least one pair of the pairs of attribute vectors having ends separated by a greatest distance; generating a subset of pieces of data including a pair of attribute vectors having ends separated by the greatest distance; and providing the subset of pieces of data to a plurality of independent processing units. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification