Resource allocation in distributed processing systems
First Claim
1. A distributed processing system configured to improve processing speeds, the system comprising:
- a source device configured to provide groups of data, wherein each of the groups of data is associated with one or several user authors, wherein the groups of data together comprise a processing task;
a plurality of independent processing units configured to receive a portion of the processing task, wherein the portion of the processing task comprises one or several of the groups of data, and wherein the independent processing units are configured to characterize one or several aspects of the one or several of the groups of data; and
a server communicatively connected to the source device and the plurality of independent processing units via a network, wherein the server is configured to;
receive the processing task;
identify a plurality of features in some of the groups of data;
generate a preliminary subset from the groups of data, by selecting an attribute identified in at least one of the plurality of features;
calculate a subset measure for the preliminary subset, wherein the subset measure indicates a degree to which the subset is representative of the processing task;
optimize the subset measure by replacing some of the groups of data of the preliminary subset with at least one replacement group of data that increases the subset measure by increasing the efficiency of use of the preliminary subset using at least one contribution factor measuring a contribution of a piece of data within the at least one replacement group; and
provide a final subset, including the at least one replacement group of data that replaces the preliminary subset with the replacement group, thereby increasing the subset measure, to the plurality of independent processing units.
1 Assignment
0 Petitions
Accused Products
Abstract
A distributed processing system is disclosed herein. The distributed processing system includes a server, a database server, and an application server that are interconnected via a network, and connected via the network to a plurality of independent processing units. The independent processing units can include an analysis engine that is machine-learning-capable, and thus uniquely completes its processing tasks. The server can provide one or several pieces of data to one or several of the independent processing units, can receive analysis results from these one or several independent processing units, and can update the result based on a value characterizing the machine learning of the independent processing unit.
10 Citations
20 Claims
-
1. A distributed processing system configured to improve processing speeds, the system comprising:
-
a source device configured to provide groups of data, wherein each of the groups of data is associated with one or several user authors, wherein the groups of data together comprise a processing task; a plurality of independent processing units configured to receive a portion of the processing task, wherein the portion of the processing task comprises one or several of the groups of data, and wherein the independent processing units are configured to characterize one or several aspects of the one or several of the groups of data; and a server communicatively connected to the source device and the plurality of independent processing units via a network, wherein the server is configured to; receive the processing task; identify a plurality of features in some of the groups of data; generate a preliminary subset from the groups of data, by selecting an attribute identified in at least one of the plurality of features; calculate a subset measure for the preliminary subset, wherein the subset measure indicates a degree to which the subset is representative of the processing task; optimize the subset measure by replacing some of the groups of data of the preliminary subset with at least one replacement group of data that increases the subset measure by increasing the efficiency of use of the preliminary subset using at least one contribution factor measuring a contribution of a piece of data within the at least one replacement group; and provide a final subset, including the at least one replacement group of data that replaces the preliminary subset with the replacement group, thereby increasing the subset measure, to the plurality of independent processing units. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for distributed processing, the method comprising:
-
receiving at a server a processing task, wherein the processing task comprises a plurality of groups of data; identifying with the server a plurality of features in some of the groups of data; generating a preliminary subset from the groups of data, by selecting an attribute identified in at least one of the plurality of features; calculating a subset measure for the preliminary subset, wherein the subset measure indicates a degree to which the subset is representative of the processing task; optimizing the subset measure by replacing some of the groups of data of the subset with at least one replacement group of data that increases the subset measure by increasing the efficiency of use of the preliminary subset using at least one contribution factor measuring a contribution of a piece of data within the at least one replacement group; and providing a final subset, including the at least one replacement group of data that replaces the preliminary subset with the replacement group, thereby increasing the subset measure, to a plurality of independent processing units. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification