Batching tuples
Abstract
A technique of batching tuples can include determining a plurality of key-attributes for a plurality of tuples, creating a batch tuple, and calculating a hash value for the batch tuple.
14 Claims
1. A method for batching tuples comprising:
- determining a plurality of key-attributes for each of a plurality of tuples, wherein each key-attribute of each tuple describes a type or classification of data in the tuple;
- creating, by a processor of a publisher operator, a batch tuple in a buffer containing a plurality of lists based on the plurality of key-attributes of the plurality of tuples, including:
  - creating a series of intermediate batch tuples, wherein each of the intermediate batch tuples is created for one of the tuples and contains the plurality of lists, and
  - adding the plurality of key-attributes of each of the tuples into the plurality of lists of one of the intermediate batch tuples in a sequential order such that a last one of the series of intermediate batch tuples contains the plurality of key-attributes of all of the tuples in the plurality of lists;
- calculating, by the processor, a hash value for the batch tuple in the buffer based on the plurality of lists;
- when the buffer containing the batch tuple is full, determining, by the processor, a subscriber operator instance having a same hash value as the batch tuple; and
- sending, by the processor, the batch tuple in the buffer to the subscriber operator instance that has the same hash value as the batch tuple.

Dependent claims: 2, 3, 4, 5
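The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not taken from the specification and makes several assumptions: the batch tuple holds one list per key-attribute position, each subscriber operator instance is identified by its index, "same hash value" is read as a hash reduced modulo the number of instances, and the buffer is "full" once it holds `capacity` tuples. The names `BatchTuple`, `Publisher`, `with_tuple`, and `hash_value` are hypothetical.

```python
import zlib
from dataclasses import dataclass


@dataclass(frozen=True)
class BatchTuple:
    """Hypothetical batch tuple: one list (stored as a tuple) per key-attribute."""
    lists: tuple

    @staticmethod
    def empty(num_keys):
        return BatchTuple(tuple(() for _ in range(num_keys)))

    def with_tuple(self, key_attributes):
        # Create the next intermediate batch tuple by appending each
        # key-attribute of one input tuple to its corresponding list.
        return BatchTuple(
            tuple(col + (key,) for col, key in zip(self.lists, key_attributes))
        )

    def size(self):
        return len(self.lists[0]) if self.lists else 0

    def hash_value(self, num_instances):
        # Assumption: a CRC32 over the lists, reduced modulo the instance count.
        # zlib.crc32 is stable across runs, unlike the built-in hash() for strings.
        return zlib.crc32(repr(self.lists).encode("utf-8")) % num_instances


class Publisher:
    """Hypothetical publisher operator that buffers one batch tuple at a time."""

    def __init__(self, key_names, capacity, subscribers):
        self.key_names = key_names
        self.capacity = capacity
        self.subscribers = subscribers  # index in this list = instance hash value
        self.batch = BatchTuple.empty(len(key_names))

    def publish(self, record):
        # Determine the key-attributes of the incoming tuple.
        keys = [record[name] for name in self.key_names]
        # The newest intermediate batch tuple replaces the previous one.
        self.batch = self.batch.with_tuple(keys)
        if self.batch.size() == self.capacity:  # buffer is full
            target = self.batch.hash_value(len(self.subscribers))
            self.subscribers[target].append(self.batch)  # "send" the batch
            self.batch = BatchTuple.empty(len(self.key_names))


subscribers = [[] for _ in range(4)]
publisher = Publisher(["word", "country"], capacity=3, subscribers=subscribers)
for record in [
    {"word": "a", "country": "US", "count": 1},
    {"word": "b", "country": "DE", "count": 2},
    {"word": "c", "country": "FR", "count": 3},
]:
    publisher.publish(record)
```

After the third record the buffer is full, so the last intermediate batch tuple, which holds the key-attributes of all three tuples, is delivered to exactly one subscriber instance. Non-key fields such as `count` are ignored here only to keep the sketch short.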
6. A non-transitory computer-readable medium storing a set of instructions executable by a processing resource to cause a computer to:
- determine a plurality of key-attributes for each of a plurality of tuples, wherein each key-attribute of each tuple describes a type or classification of data in the tuple;
- create a particular batch tuple in a buffer containing a plurality of lists, by:
  - creating a series of intermediate batch tuples, wherein each of the intermediate batch tuples contains the plurality of lists, and
  - adding the plurality of key-attributes of each of the tuples into the plurality of lists of one of the intermediate batch tuples in a sequential order such that a last one of the series of intermediate batch tuples contains the plurality of key-attributes of all of the tuples in the plurality of lists;
- calculate a hash value for the batch tuple based on the hash value for the plurality of key-attributes; and
- when buffer containing the particular batch tuple is full, determining a corresponding subscriber operator instance having a same hash value as the particular batch tuple and send the particular batch tuple to the corresponding subscriber operator instance.

Dependent claims: 7, 8, 9, 12, 13
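Claim 6 recites a batch hash that is based on the hash value for the key-attributes rather than on the lists directly. One possible reading, sketched below in Python, hashes each key-attribute separately and folds the results into a single value. The folding rule (multiply by 31 and add), the use of CRC32, and the function names `key_hash` and `batch_hash` are all assumptions for illustration; the claim does not specify them.

```python
import zlib


def key_hash(value):
    # Hypothetical per-key-attribute hash; CRC32 is deterministic across runs.
    return zlib.crc32(repr(value).encode("utf-8"))


def batch_hash(lists, num_instances):
    # Fold the per-attribute hashes in list order, then reduce the result
    # to the index of a subscriber operator instance.
    combined = 0
    for column in lists:
        for value in column:
            combined = (combined * 31 + key_hash(value)) & 0xFFFFFFFF
    return combined % num_instances


lists = (("a", "b", "c"), ("US", "DE", "FR"))
instance = batch_hash(lists, num_instances=4)
```

Because the fold is order-sensitive, two batches with the same key-attributes in a different sequence may map to different instances; a design that needs order-independent routing would have to use a commutative combination instead.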
10. A system for batching tuples in a distributed streaming system, the system comprising:
- a processing resource;
- a memory resource storing instructions that when executed cause the processing resource to:
  - determine a plurality of key-attributes for each of a plurality of tuples, wherein each key-attribute of each tuple describes a type or classification of data in the tuple;
  - create a particular batch tuple in a buffer containing a plurality of lists, by:
    - creating a series of intermediate batch tuples, wherein each of the intermediate batch tuples contains the plurality of lists, and
    - adding the plurality of key-attributes of each of the tuples into the plurality of lists of one of the intermediate batch tuples in a sequential order such that a last one of the series of intermediate batch tuples contains the plurality of key-attributes of all of the tuples in the plurality of lists;
  - calculate a hash value for the particular batch tuple based on the plurality of key-attributes; and
  - when the buffer containing the particular batch tuple is full, determine a corresponding subscriber operator instance having a same hash value as the particular batch tuple and send the particular batch tuple to the corresponding subscriber operator instance.

Dependent claims: 11, 14
Specification