Batch compression management of messages
First Claim
1. A computer-implemented method of batch compression management of messages, comprising:
- intercepting, by one or more processors, in middleware between a first distributed system and a second distributed system, a new message originating from the first distributed system being sent to a batch for transmission to the second distributed system;
determining, by the one or more processors, if a message schema of the new message is of a predefined similarity to message schemas of messages previously assigned to multiple batch pools comprising sub-sets of the batch, wherein the similarity in message schemas comprises a similarity between a majority of elements in a payload of the new message to a majority of elements in the payloads of the messages previously assigned to the multiple batch pools comprising the sub-sets of the batch, and wherein the determining comprises;
obtaining, by the one or more processors, from a batch pool mapping database, the message schemas of the messages previously assigned to the multiple batch pools comprising the sub-sets of the batch;
comparing, by the one or more processors, the message schemas of the previously assigned messages the to the message schema of the new message; and
determining, by the one or more processors, if a given batch pool of the multiple batch pools comprises one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message;
based on determining that the given batch pool of the multiple batch pools comprises one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message, assigning, by the one or more processors, the message from the batch to the given batch pool;
based on determining that the given batch pool of the multiple batch pools does not comprise one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message, creating, by the one or more processors, a new batch pool for the new message and assigning the new message to the new batch pool;
performing, by the one or more processors, a customized compression on the given batch pool using a dictionary of properties common to the one or more message schemas of the one or more messages, wherein the customized compression is an optimized compression that minimizes a number of entries utilized in the dictionary and reduces overhead in the one or more processors, based on the messages in the given batch pool comprising similar message schemas; and
transmitting, by the one or more processors, the compressed batch pool over an input/output channel to the second distributed system.
1 Assignment
0 Petitions
Accused Products
Abstract
Batch compression management of messages. A message being sent to a batch for transmission is intercepted. The message is assigned to a batch pool, in which multiple batch pools are provided as sub-sets of the batch. The assigning of the message to the batch pool is based on a similarity of properties of the message with other messages and determined by looking up the properties in a batch pool mapping database. The message is added to the batch pool or a new batch pool is created for the message. A batch pool of messages is compressed using a dictionary of the properties common to at least some of the messages in the batch pool. The compressed batch pools are transmitted over an input/output channel.
-
Citations
16 Claims
-
1. A computer-implemented method of batch compression management of messages, comprising:
-
intercepting, by one or more processors, in middleware between a first distributed system and a second distributed system, a new message originating from the first distributed system being sent to a batch for transmission to the second distributed system; determining, by the one or more processors, if a message schema of the new message is of a predefined similarity to message schemas of messages previously assigned to multiple batch pools comprising sub-sets of the batch, wherein the similarity in message schemas comprises a similarity between a majority of elements in a payload of the new message to a majority of elements in the payloads of the messages previously assigned to the multiple batch pools comprising the sub-sets of the batch, and wherein the determining comprises; obtaining, by the one or more processors, from a batch pool mapping database, the message schemas of the messages previously assigned to the multiple batch pools comprising the sub-sets of the batch; comparing, by the one or more processors, the message schemas of the previously assigned messages the to the message schema of the new message; and determining, by the one or more processors, if a given batch pool of the multiple batch pools comprises one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message; based on determining that the given batch pool of the multiple batch pools comprises one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message, assigning, by the one or more processors, the message from the batch to the given batch pool; based on determining that the given batch pool of the multiple batch pools does not comprise one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message, creating, by the one or more processors, a new batch pool for the new message and assigning the new message to the new batch pool; performing, by the one or more processors, a customized compression on the given batch pool using a dictionary of properties common to the one or more message schemas of the one or more messages, wherein the customized compression is an optimized compression that minimizes a number of entries utilized in the dictionary and reduces overhead in the one or more processors, based on the messages in the given batch pool comprising similar message schemas; and transmitting, by the one or more processors, the compressed batch pool over an input/output channel to the second distributed system. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer program product for batch compression management of messages, comprising:
-
a computer readable storage medium readable by a processing circuit and storing instructions for execution by the processing circuit for performing a method comprising; intercepting, by one or more processors, in middleware between a first distributed system and a second distributed system, a new message originating from the first distributed system being sent to a batch for transmission to the second distributed system; determining, by the one or more processors, if a message schema of the new message is of a predefined similarity to message schemas of messages previously assigned to multiple batch pools comprising sub-sets of the batch, wherein the determining comprises; obtaining, by the one or more processors, from a batch pool mapping database, the message schemas of the messages previously assigned to the multiple batch pools comprising the sub-sets of the batch; comparing, by the one or more processors, the message schemas of the previously assigned messages the to the message schema of the new message; and determining, by the one or more processors, if a given batch pool of the multiple batch pools comprises one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message; based on determining that the given batch pool of the multiple batch pools comprises one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message, assigning, by the one or more processors, the message from the batch to the given batch pool; based on determining that the given batch pool of the multiple batch pools does not comprise one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message, creating, by the one or more processors, a new batch pool for the new message and assigning the new message to the new batch pool; performing, by the one or more processors, a customized compression on the given batch pool using a dictionary of properties common to the one or more message schemas of the one or more messages at, wherein the customized compression is an optimized compression that minimizes a number of entries utilized in the dictionary and reduces overhead in the one or more processors, based on the messages in the given batch pool comprising similar message schemas; and
transmitting, by the one or more processors, the compressed batch pool over an input/output channel to the second distributed system. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A computer system for batch compression management of messages, comprising:
-
a memory; and a processing device in communication with the memory, wherein the computer system is configured to perform a method, said method comprising; intercepting, by the processing device, in middleware between a first distributed system and a second distributed system, a new message originating from the first distributed system being sent to a batch for transmission to the second distributed system; determining, by the processing device, if a message schema of the new message is of a predefined similarity to message schemas of messages previously assigned to multiple batch pools comprising sub-sets of the batch, wherein the determining comprises; obtaining, by the processing device, from a batch pool mapping database, the message schemas of the messages previously assigned to the multiple batch pools comprising the sub-sets of the batch; comparing, by the processing device, the message schemas of the previously assigned messages the to the message schema of the new message; and determining, by the processing device, if a given batch pool of the multiple batch pools comprises one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message; based on determining that the given batch pool of the multiple batch pools comprises one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message, assigning, by the processing device, the message from the batch to the given batch pool; based on determining that the given batch pool of the multiple batch pools does not comprise one or more messages with one or more message schemas of the predefined similarity to the message schema of the new message, creating, by the processing device, a new batch pool for the new message and assigning the new message to the new batch pool; performing, by the processing device, a customized compression on the given a batch pool using a dictionary of properties common to the one or more message schemas of the one or more messages, wherein the customized compression is an optimized compression that minimizes a number of entries utilized in the dictionary and reduces overhead in the one or more processors, based on the at messages in the given batch pool comprising similar message schemas; and transmitting, by the processing device, the compressed batch pool over an input/output channel to the second distributed system. - View Dependent Claims (13, 14, 15, 16)
-
Specification