Systems and computer implemented methods for semantic data compression
First Claim
1. A computer-implemented method for semantic data compression and transmission, comprising:
receiving, at a first computer, a query from a second computer to transmit a plurality of artifacts to the second computer over a network, wherein the plurality of artifacts are from a corpus of documents;
queuing, at the first computer, the plurality of artifacts in response to the received query, wherein the plurality of artifacts are semantically related;
performing packet compression, at the first computer, on the plurality of queued artifacts, wherein packet compression comprises discarding artifacts determined to have been previously transmitted to the second computer;
performing semantic compression, at the first computer, on the plurality of queued artifacts based on one or more network decision variables, wherein the semantic compression comprises;
prioritizing, at the first computer, the plurality of queued artifacts, wherein prioritizing includes assigning a priority to each artifact based on network analytics and including the one or more network decision variables, wherein the network analytics are based on the entire corpus of documents,
determining, at the first computer, for each of the plurality of prioritized queued artifacts, whether to send the artifact or only a link to the artifact based on the priority assigned to the artifact, based on a size of the artifact and available bandwidth, the determining resulting in a first set of artifacts in the plurality of queued artifacts to transmit and a second set of artifacts in the plurality of queued artifacts to only send links, and
replacing, at the first computer, unnecessary content in the first set of prioritized queued artifacts with one or more identifiers, wherein replacing is based on the frequency of the content across the entire corpus of documents, and wherein replacing reduces a size of the plurality of artifacts during transfer;
calculating, at the first computer, an optimum batch size of the compressed queued artifacts;
batching, at the first computer, the compressed queued artifacts into one or more batches based on the calculating; and
transmitting, by the first computer, the one or more batches over the network to the second computer.
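The packet compression and send-or-link steps recited above can be illustrated with a minimal Python sketch. The claim does not give a concrete decision rule, so everything here is an assumption made for illustration: the `Artifact` fields, the `min_priority` threshold, the byte budget standing in for "available bandwidth", and the greedy highest-priority-first order are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Artifact:
    artifact_id: str
    size_bytes: int
    priority: float  # assumed to be produced by corpus-wide network analytics


def packet_compress(queue, already_sent_ids):
    """Discard artifacts previously transmitted to the requesting computer."""
    return [a for a in queue if a.artifact_id not in already_sent_ids]


def split_send_or_link(queue, bandwidth_budget_bytes, min_priority):
    """Return (to_send, link_only), visiting higher-priority artifacts first.

    Assumed rule (not from the claim): an artifact is sent in full only if
    its priority reaches min_priority and it still fits the remaining budget.
    """
    to_send, link_only = [], []
    remaining = bandwidth_budget_bytes
    for a in sorted(queue, key=lambda a: a.priority, reverse=True):
        if a.priority >= min_priority and a.size_bytes <= remaining:
            to_send.append(a)
            remaining -= a.size_bytes
        else:
            link_only.append(a)
    return to_send, link_only


queue = [
    Artifact("doc-a", 400, 0.9),
    Artifact("doc-b", 700, 0.8),
    Artifact("doc-c", 100, 0.2),
    Artifact("doc-d", 300, 0.7),
]
# doc-d was already delivered, so packet compression drops it.
fresh = packet_compress(queue, already_sent_ids={"doc-d"})
to_send, link_only = split_send_or_link(
    fresh, bandwidth_budget_bytes=1000, min_priority=0.5
)
print([a.artifact_id for a in to_send])    # full artifacts
print([a.artifact_id for a in link_only])  # links only
```

In this sketch an artifact can end up as a link either because its priority is low or because it no longer fits the budget, which mirrors the claim's three inputs (priority, size, available bandwidth) without claiming to be the patented rule.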
Abstract
Computer implemented methods and systems directed to a technological improvement in electronic data compression and transmission between two computer systems using semantic analysis are disclosed. The method includes the step of compressing, at a first computer, a plurality of queued artifacts based on one or more network decision variables. The compression includes prioritizing the queued artifacts. The compression further includes determining a first set of artifacts in a set of queued artifacts to transmit and a second set of artifacts in a set of queued artifacts to only send links. The compression further includes replacing unnecessary content in the set of queued artifacts with one or more identifiers. The method further includes the step of transmitting, from the first computer, one or more batches of the compressed data over a network to a second computer.
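One way to picture the replacement of frequently repeated content with identifiers is a dictionary substitution keyed on how often a unit of text appears across the corpus. The sketch below is only an assumption-laden illustration: treating a "unit" as a non-empty line, the `min_doc_count` threshold, the `@N` identifier format, and the `restore` step on the receiving side (which presumes the second computer holds the same table) are all hypothetical and are not taken from the source.

```python
from collections import Counter


def split_units(text):
    """Treat each non-empty line as one unit of content (an assumption)."""
    return [line for line in text.split("\n") if line]


def build_identifier_table(corpus, min_doc_count):
    """Map units found in at least min_doc_count documents to short identifiers."""
    doc_counts = Counter()
    for doc in corpus:
        doc_counts.update(set(split_units(doc)))  # count each unit once per document
    frequent = sorted(u for u, n in doc_counts.items() if n >= min_doc_count)
    return {unit: f"@{i}" for i, unit in enumerate(frequent)}


def replace_with_identifiers(text, table):
    """Swap every frequent unit for its identifier before transfer."""
    return "\n".join(table.get(unit, unit) for unit in split_units(text))


def restore(text, table):
    """Inverse mapping, assuming the receiver has the same table.

    Limitation of this sketch: a unit that literally equals an identifier
    such as "@0" would be mis-restored, and blank lines are not preserved.
    """
    reverse = {ident: unit for unit, ident in table.items()}
    return "\n".join(reverse.get(unit, unit) for unit in split_units(text))


corpus = [
    "CONFIDENTIAL - internal use only\nQuarterly results for region A",
    "CONFIDENTIAL - internal use only\nQuarterly results for region B",
    "Meeting notes\nCONFIDENTIAL - internal use only",
]
table = build_identifier_table(corpus, min_doc_count=3)
compressed = replace_with_identifiers(corpus[0], table)
print(len(corpus[0]), "->", len(compressed))
```

Because the table is built from the whole corpus rather than from a single artifact, content that is boilerplate everywhere shrinks to a few bytes in every artifact that carries it.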
54 Citations
18 Claims
1. A computer-implemented method for semantic data compression and transmission, comprising:
receiving, at a first computer, a query from a second computer to transmit a plurality of artifacts to the second computer over a network, wherein the plurality of artifacts are from a corpus of documents;
queuing, at the first computer, the plurality of artifacts in response to the received query, wherein the plurality of artifacts are semantically related;
performing packet compression, at the first computer, on the plurality of queued artifacts, wherein packet compression comprises discarding artifacts determined to have been previously transmitted to the second computer;
performing semantic compression, at the first computer, on the plurality of queued artifacts based on one or more network decision variables, wherein the semantic compression comprises;
prioritizing, at the first computer, the plurality of queued artifacts, wherein prioritizing includes assigning a priority to each artifact based on network analytics and including the one or more network decision variables, wherein the network analytics are based on the entire corpus of documents,
determining, at the first computer, for each of the plurality of prioritized queued artifacts, whether to send the artifact or only a link to the artifact based on the priority assigned to the artifact, based on a size of the artifact and available bandwidth, the determining resulting in a first set of artifacts in the plurality of queued artifacts to transmit and a second set of artifacts in the plurality of queued artifacts to only send links, and
replacing, at the first computer, unnecessary content in the first set of prioritized queued artifacts with one or more identifiers, wherein replacing is based on the frequency of the content across the entire corpus of documents, and wherein replacing reduces a size of the plurality of artifacts during transfer;
calculating, at the first computer, an optimum batch size of the compressed queued artifacts;
batching, at the first computer, the compressed queued artifacts into one or more batches based on the calculating; and
transmitting, by the first computer, the one or more batches over the network to the second computer.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A cloud transfer service system for semantic data compression and transmission, comprising:
a processor;
a network interface coupled to the processor, wherein the network interface is communicatively coupled to a network;
a data storage system; and
a non-transitory memory coupled to the processor storing computer readable program instructions, wherein the computer readable program instructions configure the processor to perform the steps of;
receiving a query from a second computer over the network to transmit a plurality of artifacts to the second computer over the network, wherein the plurality of artifacts are from a corpus of documents;
queuing the plurality of artifacts in response to the received query, wherein the plurality of artifacts are semantically related;
performing packet compression on the plurality of queued artifacts, wherein packet compression comprises discarding artifacts determined to have been previously transmitted to the second computer;
performing semantic compression on the plurality of queued artifacts based on one or more network decision variables, wherein the semantic compression comprises;
prioritizing the plurality of queued artifacts, wherein prioritizing includes assigning a priority to each artifact based on network analytics and including the one or more network decision variables, wherein the network analytics are based on the entire corpus of documents;
determining, for each of the plurality of prioritized queued artifacts, whether to send the artifact or only a link to the artifact based on the priority assigned to the artifact, based on a size of the artifact and available bandwidth, the determining resulting in a first set of artifacts in the plurality of queued artifacts to transmit and a second set of artifacts in the plurality of queued artifacts to only send links, and
replacing unnecessary content in the first set of prioritized queued artifacts with one or more identifiers, wherein replacing is based on the frequency of the content across the entire corpus of documents, and wherein replacing reduces a size of the plurality of artifacts during transfer;
calculating an optimum batch size of the set of compressed queued artifacts;
batching the compressed queued artifacts into one or more batches based on the calculating; and
transmitting the one or more batches over the network to the second computer through the network interface.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
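The final calculating and batching steps can be sketched as follows. The claims do not say how the "optimum batch size" is computed, so the heuristic below (bytes the link can carry within a target time window, capped at a maximum) is purely an assumption, as are the function names, parameters, and the greedy in-order packing.

```python
def optimum_batch_size(bandwidth_bytes_per_s, target_seconds, max_batch_bytes):
    """Assumed heuristic: bytes transferable within target_seconds, capped."""
    return min(int(bandwidth_bytes_per_s * target_seconds), max_batch_bytes)


def make_batches(sizes, batch_bytes):
    """Greedy in-order packing of artifact sizes into batches.

    An artifact larger than batch_bytes is placed in a batch of its own
    rather than being split (a simplification of this sketch).
    """
    batches, current, used = [], [], 0
    for size in sizes:
        if current and used + size > batch_bytes:
            batches.append(current)  # close the batch that would overflow
            current, used = [], 0
        current.append(size)
        used += size
    if current:
        batches.append(current)
    return batches


# Compressed artifact sizes in bytes, in priority order (illustrative values).
compressed_sizes = [400, 300, 500, 1200, 100]
limit = optimum_batch_size(
    bandwidth_bytes_per_s=500, target_seconds=2.0, max_batch_bytes=4096
)
batches = make_batches(compressed_sizes, limit)
print(limit, batches)
```

Packing in order keeps higher-priority artifacts in earlier batches, at the cost of leaving some batches under-filled; a bin-packing variant would fill batches more tightly but could reorder the artifacts.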
Specification