×

Optimizing data processing across server clusters and data centers using checkpoint-based data replication

  • US 10,296,425 B2
  • Filed: 04/20/2017
  • Issued: 05/21/2019
  • Est. Priority Date: 04/20/2017
  • Status: Active Grant
First Claim
Patent Images

1. A computing platform, comprising:

  • at least one processor;

    a communication interface communicatively coupled to the at least one processor; and

    memory storing computer-readable instructions that, when executed by the at least one processor, cause the computing platform to;

    determine to initiate a data processing job associated with identifying one or more features of a source dataset, the data processing job comprising multiple processing steps;

    based on determining to initiate the data processing job, generate one or more first commands directing one or more first cluster server nodes associated with a first data center to execute the multiple processing steps associated with the data processing job to identify the one or more features of the source dataset, the one or more first commands further directing the one or more first cluster server nodes associated with the first data center to update a checkpoint table as each processing step of the multiple processing steps associated with the data processing job is completed, and the one or more first commands further directing the one or more first cluster server nodes associated with the first data center to replicate processing results data to at least one other data center different from the first data center as each processing step of the multiple processing steps associated with the data processing job is completed; and

    send, via the communication interface, to the one or more first cluster server nodes associated with the first data center, the one or more first commands.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×