×

Data stream ingestion and persistence techniques

  • US 9,858,322 B2
  • Filed: 11/11/2013
  • Issued: 01/02/2018
  • Est. Priority Date: 11/11/2013
  • Status: Expired due to Fees
First Claim
Patent Images

1. A system, comprising:

  • one or more computing devices comprising one or more processors and memory and configured to;

    implement a first set of programmatic interfaces enabling a client of a multi-tenant stream management service to select, for a particular data stream, a data ingestion policy from among a plurality of data ingestion policies, wherein the plurality of data ingestion policies includes an at-least-once ingestion policy in accordance with which a record submitter transmits an indication of a data record one or more times to the stream management service until a positive acknowledgement is received;

    implement a second set of programmatic interfaces enabling the client to select, for the particular data stream, a data persistence policy from among a plurality of data persistence policies, wherein the plurality of data persistence policies comprises a multiple-replica persistence policy, in accordance with which multiple copies of the data record are to be stored at respective storage locations by the stream management service;

    receive, at the stream management service via respective programmatic interfaces of the first and second set, a first indication that the client has selected the at-least-once ingestion policy for the particular data stream and a second indication that the client has selected the multiple-replica persistence policy for the particular data stream;

    determine a number of data ingestion nodes or a number of data storage nodes to be configured for the particular data stream based at least in part on a partitioning policy in accordance with which a data ingestion node of the number of data ingestion nodes is selected to ingest data records of a particular partition of the particular data stream or a data storage node of the number of data storage nodes is selected to store data records of the particular partition of the particular data stream; and

    in response to a plurality of transmissions indicating a particular data record to the stream management service,send at least one positive acknowledgement corresponding to the plurality of transmissions in accordance with the at-least-once ingestion policy; and

    store, in response to a particular transmission of the plurality of transmissions, copies of the particular data record at a plurality of storage locations in accordance with the multiple-replica persistence policy.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×