×

Filtering pipeline optimizations for unstructured data

  • US 10,650,032 B1
  • Filed: 04/13/2018
  • Issued: 05/12/2020
  • Est. Priority Date: 04/13/2018
  • Status: Active Grant
First Claim
Patent Images

1. A system, comprising:

  • one or more computing devices of an object storage service;

    wherein the one or more computing devices include instructions that upon execution on a processor cause the one or more computing devices to;

    store, in response to one or more programmatic requests, a plurality of unstructured data items of an item collection, including a first unstructured data item;

    determine a first query comprising one or more predicates to be used to filter data for inclusion in a response to a first access request directed to the item collection;

    based at least in part on an examination of the one or more predicates, identify a first character-level test that can be used to determine, without completing parsing of a particular record identified within the first unstructured data item, whether the particular record satisfies the first query;

    abandon parsing of a first record of the first unstructured data item in response to determining, using the first character-level test, that the first record does not satisfy the first query;

    determine, using at least a subset of records identified in the first unstructured data item, a response to the first access request, wherein the subset does not include the first record.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×