Preventing staleness in query results when using asynchronously updated indexes

US 10,606,839 B2
Filed: 05/23/2017
Issued: 03/31/2020
Est. Priority Date: 10/27/2015
Status: Active Grant

First Claim

Patent Images

1. A computer implemented method, the method comprising:

receiving, by one or more processors, an asynchronously updated index corresponding to a main dataset in a database system;

receiving, by the one or more processors, time-sequenced log data of modifications made to the main dataset after a cutoff time of a last asynchronous index update, wherein the time-sequenced log data is read once by the database system for joining the main dataset with the time-sequenced log data and filtering out updated dataset entries and deleted dataset entries from the asynchronously updated index;

receiving, by the one or more processors, from an end user, a proximity-based query directed to the main dataset;

joining, by the one or more processors, the main dataset with the time-sequenced log data resulting in a first intermediate result comprising a first one or more entries of the main dataset made after the cutoff time;

processing, by the one or more processors, the proximity-based query to determine a second one or more entries satisfying the proximity-based query by emulating a function of the last asynchronous index update resulting in a second intermediate result, wherein the second intermediate result includes updated and deleted entries of a base table that are retrieved by the proximity-based query using an outdated asynchronously updated index, wherein the processing the proximity-based query further comprises receiving a staleness acceptability criterion; and

determining, based at least in part on the staleness acceptability criterion, that one or more query results are acceptable;

filtering out, by the one or more processors, the updated dataset entries from the asynchronously updated index using the time-sequenced log data to generate a lookup table as index table;

processing, by the one or more processors, the proximity-based query against the main dataset using the lookup table resulting in a third intermediate result; and

building, by the one or more processors, a union of the second intermediate result and the third intermediate result, to generate a final result of the proximity-based query.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method, computer program product, and computer system for optimizing query processing is provided. An asynchronously updated index is provided for a main dataset. A time-sequences log of data modifications to the main dataset is provided. A query of the main dataset is received. The main dataset is joined with the time-sequenced log data resulting in a first intermediate result. The query is processed by keeping one or more entries satisfying the query by emulating a function of the asynchronously updated index resulting in a second intermediate result. Updated, deleted dataset entries are deleted from the asynchronously updated index. The query is processed resulting in a third intermediate result. A union of the second intermediate result and third intermediate result is built defining a final result.

Citations

4 Claims

1. A computer implemented method, the method comprising:
- receiving, by one or more processors, an asynchronously updated index corresponding to a main dataset in a database system;
  
  receiving, by the one or more processors, time-sequenced log data of modifications made to the main dataset after a cutoff time of a last asynchronous index update, wherein the time-sequenced log data is read once by the database system for joining the main dataset with the time-sequenced log data and filtering out updated dataset entries and deleted dataset entries from the asynchronously updated index;
  
  receiving, by the one or more processors, from an end user, a proximity-based query directed to the main dataset;
  
  joining, by the one or more processors, the main dataset with the time-sequenced log data resulting in a first intermediate result comprising a first one or more entries of the main dataset made after the cutoff time;
  
  processing, by the one or more processors, the proximity-based query to determine a second one or more entries satisfying the proximity-based query by emulating a function of the last asynchronous index update resulting in a second intermediate result, wherein the second intermediate result includes updated and deleted entries of a base table that are retrieved by the proximity-based query using an outdated asynchronously updated index, wherein the processing the proximity-based query further comprises receiving a staleness acceptability criterion; and
  
  determining, based at least in part on the staleness acceptability criterion, that one or more query results are acceptable;
  
  filtering out, by the one or more processors, the updated dataset entries from the asynchronously updated index using the time-sequenced log data to generate a lookup table as index table;
  
  processing, by the one or more processors, the proximity-based query against the main dataset using the lookup table resulting in a third intermediate result; and
  
  building, by the one or more processors, a union of the second intermediate result and the third intermediate result, to generate a final result of the proximity-based query.
- View Dependent Claims (2, 3, 4)
- - 2. The method of claim 1, wherein the database system is a relational database system.
  - 3. The method of claim 2, wherein each dataset is a table of the relational database system.
  - 4. The method of claim 1, wherein the asynchronously updated index is selected from the group consisting of a text search index, and an image index.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
International Business Machines Corporation
Original Assignee
International Business Machines Corporation
Inventors
Behnen, Marion E., Klauke, Joern, Seifert, Jens P., Zuzarte, Calisto P.
Primary Examiner(s)
Truong, Cam Y T

Application Number

US15/602,159
Publication Number

US 20170255677A1
Time in Patent Office

1,043 Days
Field of Search

707714
US Class Current
CPC Class Codes

G06F 16/2379   Updates performed during on...

G06F 16/2455   Query execution

G06F 16/24558   Binary matching operations

G06F 16/2456   Join operations

G06F 16/284   Relational databases

Preventing staleness in query results when using asynchronously updated indexes

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

4 Claims

Specification

Solutions

Use Cases

Quick Links

Preventing staleness in query results when using asynchronously updated indexes

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

4 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links