×

Extraction device, data processing system, and extraction method

  • US 10,089,370 B2
  • Filed: 06/23/2015
  • Issued: 10/02/2018
  • Est. Priority Date: 12/13/2013
  • Status: Active Grant
First Claim
Patent Images

1. An extraction method for extracting, as a conversion object, a sub query to be converted to a program for processing stream data continuously inputted to a database, from a query including one or more instructions as sub queries to be issued to a database management system for managing the database, the method comprising:

  • receiving,an input query having one or more sub queries,a maximum memory increase value to indicate an amount of memory by which memory usage may increase when processing the streamed data, anda lower limit value of efficiency to indicate a ratio of reduced processing time to increased memory usage, the reduced processing time indicating a difference between a first time to process the stream data using the program and a second time to process the stream data using the database management system, the increased memory usage indicating an amount of memory by which memory usage increases due to processing the stream data using the program compared to processing the stream data using the database management system;

    calculating, for each sub query in the input query,at least one memory increase amount corresponding to a change in memory usage when the sub query is converted to the program and the program is used to process the stream data,a processing time reduction corresponding to a difference in a time to process the stream data using the program and a time to process the stream data using the database management system, andan efficiency by using the at least one calculated memory increase amount and the processing time reduction;

    selecting at least one sub query having a calculated efficiency equal to or higher than the lower limit value;

    integrating a memory increase amount calculated for the selected sub query;

    extracting the selected sub query as a conversion object on condition that the integrated memory increase amount is equal to or smaller than the maximum memory increase amount, wherein the selected sub queries are extracted in descending order of the calculated efficiency;

    parsing the input query and generating a graph expressed by a tree structure having a set of one or more nodes, wherein each sub query is a node in the set of one or more nodes;

    determining whether each node in the set of one or more nodes is a first type of node which executes pipeline processing by processing input data and outputting a processing result or a second type of node which does not execute pipeline processing based on a result of parsing the input query;

    classifying nodes in the set of one or more nodes into one or more first node groups and one or more second node groups, wherein nodes in the first node groups are hierarchically connected from a root node and include only nodes from the set of one or more nodes which are the first type of node and nodes in the one or more second node groups include remaining nodes from the set of one or more nodes; and

    selecting first nodes corresponding to sub queries whose efficiencies are equal to or higher than the lower limit value from nodes classified into the first node groups and extracting the first nodes as conversion objects, wherein the first nodes are extracted in descending order of the calculated efficiency.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×