×

Search-based detection, link, and acquisition of data

  • US 10,318,527 B2
  • Filed: 01/27/2015
  • Issued: 06/11/2019
  • Est. Priority Date: 01/27/2015
  • Status: Active Grant
First Claim
Patent Images

1. A computer system comprising:

  • one or more processors, one or more computer-readable memories, and one or more computer readable, tangible storage devices;

    program instructions, stored on at least one of the one or more storage devices for execution by at least one of the one or more processors via at least one of the one or more memories, to receive a natural language query;

    program instructions, stored on at least one of the one or more storage devices for execution by at least one of the one or more processors via at least one of the one or more memories, to determine, based on the natural language query, a set of search criteria;

    program instructions, stored on at least one of the one or more storage devices for execution by at least one of the one or more processors via at least one of the one or more memories, to;

    determine a respective coverage score for each of a plurality of data sets, wherein the plurality of data sets is stored in one or more data sources, anddetermine a respective coverage score for each of one or more combinations of two or more data sets, wherein the respective coverage score for each of the one or more combinations of two or more data sets is separate from the coverage score for each of the plurality of data sets,wherein the respective coverage score for each of the one or more combinations of two or more data sets is based at least in part on matches between the set of search criteria and attributes of the plurality of data sets and of the one or more combinations of two or more data sets, wherein the respective coverage score for a respective one of the plurality of data sets corresponds to a percentage of the natural language query that is bound to one or more attributes of the respective one of the plurality of data sets, and wherein the respective coverage score for a respective one of the one or more combinations of two or more data sets corresponds to the percentage of the natural language query that is bound to one or more attributes of the respective one of the one or more combinations of two or more data sets;

    program instructions, stored on at least one of the one or more storage devices for execution by at least one of the one or more processors via at least one of the one or more memories, to, in response to determining a combination of two or more data sets out of the one or more combinations of two or more data sets that are associated with the natural language query based at least in part on the respective coverage score of the combination of two or more data sets, build an on-the-fly data source query to join two or more data sets making up the combination of two or more data sets, including joining at least one of;

    two or more rows and two or more columns of the two or more data sets to produce a joint data set;

    program instructions, stored on at least one of the one or more storage devices for execution by at least one of the one or more processors via at least one of the one or more memories, to execute the on-the-fly data source query to retrieve the joint data set from the one or more data sources; and

    program instructions, stored on at least one of the one or more storage devices for execution by at least one of the one or more processors via at least one of the one or more memories, to output a representation of the joint data set for display at a display device.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×