×

Natural language querying of a data lake using contextualized knowledge bases

  • US 10,706,045 B1
  • Filed: 04/17/2019
  • Issued: 07/07/2020
  • Est. Priority Date: 02/11/2019
  • Status: Active Grant
First Claim
Patent Images

1. A method of querying a data lake using natural language, comprising the following steps:

  • receiving a natural language query directed to an electronic data lake;

    parsing the natural language query to determine a plurality of entities within the natural language query;

    identifying the plurality of entities using at least one contextual knowledge base, wherein the plurality of entities are tabulated in at least one of a plurality of data tables by entity type and compared against at least one entry in the at least one contextual knowledge base, wherein at least one phrase of the natural language query is combined, the plurality of entities are soft-matched, and at least one entity above a threshold confidence level is identified, and wherein a relationship table structure knowledge base provides a relationship between at least two of the plurality of entities by determining relational links at least two of the plurality of data tables;

    mapping a dependency relationship between the plurality of identified entities to determine relational parts of speech of the plurality of identified entities based on the parsed natural language query;

    constructing a structured data query based on the plurality of identified entities and the mapped dependency; and

    automatically generating a visual output of a result of the structured data query, wherein a format of the visual output is recommended by a visual recommender knowledge base based on at least;

    a number of columns or a number of rows of the result of the structured data query.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×