Automatic generation of composite datasets based on hierarchical fields
First Claim
1. A method, comprising:
- at one or more computing devices comprising one or more processors and memory storing one or more computer programs executed by the one or more processors to perform the method, performing the operations of;
storing a plurality of datasets with metadata, wherein the plurality of datasets includes a plurality of fields, wherein the metadata includes a plurality of categories in which each category corresponds to one or more fields of the plurality of fields;
generating a hierarchy mapping that indicates a hierarchical relationship between different categories of the plurality of categories;
receiving a natural language query that specifies a first granularity level indicating a particular category of the plurality of categories, and one or more field values that correspond to the particular category;
identifying, based on the hierarchy mapping, one or more categories of the plurality of categories that are hierarchically related to the particular category and indicate one or more granularity levels that are hierarchically related to the first granularity level;
selecting, based on the metadata, two or more datasets of the plurality of datasets that include at least one category of the one or more categories;
translating at least one dataset of the two or more datasets to the first granularity level;
joining the two or more datasets to generate a composite dataset that includes the particular category;
executing the natural language query against the composite dataset to return a result for the natural language query.
1 Assignment
0 Petitions
Accused Products
Abstract
Datasets are annotated with metadata including categories. Each category corresponds to one or more fields. A hierarchy mapping is generated to indicate a hierarchical relationship between different categories. A natural language query specifies a first granularity level indicating a particular category and one or more field values corresponding to the particular category. Based on the hierarchy mapping, one or more categories that are hierarchically related to the particular category are identified. Based on the metadata, two or more datasets that include at least one hierarchically related category is selected. Based on the first granularity level, one or more dataset filters are generated. The one or more dataset filters are translated to a second granularity level corresponding to the at least one hierarchically related category. The translated filters are applied to at least one of the selected datasets. The two or more datasets are joined to generate a composite dataset.
811 Citations
20 Claims
-
1. A method, comprising:
-
at one or more computing devices comprising one or more processors and memory storing one or more computer programs executed by the one or more processors to perform the method, performing the operations of; storing a plurality of datasets with metadata, wherein the plurality of datasets includes a plurality of fields, wherein the metadata includes a plurality of categories in which each category corresponds to one or more fields of the plurality of fields; generating a hierarchy mapping that indicates a hierarchical relationship between different categories of the plurality of categories; receiving a natural language query that specifies a first granularity level indicating a particular category of the plurality of categories, and one or more field values that correspond to the particular category; identifying, based on the hierarchy mapping, one or more categories of the plurality of categories that are hierarchically related to the particular category and indicate one or more granularity levels that are hierarchically related to the first granularity level; selecting, based on the metadata, two or more datasets of the plurality of datasets that include at least one category of the one or more categories; translating at least one dataset of the two or more datasets to the first granularity level; joining the two or more datasets to generate a composite dataset that includes the particular category; executing the natural language query against the composite dataset to return a result for the natural language query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system, comprising:
-
storage media; one or more processors; and one or more programs stored in the storage media and configured for execution by the one or more processors, the one or more programs comprising instructions for; storing a plurality of datasets with metadata, wherein the plurality of datasets includes a plurality of fields, wherein the metadata includes a plurality of categories in which each category corresponds to one or more fields of the plurality of fields; generating a hierarchy mapping that indicates a hierarchical relationship between different categories of the plurality of categories; receiving a natural language query that specifies a first granularity level indicating a particular category of the plurality of categories, and one or more field values that correspond to the particular category; identifying, based on the hierarchy mapping, one or more categories of the plurality of categories that are hierarchically related to the particular category and indicate one or more granularity levels that are hierarchically related to the first granularity level; selecting, based on the metadata, two or more datasets of the plurality of datasets that include at least one category of the one or more categories; translating at least one dataset of the two or more datasets to the first granularity level; joining the two or more datasets to generate a composite dataset that includes the particular category; executing the natural language query against the composite dataset to return a result for the natural language query. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification