SEMANTIC MATCHING AND ANNOTATION OF ATTRIBUTES
First Claim
1. A system comprising:
- one or more processing units; and
memory storing instructions that when executed by the one or more processing units, configure the one or more processing units to perform operations comprising;
accepting a query including;
an entity identifier, a keyword, at least one of a unit identifier or a scale identifier, and a time identifier;
acquiring data from a plurality of tables;
identifying at least one existing semantic label of at least a first of the plurality of tables from the data;
identifying that data from a least a second of the plurality of tables includes the entity identifier and the keyword and lacks at least one of the unit identifier, the scale identifier, or the time identifier;
identifying at least one conversion rule associated with the at least one existing semantic label;
normalizing the data from the plurality of tables, based at least in part on the query and the at least one conversion rule; and
outputting a result to the query normalized to show the data, including data from the second of the plurality of tables, in consistent units, scale, and time.
4 Assignments
0 Petitions
Accused Products
Abstract
Techniques and constructs to facilitate semantic matching and automated annotation (SMA) of attributes can take entity names and a keyword describing an attribute associated with the named entities as input and leverage a corpus of data such as data from tables, which can include HTML web tables, to automatically populate values associated with the named entities for the attribute. The constructs enable accurate SMA of attributes, such as attributes that relate to the entity and include numeric values in a different unit than the query, in a different scale than the query, and/or reflecting a time different from that of the query. An entity augmentation application programming interface (API) may be used to accept queries that include numeric criteria, parameters, or arguments, including query attributes represented by numeric values, which may be in different units or scales, and attributes represented by numeric values that can vary by time.
64 Citations
25 Claims
-
1. A system comprising:
-
one or more processing units; and memory storing instructions that when executed by the one or more processing units, configure the one or more processing units to perform operations comprising; accepting a query including;
an entity identifier, a keyword, at least one of a unit identifier or a scale identifier, and a time identifier;acquiring data from a plurality of tables; identifying at least one existing semantic label of at least a first of the plurality of tables from the data; identifying that data from a least a second of the plurality of tables includes the entity identifier and the keyword and lacks at least one of the unit identifier, the scale identifier, or the time identifier; identifying at least one conversion rule associated with the at least one existing semantic label; normalizing the data from the plurality of tables, based at least in part on the query and the at least one conversion rule; and outputting a result to the query normalized to show the data, including data from the second of the plurality of tables, in consistent units, scale, and time. - View Dependent Claims (2, 3, 4)
-
-
5. A method comprising:
-
extracting a plurality of tables via a computing device; distinguishing, from the plurality of tables, relational tables from at least one other type of table building a semantic graph based at least in part on a label from at least one of the relational tables; and building at least a first index on the relational tables and the semantic graph. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. One or more computer-readable media having computer executable instructions recorded thereon, the computer-executable instructions, upon execution, to configure a computing device to perform semantic matching and annotation of numeric attribute operations comprising:
-
receiving an entity augmentation query including a name of an entity, a keyword associated with the entity, time information associated with the keyword, and at least one of unit information associated with the keyword or scale information associated with the keyword; processing the entity augmentation query based at least in part on existing conversion rules or graphs; and exposing results for the entity augmentation query. - View Dependent Claims (21, 22, 23, 24, 25)
-
Specification