Enrichment of named entities in documents via contextual attribute ranking
First Claim
1. A method comprising plurality of acts that are executed by at least one processor in at least one computing device, the acts comprising:
- extracting text from a document;
detecting existence of a named entity in the text of the document;
generating annotated text responsive to detecting the existence of the named entity in the text of the document, the annotated text comprising an annotation that identifies the named entity in the text;
transmitting the annotated text to an entity classification system, the entity classification system assigning a class from a plurality of possible classes to the named entity based at least in part upon the annotated text;
receiving from the entity classification system data that identifies the class assigned to the named entity by the entity classification system;
transmitting the data that identifies the class to an attribute ranking system, the attribute ranking system determining a plurality of attributes that are relevant to the class;
receiving from the attribute ranking system the plurality of attributes that are relevant to the class;
responsive to receiving the plurality of attributes, retrieving values for the plurality of attributes for the named entity;
subsequent to retrieving the values for the plurality of attributes, causing the named entity to be highlighted in the document when the document is displayed on a display of a client computing device, the named entity highlighted to inform a user of the client computing device that additional data about the named entity is available, wherein the client computing device displays, on the display, the values for the plurality of attributes for the named entity responsive to detecting that the user has selected the named entity in the document.
2 Assignments
0 Petitions
Accused Products
Abstract
Technologies pertaining to retrieval of contextually relevant attribute values for an automatically identified named entity in a document are described herein. Named entity recognition technologies are employed to identify named entities in the text of a document. Context corresponding to an identified named entity is analyzed to probabilistically assign a class to the named entity. Attributes that are most relevant to the class are determined, and attribute values for such attributes are retrieved. The attribute values are presented in correlation with the named entity in the document responsive to user-selection of the named entity in the document.
-
Citations
20 Claims
-
1. A method comprising plurality of acts that are executed by at least one processor in at least one computing device, the acts comprising:
-
extracting text from a document; detecting existence of a named entity in the text of the document; generating annotated text responsive to detecting the existence of the named entity in the text of the document, the annotated text comprising an annotation that identifies the named entity in the text; transmitting the annotated text to an entity classification system, the entity classification system assigning a class from a plurality of possible classes to the named entity based at least in part upon the annotated text; receiving from the entity classification system data that identifies the class assigned to the named entity by the entity classification system; transmitting the data that identifies the class to an attribute ranking system, the attribute ranking system determining a plurality of attributes that are relevant to the class; receiving from the attribute ranking system the plurality of attributes that are relevant to the class; responsive to receiving the plurality of attributes, retrieving values for the plurality of attributes for the named entity; subsequent to retrieving the values for the plurality of attributes, causing the named entity to be highlighted in the document when the document is displayed on a display of a client computing device, the named entity highlighted to inform a user of the client computing device that additional data about the named entity is available, wherein the client computing device displays, on the display, the values for the plurality of attributes for the named entity responsive to detecting that the user has selected the named entity in the document. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system that facilitates provision of contextually relevant data to a user, the system comprising:
-
at least one processor; and memory comprising instructions that, when executed by the at least one processor, cause the at least one processor to perform acts comprising; receiving a document that comprises text, the text comprising a named entity; identifying the named entity in the text; responsive to identifying the named entity in the text and based upon the named entity, inferring a class to which the named entity belongs, the class being amongst a plurality of possible classes; responsive to inferring the class to which the named entity belongs transmitting the data that identifies the class to an attribute ranking system, the attribute ranking system determining attributes that are relevant to the class; receiving from the attribute ranking system the attributes that are relevant to the class; and retrieving values for the attributes for the named entity responsive to receiving the attributes;
wherein the values for the attributes for the named entity are displayed as a graphical overlay in the document responsive to a reviewer of the document selecting the named entity on a client computing device. - View Dependent Claims (13, 14, 15, 16, 17, 18, 20)
-
-
19. A computer-readable medium comprising instructions that, when executed by a processor, cause the processor to perform acts comprising:
-
identifying named entities in text of a web page; subsequent to identifying the named entities in the text, annotating the named entities in the text to form annotated text; transmitting the annotated text to a named entity classification system, wherein the named entity classification system probabilistically assigns classes to respective named entities in the annotated text; receiving class assignments for the respective named entities from the named entity classification system; for each unique class received from the named entity classification system, transmitting the respective unique class to an attribute ranking system, wherein the attribute ranking system outputs lists of attributes that are relevant to the respective classes; receiving from the attribute ranking system, for each unique class, attributes that are relevant to a respective unique class; retrieving attribute values for the received attributes for each named entity in the text of the web page, wherein the attribute values are retrieved from a data repository that comprises structured data, wherein each named entity in the web page has at least one attribute value associated therewith to inform a reviewer of the web page that additional information pertaining to the named entities is available, wherein attribute identities and corresponding values for a named entity are displayed in response to the named entity being selected in the web page.
-
Specification