IDENTIFYING A TOPIC-RELEVANT SUBJECT
Abstract
The present technology is related to identifying, from within a corpus of documents, a subject (e.g., person, location, date, etc.) that is relevant to a topic and that is usable to enhance a topic-describing document. Documents within the corpus of documents share a link structure, such that some documents include hyperlinks that enable navigation to the topic-describing document, and the topic-describing document includes hyperlinks that enable navigation to other documents. Text of documents within the corpus is parsed to identify the subject, and a context of the subject suggests a degree of relevance of the subject to the topic. An enhancement type of the subject is determined, and a version of the topic-describing document is enhanced to include a presentation of the subject.
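The link structure described above can be illustrated with a minimal sketch. It assumes the corpus is a plain mapping from document identifier to HTML and that hyperlinks use the bare identifier as their `href`; the names `LinkCollector`, `out_links`, and `link_sets` and the sample corpus are hypothetical and do not come from the patent.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href target of every <a> tag in one document."""

    def __init__(self):
        super().__init__()
        self.targets = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.targets.append(href)


def out_links(html):
    """Return the hyperlink targets embedded in one document."""
    collector = LinkCollector()
    collector.feed(html)
    return collector.targets


def link_sets(corpus, topic_id):
    """Return (in-link documents, out-link documents) for the topic document."""
    outgoing = [t for t in out_links(corpus[topic_id]) if t in corpus]
    incoming = [doc_id for doc_id, html in corpus.items()
                if doc_id != topic_id and topic_id in out_links(html)]
    return incoming, outgoing


# Hypothetical four-document corpus (assumption: href equals document id).
corpus = {
    "apollo_11": '<p>Crewed by <a href="neil_armstrong">Neil Armstrong</a>.</p>',
    "neil_armstrong": '<p>Commander of <a href="apollo_11">Apollo 11</a>.</p>',
    "moon": '<p>First visited during <a href="apollo_11">Apollo 11</a>.</p>',
    "mars": "<p>No crewed landings yet.</p>",
}
incoming, outgoing = link_sets(corpus, "apollo_11")
```

Here `incoming` holds the documents that link to the topic-describing document and `outgoing` holds the documents it links to; a document with no link in either direction, such as `mars`, falls outside both sets.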
20 Claims
1. One or more computer-readable media having computer-executable instructions embodied thereon that, when executed on a computing device, cause the computing device to perform a method of identifying, from within a corpus of documents, a subject that is relevant to a topic and that is usable to enhance a topic-describing document, the method comprising:
retrieving the topic-describing document and a linked document, wherein a hyperlink is embedded within text of the linked document that, when input, enables navigation to the topic-describing document;
parsing text of the linked document to identify the subject, wherein a context of the subject suggests a degree of relevance of the subject to the topic;
determining an enhancement type of the subject, wherein an enhancement type is a category of supplemental information that is usable to enhance the topic-describing document;
transforming a first version of the topic-describing document into an enhanced version of the topic-describing document, wherein the enhanced version includes a presentation of the subject; and
storing the enhanced version of the topic-describing document to be presented at runtime.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13.
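The steps of claim 1 can be sketched as a small pipeline. This is an illustration under stated assumptions, not the patented implementation: the hyperlink is approximated by a plain mention of the topic title, "context" is reduced to sharing a sentence with that mention, the enhancement type comes from a fixed hypothetical table (`ENHANCEMENT_TYPES`), and an in-memory dictionary stands in for storage.

```python
import re
from dataclasses import dataclass

# Hypothetical table mapping a subject to its enhancement type. A real system
# would presumably derive the type from the text rather than a fixed lookup.
ENHANCEMENT_TYPES = {"Neil Armstrong": "person", "Sea of Tranquility": "location"}


@dataclass
class Subject:
    name: str
    enhancement_type: str


def parse_subjects(linked_text, topic_title):
    """Identify known subjects that share a sentence with the topic mention."""
    found = []
    for sentence in re.split(r"(?<=[.!?])\s+", linked_text):
        if topic_title not in sentence:
            continue  # this context does not suggest relevance to the topic
        for name, kind in ENHANCEMENT_TYPES.items():
            if name in sentence:
                found.append(Subject(name, kind))
    return found


def enhance(first_version, subjects):
    """Transform the first version into a version that presents the subjects."""
    lines = [first_version, "", "Related:"]
    lines += [f"- {s.enhancement_type}: {s.name}" for s in subjects]
    return "\n".join(lines)


store = {}  # assumption: stands in for persistent storage read at runtime


def build_enhanced(topic_id, topic_text, linked_text, topic_title):
    """Parse, type, transform, and store in one pass."""
    subjects = parse_subjects(linked_text, topic_title)
    store[topic_id] = enhance(topic_text, subjects)
    return store[topic_id]


enhanced = build_enhanced(
    "apollo_11",
    "Apollo 11 was a crewed lunar mission.",
    "Neil Armstrong commanded Apollo 11. The Sea of Tranquility is a lunar mare.",
    "Apollo 11",
)
```

In this sample only "Neil Armstrong" is presented, because the sentence naming the Sea of Tranquility does not mention the topic. Building the enhanced version ahead of time and storing it mirrors the claim's final step, which keeps the parsing cost out of the request path.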
14. A system, which includes a processor and a computer readable medium, that identifies, from within a corpus of documents, a subject that is relevant to a topic and that is usable to enhance a topic-describing document, the system comprising:
a document retriever that retrieves a first set of documents, each of which includes a respective in-hyperlink that, when input, navigates to the topic-describing document, and a second set of documents, each of which is referenced by a respective out-hyperlink that is embedded in text of the topic-describing document;
a parser that parses text or HTML of the topic-describing document, text or HTML of the first set of documents, text or HTML of the second set of documents, or a combination thereof, to generate a set of potentially relevant subjects;
a subject evaluator that applies a rule to a respective context of each potentially relevant subject, wherein application of the rule identifies a relevant subject that is relevant to the topic;
an enhancement-type identifier that determines an enhancement type of the relevant subject, wherein an enhancement type is a category of supplemental information that is usable to enhance the topic-describing document; and
a document enhancer that transforms a first version of the topic-describing document into an enhanced version of the topic-describing document, which includes a presentation of the relevant subject.
Dependent claims: 15, 16, 17, 18, 19.
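The subject evaluator of claim 14 applies a rule to each candidate's context. The claim does not say what the rule is, so the sketch below assumes one possibility: a candidate is relevant when it sits within a fixed number of words of the hyperlink. The `Candidate` fields, the `within_distance` rule, and the sample values are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Candidate:
    subject: str
    source: str            # "topic", "in-link", or "out-link" document set
    distance_to_link: int  # words between the subject and the hyperlink


Rule = Callable[[Candidate], bool]


def within_distance(max_words: int) -> Rule:
    """Assumed rule: the subject appears close to the hyperlink."""
    return lambda candidate: candidate.distance_to_link <= max_words


def evaluate(candidates: Iterable[Candidate], rule: Rule) -> List[str]:
    """Return subjects whose context satisfies the rule, without duplicates."""
    relevant: List[str] = []
    for candidate in candidates:
        if rule(candidate) and candidate.subject not in relevant:
            relevant.append(candidate.subject)
    return relevant


# Hypothetical parser output drawn from the in-link set and the topic document.
candidates = [
    Candidate("Neil Armstrong", "in-link", 3),
    Candidate("Ohio", "in-link", 40),
    Candidate("Neil Armstrong", "topic", 1),
]
relevant = evaluate(candidates, within_distance(10))
```

Passing the rule as a callable keeps the evaluator independent of any one relevance test, so a different rule can be swapped in without changing `evaluate`.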
20. A computer-implemented method, which is executed using a processor and computer readable media, of identifying, from within a corpus of documents, a subject that is relevant to a topic and that is usable to enhance a topic-describing document, the method comprising:
retrieving the topic-describing document and a linked document, wherein a hyperlink is embedded within text of the linked document that, when input, enables navigation to the topic-describing document;
grammatically parsing text of the linked document to identify the subject, wherein a grammatical context of the subject suggests a degree of relevance of the subject to the topic;
identifying within text of the linked document a date-page hyperlink that forms at least part of the grammatical context of the subject and that navigates to a date-describing reference document, which suggests that the subject is of temporal significance;
transforming a first version of the topic-describing document into an enhanced version, which includes a timeline that indicates the temporal significance of the subject; and
storing the enhanced version of the topic-describing document to be presented at runtime.
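The date-page hyperlink and timeline steps of claim 20 can be sketched as follows. Two simplifications are assumed and should not be read as the claimed method: sentence-level co-occurrence stands in for grammatical parsing, and date-describing reference documents are assumed to live under a hypothetical `date/YYYY-MM-DD` path.

```python
import re
from datetime import date

# Assumption: date-describing reference pages use hrefs like "date/1969-07-20".
DATE_LINK = re.compile(r'<a href="date/(\d{4})-(\d{2})-(\d{2})">')
TAG = re.compile(r"<[^>]+>")


def timeline_entries(linked_html, subject):
    """Pair the subject with each date-page link found in the same sentence."""
    entries = []
    for sentence in re.split(r"(?<=\.)\s+", linked_html):
        plain = TAG.sub("", sentence).strip()
        if subject not in plain:
            continue  # the date link is not part of this subject's context
        for year, month, day in DATE_LINK.findall(sentence):
            entries.append((date(int(year), int(month), int(day)), plain))
    return sorted(entries)  # chronological order


def render_timeline(entries):
    """Format entries as one plain-text timeline line per dated event."""
    return "\n".join(f"{when.isoformat()}  {text}" for when, text in entries)


# Hypothetical linked document; the third sentence carries no date-page link.
linked_html = (
    'Neil Armstrong landed on the Moon on '
    '<a href="date/1969-07-20">July 20, 1969</a>. '
    'Neil Armstrong was born on <a href="date/1930-08-05">August 5, 1930</a>. '
    'Neil Armstrong commanded <a href="apollo_11">Apollo 11</a>.'
)
entries = timeline_entries(linked_html, "Neil Armstrong")
timeline = render_timeline(entries)
```

The resulting `timeline` string is the kind of presentation that could be appended to the first version of the topic-describing document before it is stored.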
Specification