Automated extension for generation of cross references in a knowledge base
First Claim
1. A method for generating cross-references among categories in a knowledge base, said method comprising the steps of:
- extracting, from a plurality of documents, a plurality of themes, wherein a theme identifies subject matter contained in a corresponding document;
generating a theme strength for said themes, said theme strength reflects the amount of subject matter contained in a document for a corresponding theme relative to other themes in said document;
generating a plurality of scores, from said theme strengths, to identify a relative theme pair strength for at least one pair of said themes extracted from said documents;
selecting theme pairs based on said scores;
selecting category pairs in said knowledge base by mapping said themes of said theme pairs selected to corresponding categories of said knowledge base; and
generating a cross reference in said knowledge base between categories of said category pairs, wherein said cross reference identifies an association between said category pairs.
2 Assignments
0 Petitions
Accused Products
Abstract
A technique for generating cross-references among categories in a knowledge base extracts a plurality of themes from a corpus of documents. A theme identifies subject matter contained in a corresponding document. A plurality of scores are generated such that each score identifies a relative theme strength among theme pairs of the themes extracted from the documents. In general, a theme strength reflects the amount of subject matter contained in a document for a corresponding theme relative to other themes in the document. Thereafter, the most related theme pairs are selected as indicated by the scores. Category pairs of the knowledge base are then selected by mapping the themes of the selected theme pairs to corresponding categories of the knowledge base. A cross-reference between categories of the category pairs in the knowledge base is generated so as to identify an association between the category pairs.
-
Citations
15 Claims
-
1. A method for generating cross-references among categories in a knowledge base, said method comprising the steps of:
-
extracting, from a plurality of documents, a plurality of themes, wherein a theme identifies subject matter contained in a corresponding document; generating a theme strength for said themes, said theme strength reflects the amount of subject matter contained in a document for a corresponding theme relative to other themes in said document; generating a plurality of scores, from said theme strengths, to identify a relative theme pair strength for at least one pair of said themes extracted from said documents; selecting theme pairs based on said scores; selecting category pairs in said knowledge base by mapping said themes of said theme pairs selected to corresponding categories of said knowledge base; and generating a cross reference in said knowledge base between categories of said category pairs, wherein said cross reference identifies an association between said category pairs. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A system comprising:
-
search and retrieval module for receiving a user query and for generating a query response including query feedback; a knowledge base, coupled to said search and retrieval module, for storing relationships among terminology for use as query feedback; a knowledge base processing system, coupled to said knowledge base for processing a plurality of documents and automatically extending said relationships among said terminology in said knowledge base, said knowledge base processing system for extracting, from said documents, a plurality of themes, wherein a theme identifies subject matter contained in a corresponding document, for generating a theme strength for said themes, said theme strength reflects the amount of subject matter contained in a document for a corresponding theme relative to other themes in said document, for generating a plurality of scores, from said theme strengths, to identify a relative theme pair strength for at least one pair of said themes extracted from said documents, for selecting theme pairs based on said scores, for selecting category pairs in said knowledge base by mapping said themes of said theme pairs selected to corresponding categories of said knowledge base, and for generating a cross reference in said knowledge base between categories of said category pairs, wherein said cross reference identifies an association between said category pairs. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A computer readable medium comprising a plurality of instructions, which when executed, causes the computer to perform the steps of:
-
extracting, from a plurality of documents, a plurality of themes, wherein a theme identifies subject matter contained in a corresponding document; generating a theme strength for said themes, said theme strength reflects the amount of subject matter contained in a document for a corresponding theme relative to other themes in said document; generating a plurality of scores, from said theme strengths, to identify a relative theme pair strength for at least one pair of said themes extracted from said documents; selecting theme pairs based on said scores; selecting category pairs in said knowledge base by mapping said themes of said theme pairs selected to corresponding categories of said knowledge base; and generating a cross reference in said knowledge base between categories of said category pairs, wherein said cross reference identifies an association between said category pairs. - View Dependent Claims (12, 13, 14, 15)
-
Specification