Identifying conceptual gaps in a knowledge base
First Claim
1. A method in a computing system for augmenting a corpus of documents, the method comprising:
- generating a corpus concept graph from the documents indicating connections between concepts of the documents of the corpus;
analyzing the corpus concept graph to determine whether connectedness of concepts of the documents of the corpus is sufficient; and
when the analysis indicates that the connectedness of some concepts is not sufficient, adding to the corpus documents relating to the concepts that do not have sufficient connectedness.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for augmenting a corpus with documents on concepts not sufficiently covered within the corpus is provided. The augmentation system generates a corpus concept graph from the documents of a corpus. A corpus concept graph represents concepts of the documents as nodes and related concepts as links between nodes. To generate a corpus concept graph, the augmentation system identifies the concepts that are related within each document of the corpus and adds nodes and links to the corpus concept graph for related concepts. The augmentation system analyzes the corpus concept graph to determine whether the relatedness of concepts of the documents of the corpus is sufficient. If the relatedness of a pair of concepts is not sufficient, then the augmentation system attempts to identify documents not already in the corpus that are related to the concepts that are not sufficiently related.
89 Citations
30 Claims
-
1. A method in a computing system for augmenting a corpus of documents, the method comprising:
-
generating a corpus concept graph from the documents indicating connections between concepts of the documents of the corpus;
analyzing the corpus concept graph to determine whether connectedness of concepts of the documents of the corpus is sufficient; and
when the analysis indicates that the connectedness of some concepts is not sufficient, adding to the corpus documents relating to the concepts that do not have sufficient connectedness. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method for identifying gaps in a knowledge base, the method comprising:
-
generating a concept graph representing concepts of the knowledge base that connects concepts that are related as indicated by the knowledge base;
analyzing the concept graph to determine whether connectedness between the concepts of the knowledge base is sufficient; and
indicating the concepts whose connectedness in the knowledge base is not sufficient. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A method in a computing system for determining connectedness of concepts within a corpus of documents, the method comprising:
-
generating document concept graphs for the documents of the corpus indicating connections and strength of connections between concepts within a document; and
generating a corpus concept graph from the document concept graphs indicating connections and an aggregate strength of connections between concepts of the documents of the corpus wherein the connectedness of the concepts of the corpus is based on the aggregate strength of connections between concepts. - View Dependent Claims (24, 25, 26)
-
-
27. A method in a computing system for generating a query, the method comprising:
-
providing a corpus concept graph indicating connections and strength of connections between concepts represented by the corpus concept graph;
receiving a query having an input concept;
identifying from the corpus concept graph a concept that is related to the input concept based on the connections and strength of the connections; and
augmenting the query with the identified concept. - View Dependent Claims (28, 29, 30)
-
Specification