Identifying conceptual gaps in a knowledge base
Abstract
A method and system for augmenting a corpus with documents on concepts not sufficiently covered within the corpus is provided. The augmentation system generates a corpus concept graph from the documents of a corpus. A corpus concept graph represents concepts of the documents as nodes and related concepts as links between nodes. To generate a corpus concept graph, the augmentation system identifies the concepts that are related within each document of the corpus and adds nodes and links to the corpus concept graph for related concepts. The augmentation system analyzes the corpus concept graph to determine whether the relatedness of concepts of the documents of the corpus is sufficient. If the relatedness of a pair of concepts is not sufficient, then the augmentation system attempts to identify documents not already in the corpus that are related to the concepts that are not sufficiently related.
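The flow described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: concept extraction is assumed to have happened already (each document is given as a set of concept names), relatedness is assumed to be the number of documents in which two concepts co-occur, and the names `build_corpus_concept_graph`, `find_gaps`, and `min_strength` are hypothetical.

```python
from collections import Counter
from itertools import combinations


def build_corpus_concept_graph(doc_concepts):
    """Build a corpus concept graph as a Counter of weighted links.

    doc_concepts: iterable of sets of concept names, one set per document.
    Each key is an alphabetically ordered concept pair (a link); each value
    is the number of documents containing both concepts (assumed measure).
    """
    graph = Counter()
    for concepts in doc_concepts:
        for a, b in combinations(sorted(set(concepts)), 2):
            graph[(a, b)] += 1
    return graph


def find_gaps(graph, concepts, min_strength=2):
    """Return the concept pairs whose link strength is below min_strength.

    min_strength is a hypothetical sufficiency threshold; the source does
    not say how sufficiency is decided.
    """
    gaps = []
    for a, b in combinations(sorted(concepts), 2):
        # A Counter returns 0 for a missing key, i.e. for unlinked concepts.
        if graph[(a, b)] < min_strength:
            gaps.append((a, b))
    return gaps


if __name__ == "__main__":
    docs = [
        {"graph", "node", "link"},
        {"graph", "node"},
        {"graph", "query"},
    ]
    graph = build_corpus_concept_graph(docs)
    all_concepts = set().union(*docs)
    for pair in find_gaps(graph, all_concepts):
        print("insufficiently related:", pair)
```

The pairs returned by `find_gaps` are the ones for which the augmentation system would go looking for documents outside the corpus. How those documents are found is not covered by this sketch.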
53 Citations
37 Claims
1. A method in a computing system with a processor for augmenting a corpus of documents, the method comprising:
- generating a corpus concept graph from the documents indicating connections between concepts of the documents of the corpus;
- analyzing the corpus concept graph to determine whether connectedness of concepts of the documents of the corpus is sufficient; and
- when the analysis indicates that the connectedness of some concepts is not sufficient, adding to the corpus documents relating to the concepts that do not have sufficient connectedness,
wherein the generating, analyzing, and adding are performed by the processor.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A method in a computing device with a processor for identifying gaps in a knowledge base, the method comprising:
- generating a concept graph representing concepts of the knowledge base that connects concepts that are related as indicated by the knowledge base;
- analyzing the concept graph to determine whether connectedness between the concepts of the knowledge base is sufficient; and
- indicating the concepts whose connectedness in the knowledge base is not sufficient,
wherein the generating, analyzing, and indicating are performed by the processor.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22
23. A method in a computing system with a processor for determining connectedness of concepts within a corpus of documents, the method comprising:
- generating document concept graphs for the documents of the corpus indicating connections and strength of connections between concepts within a document; and
- generating a corpus concept graph from the document concept graphs indicating connections and an aggregate strength of connections between concepts of the documents of the corpus,
wherein the connectedness of the concepts of the corpus is based on the aggregate strength of connections between concepts, and wherein the generating of the document concept graphs and the corpus concept graph are performed by the processor.
Dependent claims: 24, 25, 26
27. A method in a computing system with a processor for generating a query, the method comprising:
- providing a corpus concept graph indicating connections and strength of connections between concepts represented by the corpus concept graph;
- receiving a query having an input concept;
- identifying from the corpus concept graph a concept that is related to the input concept based on the connections and strength of the connections; and
- augmenting the query with the identified concept,
wherein the providing, receiving, identifying, and augmenting are performed by the processor.
Dependent claims: 28, 29, 30
31. A computing device with a processor and memory for augmenting a corpus of documents, comprising:
- a corpus store containing the corpus of documents;
- a component having computer-executable instructions that generate a corpus concept graph from the documents of the corpus, the corpus concept graph indicating connections between concepts of the documents of the corpus;
- a component having computer-executable instructions that determine whether connectedness of concepts of the documents of the corpus is sufficient based on analysis of the connections between concepts of the documents of the corpus as indicated by the corpus concept graph; and
- a component having computer-executable instructions that, when it is determined that the connectedness of some concepts is not sufficient, adds to the corpus documents relating to the concepts that do not have sufficient connectedness,
wherein the computer-executable instructions of the component are stored in the memory for execution by the processor.
Dependent claims: 32, 33, 34
35. A computer-readable storage medium containing computer-executable instructions for identifying gaps in a knowledge base, by a method comprising:
- generating a concept graph representing concepts of the knowledge base that connects concepts that are related as indicated by the knowledge base;
- determining whether connectedness between the concepts of the knowledge base is sufficient to adequately represent the concepts of the knowledge base; and
- indicating the concepts whose connectedness in the knowledge base is not sufficient and thereby identifying gaps in the knowledge base,
wherein the instructions are executable by a processor of a computer.
Dependent claims: 36, 37
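As an illustration of the steps recited in claims 23 and 27, the following sketch aggregates per-document concept graphs into a corpus concept graph and then uses link strength to augment a query. Everything specific here is an assumption made for the example: strength within a document is taken to be the number of sentences in which two concepts co-occur, the aggregate strength is taken to be a simple sum, and the names `document_concept_graph`, `corpus_concept_graph`, `augment_query`, and `top_k` are hypothetical. The claims themselves do not fix any of these choices.

```python
from collections import Counter
from itertools import combinations


def document_concept_graph(sentences):
    """Per-document graph: pair -> number of sentences containing both.

    sentences: list of sets of concept names, one set per sentence
    (an assumed input format; concept extraction is out of scope).
    """
    graph = Counter()
    for concepts in sentences:
        for pair in combinations(sorted(concepts), 2):
            graph[pair] += 1
    return graph


def corpus_concept_graph(doc_graphs):
    """Aggregate strength = sum of per-document strengths (assumption)."""
    total = Counter()
    for graph in doc_graphs:
        total.update(graph)  # Counter.update adds counts
    return total


def augment_query(query_concepts, corpus_graph, top_k=1):
    """Append the top_k concepts most strongly linked to the query concepts."""
    scores = Counter()
    for (a, b), strength in corpus_graph.items():
        if a in query_concepts and b not in query_concepts:
            scores[b] += strength
        elif b in query_concepts and a not in query_concepts:
            scores[a] += strength
    # Sort by descending strength, then by name so ties are deterministic.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return list(query_concepts) + [concept for concept, _ in ranked[:top_k]]


if __name__ == "__main__":
    doc1 = document_concept_graph([{"graph", "node"}, {"graph", "node", "link"}])
    doc2 = document_concept_graph([{"graph", "query"}, {"graph", "node"}])
    corpus = corpus_concept_graph([doc1, doc2])
    print(augment_query(["graph"], corpus))
```

Summing strengths keeps the aggregation step trivial to inspect. A normalized or weighted aggregate would fit the same structure, with only `corpus_concept_graph` changing.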
Specification