Process for Constructing a Semantic Knowledge Base Using a Document Corpus
First Claim
1. A computer assisted method for creating a semantic knowledge base for a document corpus describing physical or tangible objects, consisting of unique sentential propositions, such that a single sentential proposition represents the entire meaning of a simple or complex sentence, said method comprising the steps of:
- (a) segmenting at least a portion of a document corpus into individual sentences using a computer processor; and
(b) reducing each said sentence to one or more simple or complex sentences; and
(c) displaying candidate propositions to a domain expert for each said simple or complex sentence using a computer processor; and
(d) said domain expert creating a unique sentential proposition to represent the entire meaning for each said simple or complex sentence whose meaning is not represented in said knowledge base using a knowledge editor; and
(e) adding said sentential proposition(s) using a computer processor to said knowledge base.
0 Assignments
0 Petitions
Accused Products
Abstract
Related free-text documents, a corpus, are used to empirically derive a semantic knowledge base through a method in which documents are segmented into unique sentences, and then used to define sentential propositions which are arranged in a knowledge hierarchy. The method takes compound natural language sentences and transforms them to simple sentences by a process that is a part of the invention. A knowledge editor enables a domain expert using the methods of the invention to map the sentences in the corpus to sentential proposition(s). The resulting knowledge base can be used to semantically analyze documents in data mining and decision support applications, and can assist word processors or speech recognition devices. The invention is illustrated in connection with radiology reports, but it has wide applicability.
103 Citations
23 Claims
-
1. A computer assisted method for creating a semantic knowledge base for a document corpus describing physical or tangible objects, consisting of unique sentential propositions, such that a single sentential proposition represents the entire meaning of a simple or complex sentence, said method comprising the steps of:
-
(a) segmenting at least a portion of a document corpus into individual sentences using a computer processor; and (b) reducing each said sentence to one or more simple or complex sentences; and (c) displaying candidate propositions to a domain expert for each said simple or complex sentence using a computer processor; and (d) said domain expert creating a unique sentential proposition to represent the entire meaning for each said simple or complex sentence whose meaning is not represented in said knowledge base using a knowledge editor; and (e) adding said sentential proposition(s) using a computer processor to said knowledge base. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer assisted method for creating a computer database mapping table consisting of associations between a sentence from a corpus and sentential propositions in a knowledge base derived from said corpus, each sentential proposition representing the meaning of all semantically equivalent simple sentences, comprising the steps of:
-
a. reducing each said sentence to one or more simple or complex sentences; and b. for each said simple or complex sentence using a computer processor to display candidate propositions which may represent the entire meaning of said simple or complex sentence to a domain expert; and c. said domain expert using a knowledge editor to associate one said candidate sentential proposition for each said simple or complex sentence; and
,d. storing said associations using a computer processor in said mapping table. - View Dependent Claims (11, 12, 13, 14)
-
-
15. A knowledge editor operating a computer processor used by a domain expert comprising:
-
A user interface, and means for selecting a sentence from a corpus, means for selecting one or more sentential propositions from a knowledge base, means for adding association(s) between said sentence and said sentential proposition(s), such that said association(s) represent the entire meaning of said sentence. - View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23)
-
Specification