Process for constructing a semantic knowledge base using a document corpus
First Claim
1. A computer assisted method for creating a semantic knowledge base for a document corpus describing physical or tangible objects, consisting of unique sentential propositions, said method comprising the steps of:
(a) segmenting at least a portion of a document corpus into individual sentences using a computer processor; and
(b) reducing each said sentence to one or more simple or complex sentences; and
(c) displaying candidate propositions to a domain expert for each said simple or complex sentence using a computer processor; and
(d) said domain expert creating a unique sentential proposition to represent the entire meaning for each said simple or complex sentence whose meaning is not represented in said knowledge base using a knowledge editor; and
(e) adding said sentential proposition(s) using a computer processor to said knowledge base, wherein a single sentential proposition represents the entire meaning of a simple or complex sentence.
Abstract
Related free-text documents, a corpus, are used to empirically derive a semantic knowledge base through a method in which documents are segmented into unique sentences, and then used to define sentential propositions which are arranged in a knowledge hierarchy. The method takes compound natural language sentences and transforms them to simple sentences by a process that is a part of the invention. A knowledge editor enables a domain expert using the methods of the invention to map the sentences in the corpus to sentential proposition(s). The resulting knowledge base can be used to semantically analyze documents in data mining and decision support applications, and can assist word processors or speech recognition devices. The invention is illustrated in connection with radiology reports, but it has wide applicability.
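The segmentation and knowledge hierarchy described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the punctuation-based splitter, the names `segment_sentences`, `unique_sentences`, and `KnowledgeBase`, the parent-pointer representation of the hierarchy, and the sample radiology sentences are all assumptions made for illustration.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, List, Optional


def segment_sentences(text: str) -> List[str]:
    """Split text after '.', '!' or '?' followed by whitespace (a naive rule)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [part for part in parts if part]


def unique_sentences(corpus: List[str]) -> List[str]:
    """Return each distinct sentence once, in order of first appearance."""
    seen = set()
    ordered = []
    for document in corpus:
        for sentence in segment_sentences(document):
            # Compare sentences ignoring case and repeated whitespace.
            key = " ".join(sentence.lower().split())
            if key not in seen:
                seen.add(key)
                ordered.append(sentence)
    return ordered


@dataclass
class KnowledgeBase:
    """Hypothetical store: each proposition maps to its parent (or None)."""
    parents: Dict[str, Optional[str]] = field(default_factory=dict)

    def add(self, proposition: str, parent: Optional[str] = None) -> bool:
        """Add a proposition if it is new; return False if already present."""
        if proposition in self.parents:
            return False
        if parent is not None and parent not in self.parents:
            raise KeyError(f"unknown parent proposition: {parent}")
        self.parents[proposition] = parent
        return True

    def ancestors(self, proposition: str) -> List[str]:
        """Walk up the hierarchy from a proposition to its root."""
        chain = []
        parent = self.parents[proposition]
        while parent is not None:
            chain.append(parent)
            parent = self.parents[parent]
        return chain


# Illustrative corpus (invented sentences, not taken from the patent).
corpus = [
    "The heart is normal. The lungs are clear.",
    "The lungs are clear. No pleural effusion.",
]
sentences = unique_sentences(corpus)

kb = KnowledgeBase()
kb.add("lungs normal")
kb.add("lungs clear", parent="lungs normal")
```

In this sketch the duplicate sentence across the two reports is kept only once, and the `add` method refuses to store a proposition that already exists, which mirrors the requirement that propositions in the knowledge base be unique.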
39 Citations
14 Claims
1. A computer assisted method for creating a semantic knowledge base for a document corpus describing physical or tangible objects, consisting of unique sentential propositions, said method comprising the steps of:
(a) segmenting at least a portion of a document corpus into individual sentences using a computer processor; and
(b) reducing each said sentence to one or more simple or complex sentences; and
(c) displaying candidate propositions to a domain expert for each said simple or complex sentence using a computer processor; and
(d) said domain expert creating a unique sentential proposition to represent the entire meaning for each said simple or complex sentence whose meaning is not represented in said knowledge base using a knowledge editor; and
(e) adding said sentential proposition(s) using a computer processor to said knowledge base, wherein a single sentential proposition represents the entire meaning of a simple or complex sentence.
Dependent Claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A computer assisted method for creating a semantic mapping table consisting of associations between a sentence from a corpus and sentential propositions in a knowledge base describing physical or tangible objects, comprising the steps of:
a. reducing each said sentence to one or more said simple or complex sentences; and
b. for each said simple or complex sentence using a computer processor to display candidate propositions which may represent the entire meaning of said simple or complex sentence to a domain expert; and
c. said domain expert using a knowledge editor to associate one said candidate sentential proposition for each said simple or complex sentence; and
d. storing said associations using a computer processor in said mapping table, wherein a single sentential proposition represents the entire meaning of a simple or complex sentence.
Dependent Claims: 11, 12, 13, 14
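The mapping-table steps of claim 10 can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the method as claimed: the claim does not say how candidate propositions are selected, so word overlap is used here purely as a stand-in, and the domain expert's choice in the knowledge editor is simulated by a caller-supplied `choose` function. The names `candidate_propositions` and `build_mapping_table` and the sample sentences are hypothetical.

```python
import re
from typing import Callable, Dict, List


def words(text: str) -> set:
    """Lowercased alphabetic tokens of a text."""
    return set(re.findall(r"[a-z]+", text.lower()))


def candidate_propositions(sentence: str, propositions: List[str],
                           limit: int = 3) -> List[str]:
    """Rank propositions by shared words (assumed heuristic, not from the claim)."""
    sentence_words = words(sentence)
    scored = []
    for proposition in propositions:
        overlap = len(sentence_words & words(proposition))
        if overlap:
            scored.append((overlap, proposition))
    # Highest overlap first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [proposition for _, proposition in scored[:limit]]


def build_mapping_table(
    sentences: List[str],
    propositions: List[str],
    choose: Callable[[str, List[str]], str],
) -> Dict[str, str]:
    """Associate each sentence with exactly one expert-chosen proposition."""
    table: Dict[str, str] = {}
    for sentence in sentences:
        candidates = candidate_propositions(sentence, propositions)
        if not candidates:
            continue  # nothing to display; left unmapped in this sketch
        chosen = choose(sentence, candidates)  # stands in for the domain expert
        if chosen not in candidates:
            raise ValueError(f"choice is not a displayed candidate: {chosen}")
        table[sentence] = chosen
    return table


# Illustrative data (invented, not taken from the patent).
propositions = ["lungs clear", "heart normal size", "no pleural effusion"]
sentences = ["The lungs are clear.", "The heart is normal in size."]

# Simulated expert who always accepts the top-ranked candidate.
mapping_table = build_mapping_table(sentences, propositions,
                                    lambda sentence, candidates: candidates[0])
```

Storing one proposition per sentence key reflects the claim's requirement that a single sentential proposition represents the entire meaning of a simple or complex sentence.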
Specification