Probabilistic system for natural language processing
First Claim
Patent Images
1. A method for encoding free-text data, comprising:
- (A) receiving free-text data, wherein said free-text data includes;
words, a grammar, a syntax and a semantic relationship between said words;
(B) checking for synonyms of said words within said received free-text data;
(C) checking spelling of said words within said received free-text data;
(D) parsing said syntax of said received free-text data;
(E) transforming said grammar of said received free-text data;
(F) inferring concepts from said received free-text data, using a probabilistic system, wherein said probabilistic system further comprises a Bayesian network for managing one or more probabilistic calculations for use in slotting said words of said free-text data for translation to said inferred concept;
(G) creating an encoded representation of said received free-text data; and
(H) writing said encoded representation into a database.
3 Assignments
0 Petitions
Accused Products
Abstract
A natural language understanding system is described to provide generation of concept codes from free-text medical data. A probabilistic model of lexical semantics, is implemented by means of a Bayesian network, and is used to determine the most probable concept or meaning associated with a sentence or phrase. The inventive method and system includes the steps of checking for synonyms, checking spelling, performing syntactic parsing, transforming text to its “deep” or semantic form, and performing a semantic analysis based on a probabilistic model of lexical semantics.
66 Citations
20 Claims
-
1. A method for encoding free-text data, comprising:
-
(A) receiving free-text data, wherein said free-text data includes;
words, a grammar, a syntax and a semantic relationship between said words;
(B) checking for synonyms of said words within said received free-text data;
(C) checking spelling of said words within said received free-text data;
(D) parsing said syntax of said received free-text data;
(E) transforming said grammar of said received free-text data;
(F) inferring concepts from said received free-text data, using a probabilistic system, wherein said probabilistic system further comprises a Bayesian network for managing one or more probabilistic calculations for use in slotting said words of said free-text data for translation to said inferred concept;
(G) creating an encoded representation of said received free-text data; and
(H) writing said encoded representation into a database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 18, 19)
-
-
13. A method for providing encoded medical information from free-text data, operating on a computer system, including:
-
a digital computer processor executing the steps of the method;
a mass storage device connected to said digital computer processor for storing the data being worked on by the method;
an input device, electrically connected to said digital computer processor, for receiving data to be worked on by the method;
a preservation storage device electrically connected to said digital computer processor, to store resulting coded data;
the method comprising;
(A) receiving free-text data, wherein said free-text data includes;
words, a grammar, a syntax, and a semantic relationship between said words;
(B) checking for synonyms of said words within said received free-text data;
(C) checking spelling of said words within said received free-text data;
(D) parsing said syntax of said received free-text data;
(E) transforming said grammar of said received free-text data;
(F) analyzing said semantic relationship of said received free-text data, wherein said analysis is based on a probabilistic model of lexical semantics, wherein said probabilistic model relates said words to one or more concepts, wherein said words are appropriate for translation into a concept;
(G) creating an encoded representation of said received free-text data; and
(H) writing said encoded representation into a database.
-
-
20. A system for encoding free-text information, comprising:
-
(A) an input device for receiving free-text information;
(B) a processor electrically connected to said input device for processing said received free-text information, wherein said processing further comprises probabilistically calculating a relationship between said received free-text information and one or more concepts and wherein said probabilistic calculation further comprises a Bayesian network;
(C) a digital storage device electrically connected to said processor;
(D) a means for encoding said received free-text information employing said processor; and
(E) a means for storing said encoded free-text information on said digital storage device.
-
Specification