SYSTEM AND METHOD FOR LANGUAGE EXTRACTION AND ENCODING
First Claim
1. A method for extracting information from medical or natural-language input text, comprising:
receiving medical or natural-language input text;
utilizing a lexicon knowledge base to identify and categorize multi-word and single word phrases within sentences of the input text;
parsing said input text to determine the grammatical structure of the text data, said parsing step comprising the step of referring to a domain parameter having a value indicative of a domain from which the text data originated, the domain parameter corresponding to one or more rules of grammar within a knowledge base related to the domain to be applied for parsing the text data;
regularizing the parsed text data to form structured word terms; and
tagging the text data with a structured data component derived from the structured word terms.
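The steps recited in claim 1 can be illustrated with a minimal sketch. This is not the patented implementation: the lexicon entries, the canonical-term table, the domain names ("radiology", "discharge"), the grammar rules, and every function name below are hypothetical, chosen only to show how a domain parameter could select the grammar rules applied during parsing. For brevity the sketch also ignores sentence boundaries.

```python
import re

# Hypothetical lexicon knowledge base: phrase -> semantic category.
LEXICON = {
    "shortness of breath": "finding",
    "chest pain": "finding",
    "opacity": "finding",
    "no": "negation",
    "left": "region",
    "lung": "bodyloc",
}
MAX_PHRASE_LEN = max(len(p.split()) for p in LEXICON)

# Hypothetical regularization table: surface phrase -> canonical term.
CANONICAL = {"shortness of breath": "dyspnea"}

# Hypothetical grammar rules per domain: each rule is a category sequence
# that the parser accepts as one unit. The domain parameter picks the list.
DOMAIN_GRAMMAR = {
    "radiology": [("negation", "finding"), ("region", "bodyloc"), ("finding",)],
    "discharge": [("negation", "finding"), ("finding",)],
}


def lookup_phrases(text):
    """Greedy longest-match lookup of multi-word and single-word phrases."""
    tokens = re.findall(r"[a-z]+", text.lower())
    phrases, i = [], 0
    while i < len(tokens):
        for n in range(min(MAX_PHRASE_LEN, len(tokens) - i), 0, -1):
            candidate = " ".join(tokens[i:i + n])
            if candidate in LEXICON:
                phrases.append((candidate, LEXICON[candidate]))
                i += n
                break
        else:
            i += 1  # token not in the lexicon: skip it
    return phrases


def parse(phrases, domain):
    """Group categorized phrases using the rules selected by the domain."""
    rules = DOMAIN_GRAMMAR[domain]
    groups, i = [], 0
    while i < len(phrases):
        for rule in rules:
            window = phrases[i:i + len(rule)]
            if tuple(cat for _, cat in window) == rule:
                groups.append(window)
                i += len(rule)
                break
        else:
            i += 1  # no rule of this domain starts here
    return groups


def regularize(groups):
    """Turn each parsed group into a structured term with canonical words."""
    return [
        {cat: CANONICAL.get(phrase, phrase) for phrase, cat in group}
        for group in groups
    ]


def tag(text, terms):
    """Attach the structured component to the original text."""
    return {"text": text, "structured": terms}


def extract(text, domain):
    return tag(text, regularize(parse(lookup_phrases(text), domain)))
```

Calling `extract("No shortness of breath. Opacity in the left lung.", "radiology")` yields three structured terms, while the same text under the `"discharge"` domain yields only two, because that domain's rule list has no rule for a region followed by a body location.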
2 Assignments
0 Petitions
Abstract
Improved systems and methods for extracting information from medical and natural-language text data are disclosed.
52 Citations
11 Claims
1. A method for extracting information from medical or natural-language input text, comprising:
receiving medical or natural-language input text;
utilizing a lexicon knowledge base to identify and categorize multi-word and single word phrases within sentences of the input text;
parsing said input text to determine the grammatical structure of the text data, said parsing step comprising the step of referring to a domain parameter having a value indicative of a domain from which the text data originated, the domain parameter corresponding to one or more rules of grammar within a knowledge base related to the domain to be applied for parsing the text data;
regularizing the parsed text data to form structured word terms; and
tagging the text data with a structured data component derived from the structured word terms.
Dependent claims: 2, 3, 4, 5, 6
7. A system for extracting information from medical or natural-language input text, comprising:
a lexicon knowledge base to identify and categorize multi-word and single word phrases within sentences of the input text;
a processor receiving said medical or natural-language input text coupled to said lexicon knowledge base;
a boundary identifier, coupled to said processor and said lexicon knowledge base and receiving said medical or natural-language input text; and
a parser, coupled to said boundary identifier and receiving said input text to determine the grammatical structure of the text data.
Dependent claims: 8, 9, 10, 11
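The coupling of components recited in claim 7 can be sketched as follows. The claim does not state what kind of boundaries the boundary identifier finds or how it uses the lexicon; this sketch assumes sentence boundaries and assumes the lexicon supplies abbreviations whose trailing period does not end a sentence. All class names, method names, and lexicon entries are hypothetical, and the parser here only labels words with lexicon categories as a stand-in for determining grammatical structure.

```python
import re
from dataclasses import dataclass


@dataclass
class LexiconKnowledgeBase:
    categories: dict     # hypothetical entries: word -> category
    abbreviations: set   # hypothetical: tokens whose "." is not a sentence end

    def categorize(self, word):
        return self.categories.get(word.lower())


@dataclass
class BoundaryIdentifier:
    lexicon: LexiconKnowledgeBase  # coupled to the lexicon knowledge base

    def sentences(self, text):
        """Split text into sentences, consulting the lexicon for abbreviations."""
        found, current = [], []
        for token in text.split():
            current.append(token)
            ends = token.endswith((".", "?", "!"))
            if ends and token.lower() not in self.lexicon.abbreviations:
                found.append(" ".join(current))
                current = []
        if current:
            found.append(" ".join(current))
        return found


@dataclass
class Parser:
    boundaries: BoundaryIdentifier  # coupled to the boundary identifier

    def parse(self, text):
        """Label each word of each sentence with its lexicon category."""
        lexicon = self.boundaries.lexicon
        parsed = []
        for sentence in self.boundaries.sentences(text):
            words = re.findall(r"[A-Za-z]+", sentence)
            parsed.append([(w, lexicon.categorize(w) or "unknown") for w in words])
        return parsed


class Processor:
    """Receives the input text and wires the other components together."""

    def __init__(self, lexicon):
        self.lexicon = lexicon
        self.boundaries = BoundaryIdentifier(lexicon)
        self.parser = Parser(self.boundaries)

    def receive(self, text):
        return self.parser.parse(text)
```

With a lexicon such as `LexiconKnowledgeBase({"pain": "finding", "denies": "verb"}, {"pt.", "dr."})`, `Processor(lexicon).receive("Pt. denies pain. Seen by Dr. Smith.")` returns one labeled word list per sentence, and the periods in "Pt." and "Dr." are not treated as sentence ends.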
Specification