System and method for automatic semantic labeling of natural language texts
First Claim
Patent Images
1. A processor-implemented method for automatic labeling of natural language text, the method comprising:
- receiving text from at least one natural language document in electronic form;
performing, using a processor, a basic linguistic analysis of the text that includes recognizing cause-effect relationships in the text and generating cause-effect labels for words or phrases in the text that form part of the cause-effect relationships;
matching the linguistically analyzed text and the generated cause-effect labels against stored target semantic relationship patterns, wherein the stored target semantic relationship patterns generically describe semantic relationships between words or phrases, the stored target semantic relationships being derived in part from cause-effect relationships between words or phrases;
producing additional semantic relationship labels for the linguistically analyzed text based on the matching of the linguistically analyzed text and the generated cause-effect labels against the stored target semantic relationship patterns, wherein the additional semantic relationship labels are tagged to words or phrases from sentences within the linguistically analyzed text in order to identify semantic relationships between those words or phrases by identifying those words or phrases as components of semantic relationships of the stored target semantic relationship patterns; and
storing the linguistically analyzed text and the additional semantic relationship labels in a non-transitory storage medium.
4 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for automatic semantic labeling of natural language documents provided in electronic or digital form include a semantic processor that performs a basic linguistic analysis of text, including recognizing in the text semantic relationships of the type objects and/or classes of objects, facts and cause-effect relationships; matching linguistically analyzed text against target semantic relationship patterns, created by generalization of particular cases of target semantic relationships; and generating semantic relationship labels based on linguistically analyzed text and a result of the matching.
105 Citations
24 Claims
-
1. A processor-implemented method for automatic labeling of natural language text, the method comprising:
-
receiving text from at least one natural language document in electronic form; performing, using a processor, a basic linguistic analysis of the text that includes recognizing cause-effect relationships in the text and generating cause-effect labels for words or phrases in the text that form part of the cause-effect relationships; matching the linguistically analyzed text and the generated cause-effect labels against stored target semantic relationship patterns, wherein the stored target semantic relationship patterns generically describe semantic relationships between words or phrases, the stored target semantic relationships being derived in part from cause-effect relationships between words or phrases; producing additional semantic relationship labels for the linguistically analyzed text based on the matching of the linguistically analyzed text and the generated cause-effect labels against the stored target semantic relationship patterns, wherein the additional semantic relationship labels are tagged to words or phrases from sentences within the linguistically analyzed text in order to identify semantic relationships between those words or phrases by identifying those words or phrases as components of semantic relationships of the stored target semantic relationship patterns; and storing the linguistically analyzed text and the additional semantic relationship labels in a non-transitory storage medium. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A semantic processor for automatically semantic labeling of natural language text in electronic or digital form, the semantic processor comprising:
-
a preformatter that preformats received electronic text; a linguistic analyzer that performs basic linguistic analysis of the preformatted text using a central processing unit (CPU), wherein the basic linguistic analysis includes; recognizing objects or classes of objects within the text; recognizing facts over the objects or classes of objects; recognizing cause-effect relationships over the facts; and creating tags that associate the text with corresponding cause-effect relationships; and a labeler that matches the linguistically analyzed text and the created tags against stored target semantic relationship patterns to produce semantically labeled text, wherein the stored target semantic relationship patterns generically describe semantic relationships between words or phrases, the stored target semantic relationships being derived in part from cause-effect relationships between words or phrases, wherein the semantically labeled text includes semantic relationship labels that indicate semantic relationships between words or phrases from sentences within the text, wherein the semantic relationship labels indicate which words or phrases in the linguistically analyzed text are components of semantic relationships of the stored target semantic relationship patterns. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A computer-implemented method for recognizing new types of linguistic patterns for use in a pattern database, the method comprising,
performing, using a processor, a first level linguistic analysis of a natural language document to recognize and tag words indicative of objects and actions in the natural language document; -
performing a second level linguistic analysis of the natural language document by comparing the tags created during the first level linguistic analysis to linguistic patterns stored in a linguistic knowledge database and by creating semantic relationship labels indicative of semantic relationship linguistic patterns in the natural language document, the semantic relationship linguistic patterns including cause-effect relationships patterns; and performing a third level linguistic analysis of the natural language document by recognizing a new type of semantic relationship linguistic pattern based on the semantic relationship labels created during the second level linguistic analysis and on target semantic relationship patterns that generically describe semantic relationships between words or phrases, the target semantic relationships being derived in part from cause-effect relationships between words or phrases, the new type of semantic relationship linguistic pattern identifying a new semantic relationship between words or phrases in the natural language document. - View Dependent Claims (24)
-
Specification