Method and system of text synthesis based on extracted information in the form of an RDF graph making use of templates
First Claim
Patent Images
1. A method comprising:
- receiving, by a hardware processor, a plurality of information objects extracted from text data in one or more source text documents, the plurality of information objects having representations in a data model, wherein each information object comprises one or more properties and, for each property, a corresponding natural language string value from the text data;
selecting, by the hardware processor, at least one information object and a synthesis template associated with the selected information object, wherein the selecting of the at least one information object and the synthesis template associated with the selected information object comprises;
selecting the at least one information object from the plurality of information objects; and
selecting the synthesis template associated with the selected information object from a plurality of synthesis templates stored in a template library, the synthesis template comprising a sentence in a natural language, a plurality of substitutions comprising one or more words from the sentence and their associated properties, and a template semantic-syntactic tree, and wherein the template semantic-syntactic tree is generated based on the sentence in the natural language, wherein the selecting of the synthesis template associated with the selected information object comprises;
for each synthesis template from the plurality of synthesis templates;
comparing a property associated with a word in a plurality of substitutions of a respective synthesis template with one or more properties of the selected information object;
responsive to determining that the property of the respective synthesis template matches one of the one or more properties of the selected information object, determining whether a natural language string value corresponding to the matching property of the selected information object is nominative or not; and
responsive to determining that the natural language string value is not nominative, eliminating the respective synthesis template from the plurality of synthesis templates for selection;
generating, by the hardware processor, for each selected information object, a synthesis semantic-syntactic tree based on the template semantic-syntactic tree and the plurality of substitutions of the synthesis template associated with the selected information object; and
generating, by the hardware processor, a natural language text based on each generated synthesis semantic-syntactic tree to provide an annotation for the text data in the one or more source text documents.
4 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are system, method and computer program product for synthesis of natural-language text; receiving information objects; selecting among the received information objects information objects and an associated synthesis templates in a template library, each synthesis template including a template semantic-syntactic tree; generating for each selected information object a synthesis semantic-syntactic tree based on the template semantic-syntactic tree; and generating natural language text based on each generated synthesis semantic-syntactic tree.
51 Citations
20 Claims
-
1. A method comprising:
-
receiving, by a hardware processor, a plurality of information objects extracted from text data in one or more source text documents, the plurality of information objects having representations in a data model, wherein each information object comprises one or more properties and, for each property, a corresponding natural language string value from the text data; selecting, by the hardware processor, at least one information object and a synthesis template associated with the selected information object, wherein the selecting of the at least one information object and the synthesis template associated with the selected information object comprises; selecting the at least one information object from the plurality of information objects; and selecting the synthesis template associated with the selected information object from a plurality of synthesis templates stored in a template library, the synthesis template comprising a sentence in a natural language, a plurality of substitutions comprising one or more words from the sentence and their associated properties, and a template semantic-syntactic tree, and wherein the template semantic-syntactic tree is generated based on the sentence in the natural language, wherein the selecting of the synthesis template associated with the selected information object comprises; for each synthesis template from the plurality of synthesis templates; comparing a property associated with a word in a plurality of substitutions of a respective synthesis template with one or more properties of the selected information object; responsive to determining that the property of the respective synthesis template matches one of the one or more properties of the selected information object, determining whether a natural language string value corresponding to the matching property of the selected information object is nominative or not; and responsive to determining that the natural language string value is not nominative, eliminating the respective synthesis template from the plurality of synthesis templates for selection; generating, by the hardware processor, for each selected information object, a synthesis semantic-syntactic tree based on the template semantic-syntactic tree and the plurality of substitutions of the synthesis template associated with the selected information object; and generating, by the hardware processor, a natural language text based on each generated synthesis semantic-syntactic tree to provide an annotation for the text data in the one or more source text documents. - View Dependent Claims (2, 3, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
4. A system comprising:
-
a memory; and a hardware processor, coupled to the memory, to; receive a plurality of information objects extracted from text data in one or more source text documents, the plurality of information objects having representations in a data model, wherein each information object comprises one or more properties and, for each property, a corresponding natural language string value from the text data; select at least one information object and a synthesis template associated with the selected information object, wherein to select the at least one information object and the synthesis template associated with the selected information object, the hardware processor further to; select the at least one information object from the plurality of information objects; and select the synthesis template from a plurality of synthesis templates stored in a template library, the synthesis template comprising a sentence in a natural language, a plurality of substitutions parameters comprising one or more words from the sentence and their associated properties, and a template semantic-syntactic tree, and wherein the template semantic-syntactic tree is generated based on the sentence in the natural language wherein to select the synthesis template associated with the selected information object, the hardware processor further to; for each synthesis template from the plurality of synthesis templates; compare a property associated with a word in a plurality of substitutions of a respective synthesis template with one or more properties of the selected information object; responsive to determining that the property of the respective synthesis template matches one of the one or more properties of the selected information object, determine whether a natural language string value corresponding to the matching property of the selected information object is nominative or not; and responsive to determining that the natural language string value is not nominative, eliminate the respective synthesis template from the plurality of synthesis templates for selection; generate for each selected information object a synthesis semantic-syntactic tree based on the template semantic-syntactic tree and the plurality of substitutions of the synthesis template associated with the selected information object; and generate a natural language text based on each generated synthesis semantic-syntactic tree to provide an annotation for the text data in the one or more source text documents. - View Dependent Claims (5)
-
-
6. A computer program product stored on a non-transitory computer-readable storage medium, the computer program product comprising computer-executable instructions for synthesis of natural-language text, comprising instructions for:
-
receiving, by a hardware processor, a plurality of information objects extracted from text data in one or more source text documents, the plurality of information objects having representations in a data model, wherein each information object comprises one or more properties and, for each property, a corresponding natural language string value from the text data; selecting, by the hardware processor, at least one information object and a synthesis template associated with the selected information object, wherein the selecting of the at least one information object and the synthesis template associated with the selected information object comprises; selecting the at least one information object from the plurality of information objects; and selecting the synthesis template from a plurality of synthesis templates stored in a template library, the synthesis template comprises a sentence in a natural language, a plurality of substitutions comprising one or more words from the sentence and their associated properties, and a template semantic-syntactic tree, and wherein the template semantic-syntactic tree is generated based on the sentence in the natural language, wherein the selecting of the synthesis template associated with the selected information object comprises; for each synthesis template from the plurality of synthesis templates; comparing a property associated with a word in a plurality of substitutions of a respective synthesis template with one or more properties of the selected information object; responsive to determining that the property of the respective synthesis template matches one of the one or more properties of the selected information object, determining whether a natural language string value corresponding to the matching property of the selected information object is nominative or not; and responsive to determining that the natural language string value is not nominative, eliminating the respective synthesis template from the plurality of synthesis templates for selection; generating, by the hardware processor, for the selected information object, a synthesis semantic-syntactic tree based on the template semantic-syntactic tree and the plurality of substitutions of the synthesis template associated with the selected information object; generating by the hardware processor, natural language text based on each generated synthesis semantic-syntactic tree to provide an annotation for the text data in the one or more source text documents. - View Dependent Claims (7)
-
Specification