Method and system of text synthesis based on extracted information in the form of an RDF graph making use of templates

  • US 10,210,249 B2
  • Filed: 05/20/2015
  • Issued: 02/19/2019
  • Est. Priority Date: 03/19/2015
  • Status: Active Grant
First Claim

1. A method comprising:

  • receiving, by a hardware processor, a plurality of information objects extracted from text data in one or more source text documents, the plurality of information objects having representations in a data model, wherein each information object comprises one or more properties and, for each property, a corresponding natural language string value from the text data;

  • selecting, by the hardware processor, at least one information object and a synthesis template associated with the selected information object, wherein the selecting of the at least one information object and the synthesis template associated with the selected information object comprises:

      • selecting the at least one information object from the plurality of information objects; and

      • selecting the synthesis template associated with the selected information object from a plurality of synthesis templates stored in a template library, the synthesis template comprising a sentence in a natural language, a plurality of substitutions comprising one or more words from the sentence and their associated properties, and a template semantic-syntactic tree, and wherein the template semantic-syntactic tree is generated based on the sentence in the natural language, wherein the selecting of the synthesis template associated with the selected information object comprises:

          • for each synthesis template from the plurality of synthesis templates:

              • comparing a property associated with a word in a plurality of substitutions of a respective synthesis template with one or more properties of the selected information object;

              • responsive to determining that the property of the respective synthesis template matches one of the one or more properties of the selected information object, determining whether a natural language string value corresponding to the matching property of the selected information object is nominative or not; and

              • responsive to determining that the natural language string value is not nominative, eliminating the respective synthesis template from the plurality of synthesis templates for selection;

  • generating, by the hardware processor, for each selected information object, a synthesis semantic-syntactic tree based on the template semantic-syntactic tree and the plurality of substitutions of the synthesis template associated with the selected information object; and

  • generating, by the hardware processor, a natural language text based on each generated synthesis semantic-syntactic tree to provide an annotation for the text data in the one or more source text documents.
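
The template-elimination loop recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names `InformationObject` and `SynthesisTemplate`, the function `select_templates`, and the property names are all hypothetical, and `is_nominative` is a toy stand-in for the linguistic analysis a real system would need. The sketch also assumes that a template with no matching property is kept, a case the claim does not address, and it does not model the semantic-syntactic trees.

```python
from dataclasses import dataclass


@dataclass
class InformationObject:
    # Maps a property name to the natural language string value
    # extracted from the source text (hypothetical representation).
    properties: dict[str, str]


@dataclass
class SynthesisTemplate:
    # Template sentence in a natural language.
    sentence: str
    # Maps a word of the sentence to the property it stands in for.
    substitutions: dict[str, str]


def is_nominative(value: str) -> bool:
    # Toy placeholder (assumption): treat a value as non-nominative only if
    # it ends with an English possessive marker. A real system would rely
    # on morphological analysis instead.
    return not value.endswith(("'s", "s'"))


def select_templates(obj: InformationObject,
                     templates: list[SynthesisTemplate]) -> list[SynthesisTemplate]:
    """Return the templates that survive the nominative check for obj."""
    kept = []
    for template in templates:
        eliminated = False
        for prop in template.substitutions.values():
            # Compare the template's property with the object's properties.
            if prop in obj.properties:
                # On a match, check the object's string value for that property.
                if not is_nominative(obj.properties[prop]):
                    eliminated = True
                    break
        if not eliminated:
            kept.append(template)
    return kept


# Usage: the "employer" value is possessive, so the first template is dropped.
obj = InformationObject({"person": "John Smith", "employer": "Acme's"})
templates = [
    SynthesisTemplate("Mary works for IBM", {"Mary": "person", "IBM": "employer"}),
    SynthesisTemplate("Mary is a person", {"Mary": "person"}),
]
surviving = select_templates(obj, templates)
```

Filtering per template, rather than per property, mirrors the claim's wording: a single non-nominative match removes the whole template from the candidates for selection.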
