System for identifying textual relationships

  • US 9,400,778 B2
  • Filed: 12/14/2011
  • Issued: 07/26/2016
  • Est. Priority Date: 02/01/2011
  • Status: Active Grant
First Claim

1. A computer-implemented method for identifying textual statement relationships, the method comprising:

  • identifying a textual statement pair that includes a first textual statement and a second textual statement, the first textual statement comprising a first set of words and the second textual statement comprising a second set of words;

    removing, by a pre-processing module, non-alphanumeric characters from the first textual statement and the second textual statement;

    communicating, by the pre-processing module, the pre-processed first textual statement and second textual statement to a processor;

    extracting, by the processor, a first parsed word group from the first textual statement and a second parsed word group from the second textual statement, wherein each parsed word group is a verb-object-preposition (VOP) triple including a verb, an object, and a preposition from each respective textual statement;

    comparing, for the textual statement pair, the first parsed word group and the second parsed word group; and

    calculating, through the use of the processor, a parsed word score for the textual statement pair, wherein the parsed word score is based on the comparison of the first parsed word group and the second parsed word group;

    determining a match score for the textual statement pair based on the parsed word score, wherein calculating the parsed word score for the textual statement pair comprises:

    extracting, through the use of the processor, a parsed word group pair from the textual statement pair, wherein the parsed word group pair includes a plurality of term pairs, the plurality of term pairs including a verb pair comprising a verb from the VOP triple for the first word group and a verb from the VOP triple for the second word group, an object pair comprising an object from the VOP triple for the first word group and an object from the VOP triple for the second word group, and a preposition pair comprising a preposition from the VOP triple for the first word group and a preposition from the VOP triple for the second word group;

    calculating a verb pair sub-score, an object pair sub-score, and a preposition pair sub-score, the calculation of each pair sub-score based on a string similarity, a semantic similarity, and a lexicon similarity between each verb, object, or preposition of the respective verb pair, object pair, or preposition pair; and

    wherein the parsed word score is the product of at least one of the verb pair sub-score, the object pair sub-score, and the preposition pair sub-score;

    generating, by the processor, a user interface configured to depict one or more first textual statements and one or more second textual statements along with one or more match indicators that visually indicate a match between one or more of the first textual statements and one or more of the second textual statements;

    communicating, by a graphics processor in communication with the processor and a display, the generated user interface to thereby cause the display to visually display the generated user interface.
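
The pre-processing and scoring steps recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The claim does not say how the string, semantic, and lexicon similarities are combined into a pair sub-score, so taking their maximum is an assumption made here. The VOP triples are supplied by hand rather than extracted by a parser, and the names VOP, SYNONYMS, LEXICON, pair_sub_score, and parsed_word_score are hypothetical. The two lookup tables are toy stand-ins for a real semantic resource and a domain lexicon.

```python
import re
from difflib import SequenceMatcher
from typing import NamedTuple


class VOP(NamedTuple):
    """A verb-object-preposition triple (hypothetical container)."""
    verb: str
    obj: str
    prep: str


def preprocess(statement: str) -> str:
    # Remove non-alphanumeric characters. Whitespace is kept (an assumption)
    # so that the words remain separated for later parsing.
    return re.sub(r"[^A-Za-z0-9\s]", "", statement)


# Toy stand-ins (assumptions) for a semantic resource and a domain lexicon.
SYNONYMS = {frozenset({"send", "transmit"})}
LEXICON = {frozenset({"invoice", "bill"})}


def string_similarity(a: str, b: str) -> float:
    # Character-level similarity in [0, 1] from the standard library.
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def table_similarity(a: str, b: str, table: set) -> float:
    # 1.0 if the terms are identical or listed as a pair in the table, else 0.0.
    a, b = a.lower(), b.lower()
    return 1.0 if a == b or frozenset({a, b}) in table else 0.0


def pair_sub_score(a: str, b: str) -> float:
    # Assumption: combine the three similarities by taking the maximum.
    return max(
        string_similarity(a, b),
        table_similarity(a, b, SYNONYMS),
        table_similarity(a, b, LEXICON),
    )


def parsed_word_score(first: VOP, second: VOP) -> float:
    # Product of the verb, object, and preposition pair sub-scores.
    score = 1.0
    for a, b in zip(first, second):
        score *= pair_sub_score(a, b)
    return score


if __name__ == "__main__":
    print(preprocess("Send the invoice, to the customer!"))
    first = VOP("send", "invoice", "to")
    second = VOP("transmit", "bill", "to")
    print(parsed_word_score(first, second))
```

Because the score is a product, a single dissimilar term pair pulls the whole parsed word score down, even when the other two pairs match exactly.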
