Smart string replacement

  • US 7,788,085 B2
  • Filed: 12/17/2004
  • Issued: 08/31/2010
  • Est. Priority Date: 12/17/2004
  • Status: Expired due to Fees
First Claim

1. A method to replace a source string in a document with a target string, comprising:

  • preprocessing textual content via a computer by:

         1) tokenizing text in said document

         2) collecting information concerning morphological information, part-of-speech disambiguation, syntactic dependencies, anaphoric dependencies and semantic relationships in the textual content and

         3) labeling the document with such information, wherein said tokenizing includes multiword tokenization wherein textual input is tokenized into non-isolated units;

    selecting a target string that is placed at one or more locations within the document via the computer;

    selecting a source string to replace the target string via the computer;

    morpho-syntactically disambiguating textual content of the document via the computer;

    identifying a set of string dependencies, via the computer, to detect grammatical or anaphoric dependencies, or both, between the strings in the textual content of the document;

    disambiguating one or more of gender, number, or part of speech and prompting user-specified disambiguation, via the computer, if the source string or the target string have more than at least one of one possible meaning, gender, and number;

    identifying occurrences of the source string in the document that satisfy the user specifications via the computer;

    identifying string relations from the set of string dependencies that define direct or indirect links, or both, to the source string via the computer;

    assessing whether replacing the source string with the target string is semantically coherent via the computer;

    replacing each occurrences of the source string in the document that satisfy the user specifications with the target string via the computer;

    correcting grammatical and anaphoric inconsistencies beyond the phrase level, via the computer, in the string relations in the document that are introduced when the source string is replaced with the target string; and

    outputting the document.
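
The replace-then-correct idea in the claim can be illustrated with a toy sketch. This is not the patented implementation: the `GENDER` lexicon, the `SUBJECT` pronoun table, and the function names `match_case` and `smart_replace` are all hypothetical, and the sketch assumes English text, handles only the indefinite article (a/an) and subject pronouns (he/she), and naively treats every later subject pronoun of the source gender as referring to the replaced noun. A real system would rely on the part-of-speech, dependency, and anaphora analysis that the claim describes.

```python
import re

# Hypothetical toy lexicon: grammatical gender of a few nouns (assumption).
GENDER = {"king": "m", "queen": "f", "actor": "m", "actress": "f"}
# Subject pronoun for each gender; other pronoun forms are out of scope here.
SUBJECT = {"m": "he", "f": "she"}


def match_case(replacement: str, original: str) -> str:
    """Capitalize the replacement if the original token was capitalized."""
    return replacement.capitalize() if original[:1].isupper() else replacement


def smart_replace(text: str, source: str, target: str) -> str:
    """Replace `source` with `target`, then repair a/an and he/she agreement."""
    # Split into word and non-word runs so joining the tokens restores the text.
    tokens = re.findall(r"\w+|\W+", text)
    src_gender, tgt_gender = GENDER.get(source), GENDER.get(target)
    gender_changed = bool(src_gender and tgt_gender and src_gender != tgt_gender)
    replaced = False
    for i, tok in enumerate(tokens):
        if tok.lower() == source:
            tokens[i] = match_case(target, tok)
            replaced = True
            # Grammatical dependency: the word two tokens back (skipping the
            # separator) may be an indefinite article that must agree.
            if i >= 2 and tokens[i - 2].lower() in ("a", "an"):
                article = "an" if target[0].lower() in "aeiou" else "a"
                tokens[i - 2] = match_case(article, tokens[i - 2])
        elif replaced and gender_changed and tok.lower() == SUBJECT[src_gender]:
            # Anaphoric dependency (naive): a later subject pronoun is assumed
            # to refer back to the replaced noun.
            tokens[i] = match_case(SUBJECT[tgt_gender], tok)
    return "".join(tokens)


print(smart_replace("A king arrived. He waved.", "king", "actress"))
```

Here the correction of "He" crosses a sentence boundary, which is the kind of inconsistency "beyond the phrase level" that the final correcting step targets; the sketch omits the user-prompted disambiguation and the semantic-coherence check.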
