Method for text processing

  • US 9,898,448 B2
  • Filed: 10/23/2015
  • Issued: 02/20/2018
  • Est. Priority Date: 08/29/2014
  • Status: Active Grant
First Claim

1. A computer-implemented method for text processing, the method being executable at a computing device, the method comprising:

  • at a training phase:

    acquiring one or more source phrases, each of the source phrase comprising a first set of sequential words, each word of the first set of sequential words being a source word;

    acquiring one or more target phrases, each of the target phrase being in a same language as the source phrases, each of the target phrase comprising a second set of sequential words being at least partially different from the first set of sequential words of a respective source phrase, each word of the second set of sequential words being a target word;

    associating, for a given source phrase, a respective source word feature set with each one of the source words, the respective source word feature set for a given source word comprising:

    one or more grammatical features of the given source word; and

    a meaning of the given source word;

    associating, for a respective target phrase, a respective target word feature set with each one of the target words, the respective target word feature set for a given target word comprising:

    one or more grammatical features of the given target word; and

    a meaning of the given target word;

    analyzing the respective source word feature set of each source words of the given source phrase and the respective target word feature set of each target words of the respective target phrase;

    mapping the given source word of the given source phrase to a corresponding target word of the respective target phrase based on a similarity of the source word feature set of the given source word to the target word feature set of the corresponding target word;

    based on the mapping, generating one or more phrase transformation rules applicable to the given source phrase to transform the first set of sequential words into the second set of sequential words of the respective target phrase;

    storing the one or more source phrases and the associated one or more generated phrase transformation rules in a memory of the computing device;

  • at an in-use phase:

    acquiring a text phrase, the text phrase comprising a third set of sequential words being at least partially different from the first set of sequential words; and

    retrieving the one or more source phrases from the memory;

    performing at least one of a grammatical analysis and a semantic analysis of the text phrase and the one or more stored source phrases, to determine similarity of the text phrase to the one or more stored source phrases;

    upon determining that the text phrase has the similarity to the given stored source phrase greater than a threshold, applying the associated one or more phrase transformation rules to the text phrase to generate a transformed text phrase, the transformed text phrase comprising a fourth set of sequential words being at least partially similar to the second set of sequential words of the respective target phrase.
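
The two phases recited above can be illustrated with a minimal Python sketch. It is not the patented implementation and is offered only as a reading aid. Everything in it is an assumption made for illustration: the toy `LEXICON`, the use of a part-of-speech tag as the only grammatical feature, a single label standing in for a word's meaning, a reordering of word positions as the only kind of phrase transformation rule, and the helper names `features`, `word_similarity`, `train`, `phrase_similarity` and `transform`.

```python
# Hypothetical toy lexicon: word -> (part of speech, meaning label).
LEXICON = {
    "quickly": ("ADV", "fast"),
    "slowly": ("ADV", "slow"),
    "she": ("PRON", "female_person"),
    "he": ("PRON", "male_person"),
    "ran": ("VERB", "run"),
    "walked": ("VERB", "walk"),
}

def features(word):
    """Feature set of a word: one grammatical feature plus a meaning."""
    pos, meaning = LEXICON.get(word, ("UNK", word))
    return {"pos": pos, "meaning": meaning}

def word_similarity(a, b):
    """Fraction of features (pos, meaning) shared by two feature sets."""
    return sum(a[k] == b[k] for k in ("pos", "meaning")) / 2

def train(source, target, store):
    """Training phase: map words by feature similarity, derive a
    reordering rule, and store it with the source phrase."""
    src, tgt = source.split(), target.split()
    order, used = [], set()
    for t in tgt:
        candidates = [i for i in range(len(src)) if i not in used]
        best = max(candidates,
                   key=lambda i: word_similarity(features(src[i]), features(t)))
        if word_similarity(features(src[best]), features(t)) == 0:
            raise ValueError(f"no source word resembles {t!r}")
        used.add(best)
        order.append(best)  # target position -> source position
    store[tuple(src)] = order

def phrase_similarity(words, src):
    """Grammatical analysis only: share of positions with the same POS."""
    if len(words) != len(src):
        return 0.0
    same = sum(features(w)["pos"] == features(s)["pos"]
               for w, s in zip(words, src))
    return same / len(src)

def transform(text, store, threshold=0.8):
    """In-use phase: apply the rule of the first stored source phrase
    whose similarity to the text phrase exceeds the threshold."""
    words = text.split()
    for src, order in store.items():
        if phrase_similarity(words, src) > threshold:
            return " ".join(words[i] for i in order)
    return text  # no sufficiently similar source phrase: leave unchanged

store = {}
train("quickly she ran", "she ran quickly", store)
print(transform("slowly he walked", store))  # he walked slowly
```

In this sketch the stored rule `[1, 2, 0]` is learned from one source/target pair and then reused on a text phrase that shares no words with the source phrase but has the same grammatical shape, which is the behaviour the in-use steps describe. A phrase whose similarity does not exceed the assumed threshold of 0.8 is returned unchanged.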

  • 4 Assignments