×

Method for generating paraphrases for use in machine translation system

  • US 10,318,642 B2
  • Filed: 12/12/2016
  • Issued: 06/11/2019
  • Est. Priority Date: 02/01/2016
  • Status: Active Grant
First Claim
Patent Images

1. A paraphrase generation method, comprising:

  • receiving an original text;

    generating, using a processor, one or more paraphrase candidate sentences by paraphrasing one or more of a plurality of fragments included in the original text into another expression in a language of the original text, the plurality of fragments being obtained by dividing the original text in accordance with a predetermined rule;

    determining whether the paraphrasing of one or more of the plurality of fragments in each of the one or more paraphrase candidate sentences is acceptable by comparing each of a cumulative total of paraphrase acceptability scores with a first acceptable limit, each of the cumulative total of paraphrase acceptability scores being calculated by adding each of paraphrase acceptability scores in each of the one or more paraphrase candidate sentences, each of the paraphrase acceptability scores being assigned to a paraphrase pair including a first fragment and a second fragment that represents another expression of the first fragment, each of the paraphrase acceptability scores indicating a degree to which paraphrasing from the first fragment into the second fragment is accepted;

    determining whether each of the one or more paraphrase candidate sentences is acceptable by comparing each of linguistic acceptability scores with a second acceptable limit, each of the linguistic acceptability scores indicating a degree to which the paraphrase candidate sentence is accepted as having a linguistically correct meaning, and each of the linguistic acceptability scores being obtained from a language model; and

    outputting each of the one or more paraphrase candidate sentences when (i) each of the cumulative total of paraphrase acceptability scores is within the first acceptable limit, and (ii) each of the linguistic acceptability scores is within the second acceptable limit.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×