Unsupervised learning of paraphrase/translation alternations and selective application thereof
First Claim
Patent Images
1. A computer-implemented method for applying a paraphrase alternation pattern to an input string, the method comprising:
- receiving the input string;
receiving the paraphrase alternation pattern, wherein the paraphrase alternation pattern indicates a pattern of transformation from a first set of words to a second set of words wherein the given paraphrase alternation pattern is abstract in that one or both of the first and second sets of words include a place holder or blank to be filled in rather than a complete listing of specific words;
generating a language model; and
utilizing a computer processor that is a functional component of the computer to apply the language model to determine how to apply the given paraphrase alternation pattern to the input string, and wherein applying the language model comprises;
applying the given paraphrase alternation pattern to the input string in a plurality of different ways to produce a plurality of different textual variations, wherein applying the given paraphrase alternation pattern to the input string in a plurality of different ways comprises switching different combinations of words around a word within the input string identified as being a pivot word; and
applying the language model to the plurality of different textual variations to determine a probable sequence of words.
2 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented method is disclosed for applying a given paraphrase alternation pattern to an input string. The method includes generating a language model and applying the language model to determine how to apply the given paraphrase alternation pattern to the input string.
151 Citations
17 Claims
-
1. A computer-implemented method for applying a paraphrase alternation pattern to an input string, the method comprising:
-
receiving the input string; receiving the paraphrase alternation pattern, wherein the paraphrase alternation pattern indicates a pattern of transformation from a first set of words to a second set of words wherein the given paraphrase alternation pattern is abstract in that one or both of the first and second sets of words include a place holder or blank to be filled in rather than a complete listing of specific words; generating a language model; and utilizing a computer processor that is a functional component of the computer to apply the language model to determine how to apply the given paraphrase alternation pattern to the input string, and wherein applying the language model comprises; applying the given paraphrase alternation pattern to the input string in a plurality of different ways to produce a plurality of different textual variations, wherein applying the given paraphrase alternation pattern to the input string in a plurality of different ways comprises switching different combinations of words around a word within the input string identified as being a pivot word; and applying the language model to the plurality of different textual variations to determine a probable sequence of words. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer-implemented method for applying a paraphrase alternation pattern to an input string, the method comprising:
-
utilizing a computer processor that is a functional component of the computer to derive the paraphrase alternation pattern from a set of parallel texts, wherein deriving the paraphrase alteration pattern from a set of parallel texts comprises deriving from a set of bilingual texts;
creating a series of different application alternatives that each represent a differentapplication of the paraphrase alternation pattern to the input string; and
applying the series of different application alternatives to a language model to determinea probable sequence of words. - View Dependent Claims (10, 11)
-
-
12. A computer-implemented method of selecting boundaries for application of a paraphrase alternation pattern to an input string, the method comprising:
-
utilizing a computer processor that is a functional component of the computer to apply the paraphrase alternation pattern multiple times to the input string with multiple boundaries so as to create a plurality of alternation alternatives, wherein each alternation alternative is a different sequence of words included in the input string; generating a language model based on a set of test data and then applying the language model to the plurality of alternation alternatives to determine which, based on frequency within the set of test data, is a relatively commonly used sequence of words; and wherein applying so as to create a plurality of alternation alternatives comprise applying so as to switch different combinations of words around a word in the input string identified as being a pivot word. - View Dependent Claims (13, 14, 15, 16, 17)
-
Specification