Paraphrasing the web by search-based data collection
First Claim
Patent Images
1. A method for supporting a language processing application, the method comprising:
- obtaining a target item of text;
utilizing an index search procedure to identify a distributionally similar item of text;
determining, based on the results of one or more index queries, whether the distributionally similar item of text is semantically equivalent to the target item of text; and
if the distributionally similar item is semantically equivalent to the target item of text, then utilizing the distributionally similar item as a substitute for the target item of text within the language processing application.
2 Assignments
0 Petitions
Accused Products
Abstract
String-oriented web queries are utilized as a tool to examine the fabric of how words, phrases and/or n-grams alternate in a language. This fabric is exploited in order to build up a matrix of semantically equivalent pieces of language. In one embodiment, the Distributional Hypothesis is utilized, along with strategies for confirming synonymy, to systematically build up a picture of what words/phrases can be legitimately substituted for one another.
-
Citations
20 Claims
-
1. A method for supporting a language processing application, the method comprising:
-
obtaining a target item of text; utilizing an index search procedure to identify a distributionally similar item of text; determining, based on the results of one or more index queries, whether the distributionally similar item of text is semantically equivalent to the target item of text; and if the distributionally similar item is semantically equivalent to the target item of text, then utilizing the distributionally similar item as a substitute for the target item of text within the language processing application. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A language processing system, comprising:
-
an index query engine; a processing component configured to determine, based on the results of one or more queries executed by the index query engine, whether an item of text is semantically equivalent to the target item of text. - View Dependent Claims (19)
-
-
20. A method for supporting a language processing application, the method comprising:
-
obtaining a target item of text; determining, based on the results of one or more index queries, whether the target item of text is semantically equivalent to another item of text; and if said another item is semantically equivalent to the target item of text, then utilizing said another item as a substitute for the target item of text within the language processing application.
-
Specification