Systems and methods for using anchor text as parallel corpora for cross-language information retrieval
First Claim
1. A system, comprising:
- at least one processor; and
a memory to store instructions, which when executed by the at least one processor, cause the at least one processor to:
search a plurality of documents based on a search query that includes terms in a first language;
identify, based on a content of the search query, one or more documents in the first language that include an anchor that links to one or more documents in a second language, the anchor having anchor text that matches at least one of the terms of the search query, where the second language is different than the first language;
identify the one or more documents in the second language based on the anchor having anchor text that matches at least one of the terms of the search query; and
translate the terms of the search query into the second language using content included in the one or more documents in the second language.
Abstract
A system performs cross-language query translations. The system receives a search query that includes terms in a first language and determines possible translations of the terms of the search query into a second language. The system also locates documents for use as parallel corpora to aid in the translation by: (1) locating documents in the first language that contain references that match the terms of the search query and identify documents in the second language; (2) locating documents in the first language that contain references that match the terms of the query and refer to other documents in the first language and identify documents in the second language that contain references to the other documents; or (3) locating documents in the first language that match the terms of the query and identify documents in the second language that contain references to the documents in the first language. The system may use the second language documents as parallel corpora to disambiguate among the possible translations of the terms of the search query and identify one of the possible translations as a likely translation of the search query into the second language.
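The three ways of locating second-language documents described above, followed by the disambiguation step, can be sketched as follows. This is a minimal illustration and not the patented implementation: the `Doc` structure, the function names, the whitespace tokenization, and the sample English/Spanish pages are all hypothetical, and choosing the candidate by simple term frequency is an assumption, since the abstract says only that the second-language documents are used to disambiguate among possible translations.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Doc:
    doc_id: str
    lang: str
    text: str
    # Outgoing links as (anchor_text, target_doc_id) pairs.
    links: list = field(default_factory=list)


def find_parallel_docs(docs, query, src_lang, tgt_lang):
    """Return ids of target-language documents reached by patterns (1)-(3)."""
    terms = set(query.lower().split())
    by_id = {d.doc_id: d for d in docs}
    found = set()
    shared_targets = set()  # source-language pages reached via matching anchors
    matching_src = set()    # source-language pages whose own text matches

    for d in docs:
        if d.lang != src_lang:
            continue
        if terms & set(d.text.lower().split()):
            matching_src.add(d.doc_id)
        for anchor, target in d.links:
            if target not in by_id or not terms & set(anchor.lower().split()):
                continue
            if by_id[target].lang == tgt_lang:
                found.add(target)           # pattern (1)
            elif by_id[target].lang == src_lang:
                shared_targets.add(target)  # first half of pattern (2)

    for d in docs:
        if d.lang != tgt_lang:
            continue
        targets = {t for _, t in d.links}
        # Pattern (2): links to a page that a matching anchor also links to.
        # Pattern (3): links to a source-language page matching the query.
        if targets & shared_targets or targets & matching_src:
            found.add(d.doc_id)
    return found


def pick_translation(candidates, parallel_docs):
    """Assumed heuristic: prefer the candidate most frequent in the corpora."""
    counts = Counter()
    for d in parallel_docs:
        counts.update(d.text.lower().split())
    return max(candidates, key=lambda c: sum(counts[w] for w in c.lower().split()))


# Hypothetical sample data: English query "bank", Spanish target language.
DOCS = [
    Doc("en1", "en", "list of financial institutions", [("bank", "es1")]),
    Doc("es1", "es", "el banco ofrece cuentas y el banco presta dinero"),
    Doc("en2", "en", "river guide", [("bank accounts", "en3")]),
    Doc("en3", "en", "accounts overview"),
    Doc("es2", "es", "cuentas del banco", [("cuentas", "en3")]),
    Doc("en4", "en", "bank holidays"),
    Doc("es3", "es", "dias festivos del banco", [("festivos", "en4")]),
    Doc("es4", "es", "la orilla del rio"),  # unlinked, so never selected
]

parallel_ids = find_parallel_docs(DOCS, "bank", "en", "es")
parallel = [d for d in DOCS if d.doc_id in parallel_ids]
best = pick_translation(["banco", "orilla"], parallel)
```

In this toy data, `es1` is reached by pattern (1), `es2` by pattern (2), and `es3` by pattern (3), while the unlinked page `es4` contributes nothing, so the financial sense of the query term is preferred over the riverbank sense.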
20 Claims
1. A system, comprising:
at least one processor; and
a memory to store instructions, which when executed by the at least one processor, cause the at least one processor to:
search a plurality of documents based on a search query that includes terms in a first language;
identify, based on a content of the search query, one or more documents in the first language that include an anchor that links to one or more documents in a second language, the anchor having anchor text that matches at least one of the terms of the search query, where the second language is different than the first language;
identify the one or more documents in the second language based on the anchor having anchor text that matches at least one of the terms of the search query; and
translate the terms of the search query into the second language using content included in the one or more documents in the second language.
Dependent claims: 2, 3, 4, 5
6. A system, comprising:
at least one processor; and
a memory to store instructions, which when executed by the at least one processor, cause the at least one processor to:
search a plurality of documents based on a search query that includes terms in a first language;
identify one or more documents in the first language that include an anchor that links to one or more other documents in the first language and that has anchor text that matches at least one of the terms of the search query;
search the plurality of documents to identify one or more documents in a second language that is different than the first language, the one or more documents in the second language including an anchor that links to the one or more other documents in the first language; and
translate the search query into the second language using content included in the one or more documents in the second language.
Dependent claims: 7, 8, 9, 10
11. A memory device that stores one or more computer-executable instructions for execution by processing logic, the instructions comprising:
one or more instructions, which, when executed by the processing logic, cause the processing logic to receive a search query that includes terms in a first language;
one or more instructions, which, when executed by the processing logic, cause the processing logic to identify, based on a content of the search query, one or more documents in the first language that include an anchor that links to one or more documents in a second language, the anchor having anchor text that matches at least one of the terms of the search query, where the second language is different than the first language;
one or more instructions, which, when executed by the processing logic, cause the processing logic to identify the one or more documents in the second language based on the anchor having anchor text that matches at least one of the terms of the search query; and
one or more instructions, which, when executed by the processing logic, cause the processing logic to translate the terms of the search query into the second language using content included in the one or more documents in the second language.
Dependent claims: 12, 13, 14, 15
16. A memory device that stores one or more computer-executable instructions for execution by processing logic, the computer-executable instructions comprising:
one or more instructions, which, when executed by the processing logic, cause the processing logic to receive a search query that includes terms in a first language;
one or more instructions, which, when executed by the processing logic, cause the processing logic to identify one or more documents in the first language that include an anchor that links to one or more other documents in the first language and that has anchor text that matches at least one of the terms of the search query;
one or more instructions, which, when executed by the processing logic, cause the processing logic to search the plurality of documents to identify one or more documents in a second language that is different than the first language, the one or more documents in the second language including an anchor that links to the one or more other documents in the first language; and
one or more instructions, which, when executed by the processing logic, cause the processing logic to translate the search query into the second language using content included in the one or more documents in the second language.
Dependent claims: 17, 18, 19, 20
Specification