Systems and methods for using anchor text as parallel corpora for cross-language information retrieval
First Claim
Patent Images
1. A method, performed by one or more processors, the method comprising:
- receiving, by the one or more processors, a search query in a first natural language;
identifying, by the one or more processors and based on a content of the search query, one or more documents in the first natural language;
identifying, by the one or more processors, one or more documents in a second natural language that contain an anchor link that links to the one or more documents in the first natural language,the second natural language being different than the first natural language;
analyzing, by the one or more processors, content included in the one or more documents in the second natural language; and
translating, by the one or more processors and based on analyzing the content included in the one or more documents in the second natural language, one or more terms of the search query into the second natural language.
2 Assignments
0 Petitions
Accused Products
Abstract
A method may include obtaining, based on a content of a search query, one or more documents in a first language; identifying one or more documents in a second language that contain an anchor that links to the one or more documents in the first language, the second language being different than the first language; and translating one or more terms of the search query into the second language using content included in the one or more documents in the second language.
52 Citations
20 Claims
-
1. A method, performed by one or more processors, the method comprising:
-
receiving, by the one or more processors, a search query in a first natural language; identifying, by the one or more processors and based on a content of the search query, one or more documents in the first natural language; identifying, by the one or more processors, one or more documents in a second natural language that contain an anchor link that links to the one or more documents in the first natural language, the second natural language being different than the first natural language; analyzing, by the one or more processors, content included in the one or more documents in the second natural language; and translating, by the one or more processors and based on analyzing the content included in the one or more documents in the second natural language, one or more terms of the search query into the second natural language. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
one or more processors to; receive a search query in a first natural language; identify, based on a content of the search query, one or more documents in the first natural language; identify one or more documents in a second natural language that contain an anchor link that links to the one or more documents in the first natural language, the second natural language being different than the first natural language; analyze content included in the one or more documents in the second natural language; and translate, based on analyzing the content included in the one or more documents in the second natural language, one or more terms of the search query into the second natural language. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
15. A non-transitory computer-readable medium containing instructions for execution by one or more processors, the instructions comprising:
one or more instructions that, when executed by the one or more processors, cause the one or more processors to; receive a search query in a first natural language; identify, based on a content of the search query, one or more documents in the first natural language; identify one or more documents in a second natural language based on an anchor link, contained in the one or more documents in the first natural language, that links the one or more documents in the first natural language to the one or more documents in the second natural language, the second natural language being different than the first natural language; analyze content included in the one or more documents in the second natural language; and translate, based on analyzing the content included in the one or more documents in the second natural language, one or more terms of the search query into the second natural language. - View Dependent Claims (16, 17, 18, 19, 20)
Specification