Cross-lingual search re-ranking
Abstract
Cross-lingual search re-ranking is performed during a cross-lingual search in which a search query of a first language is used to retrieve two sets of documents, a first set in the first language, and a second set in a second language. The two sets of documents are each first ranked by the search engine separately. Cross-lingual search re-ranking then aims to provide a uniform re-ranking of both sets of documents combined. Cross-lingual search re-ranking uses a unified ranking function to compute the ranking order of each document of the first set and the second set of documents. The unified ranking function is constructed using generative probabilities based on multiple features, and can be learned by optimizing weight parameters using a training corpus. Ranking SVM algorithms may be used for the optimization.
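The learning step mentioned in the abstract can be illustrated with a minimal sketch. It assumes a linear unified ranking function, score(q, d) = w · f(q, d), and fits the weights w with a pairwise hinge loss by subgradient descent, which is one common way to approximate a Ranking SVM. The two-feature layout, the training pairs, the function names, and the hyperparameters are hypothetical and are not taken from the patent.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Inner product of two equal-length feature vectors."""
    return sum(x * y for x, y in zip(a, b))


def train_pairwise_hinge(
    pairs: List[Tuple[Vector, Vector]],
    dim: int,
    epochs: int = 200,
    lr: float = 0.1,
    reg: float = 0.01,
) -> List[float]:
    """Learn weights so that each 'better' document outscores its 'worse' one.

    Each pair is (features of preferred doc, features of less relevant doc).
    This is a plain subgradient method on the pairwise hinge loss with L2
    regularization, not a full SVM solver.
    """
    w = [0.0] * dim
    for _ in range(epochs):
        for better, worse in pairs:
            diff = [b - c for b, c in zip(better, worse)]
            margin = dot(w, diff)
            # L2 regularization: shrink the weights a little on every step.
            w = [wi * (1.0 - lr * reg) for wi in w]
            # Hinge loss is active when the preferred doc wins by less than 1.
            if margin < 1.0:
                w = [wi + lr * di for wi, di in zip(w, diff)]
    return w


# Hypothetical features: [monolingual relevance, cross-lingual generative prob.]
# Pairs may mix documents of both languages, since the ranking is unified.
training_pairs = [
    ([0.9, 0.2], [0.4, 0.1]),
    ([0.3, 0.8], [0.2, 0.3]),
    ([0.7, 0.6], [0.6, 0.1]),
]
weights = train_pairwise_hinge(training_pairs, dim=2)
print(weights)
```

The learned weights can then be used to score documents of either language on the same scale, which is what allows the two result sets to be ranked together.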
20 Claims
1. One or more computer readable media having stored thereupon a plurality of instructions that, when executed by a processor, cause the processor to perform the following actions:
- retrieving a first set of documents in a first language and a second set of documents in a second language via a cross-lingual search query that is based on a search query written in the first language;
- for each document of the first set and the second set of documents, computing a value of a unified ranking function that is a function of the search query and a corresponding document, the unified ranking function comprising a term descriptive of a cross-lingual query generative probability between the search query and the corresponding document, the cross-lingual query generative probability being a summation calculated based on a term descriptive of a probability of translating the search query from the first language to the second language and a term descriptive of a monolingual query generative probability of the search query being in the second language given an observation of a document in the second language; and
- ranking the first set of documents and the second set of documents together based on the values of the unified ranking function.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
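The summation recited in claim 1 can be sketched as follows. This is one possible reading, not the patent's implementation: it assumes a small set of candidate second-language translations of the query, each with a translation probability, and it stands in a unigram language model with additive smoothing for the monolingual query generative probability. The function names, the example translations, and the vocabulary size are hypothetical.

```python
from collections import Counter
from typing import Dict, List


def query_likelihood(
    query_terms: List[str],
    doc_terms: List[str],
    vocab_size: int,
    alpha: float = 1.0,
) -> float:
    """Monolingual P(query | document) under a smoothed unigram model."""
    counts = Counter(doc_terms)
    total = len(doc_terms)
    prob = 1.0
    for term in query_terms:
        prob *= (counts[term] + alpha) / (total + alpha * vocab_size)
    return prob


def cross_lingual_query_probability(
    translations: Dict[str, float],
    doc_terms: List[str],
    vocab_size: int,
) -> float:
    """Sum over candidate translations t of P(t | query) * P(t | document)."""
    return sum(
        p_translation * query_likelihood(candidate.split(), doc_terms, vocab_size)
        for candidate, p_translation in translations.items()
    )


# Hypothetical translation candidates for a first-language query.
translations = {"machine translation": 0.7, "machine interpreting": 0.3}
doc_a = "machine translation systems use statistical translation models".split()
doc_b = "weather forecast for the coming week".split()
VOCAB_SIZE = 20  # assumed vocabulary size for smoothing

score_a = cross_lingual_query_probability(translations, doc_a, VOCAB_SIZE)
score_b = cross_lingual_query_probability(translations, doc_b, VOCAB_SIZE)
print(score_a > score_b)
```

Because the score marginalizes over several translations, a document can still receive credit when only a lower-probability translation of the query matches it.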
15. A cross-lingual search ranking method, comprising:
- receiving a search query of a first language;
- retrieving a first set of documents in the first language based on the search query;
- retrieving a second set of documents in a second language based on the search query;
- for each document of the first set and the second set of documents, computing a value of a unified ranking function that is a function of the search query and the document, the unified ranking function comprising a cross-lingual generative probability that is a summation calculated by multiplication of a term descriptive of a probability of translating the search query of the first language to the second language with a term descriptive of a monolingual query generative probability of the search query being in the second language given an observation of a document that is in the second language from the second set of documents; and
- ranking the first set of documents and the second set of documents together based on the values of the unified ranking function.
Dependent claims: 16, 17
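The final step of the method, ranking both sets together, can be sketched as below. The sketch assumes each retrieved document already carries a generative probability (monolingual for first-language documents, the cross-lingual summation for second-language documents) and its position in the search engine's per-language result list. The `Candidate` type, the choice of features, the weights, and the sample values are hypothetical illustrations, not the patent's feature set.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class Candidate:
    doc_id: str
    language: str
    generative_prob: float  # P(query | doc), mono- or cross-lingual
    engine_rank: int        # 1-based position in the per-language list


def unified_score(c: Candidate, w_prob: float = 1.0, w_rank: float = 0.5) -> float:
    """Assumed linear combination of two features on a common scale."""
    return w_prob * math.log(c.generative_prob) + w_rank * (1.0 / c.engine_rank)


def rank_together(first_set: List[Candidate], second_set: List[Candidate]) -> List[Candidate]:
    """Merge both language-specific result sets into one ranked list."""
    return sorted(first_set + second_set, key=unified_score, reverse=True)


first_set = [Candidate("zh-1", "zh", 0.020, 1), Candidate("zh-2", "zh", 0.004, 2)]
second_set = [Candidate("en-1", "en", 0.012, 1), Candidate("en-2", "en", 0.001, 2)]

merged = rank_together(first_set, second_set)
print([c.doc_id for c in merged])  # ['zh-1', 'en-1', 'zh-2', 'en-2']
```

In this example the top second-language document is placed above the second first-language document, which a per-language ranking could not express.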
18. One or more computer readable media having stored thereupon a plurality of instructions that, when executed by a processor, causes the processor to perform the following actions:
- for each document of a first set and a second set of documents, computing a value of a unified ranking function that is a function of a search query and a corresponding document, the unified ranking function comprising a cross-lingual generative probability that is a summation calculated based on a first term descriptive of a probability of translating an observed document from the second set of documents that is in the second language to a first language and a second term descriptive of a monolingual query generative probability of the search query being in the first language given the observed document that is in the second language from the second set of documents; and
- ranking the first set of documents and the second set of documents together based on the values of the unified ranking function.
Dependent claims: 19, 20
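Claim 18 places the translation step on the document side rather than the query side. The two variants can be written side by side as follows. This is a hedged reading of the claim language with assumed notation: q_1 is the query in the first language, d_2 is a document in the second language, t_2 ranges over candidate second-language translations of the query, and t_1 ranges over candidate first-language translations of the document. Only claim 15 states that the two terms are multiplied; the product form shown for claims 1 and 18 is an assumption.

```latex
% Query-translation variant (claims 1 and 15)
P(q_1 \mid d_2) \approx \sum_{t_2} P(t_2 \mid q_1)\, P(t_2 \mid d_2)

% Document-translation variant (claim 18)
P(q_1 \mid d_2) \approx \sum_{t_1} P(t_1 \mid d_2)\, P(q_1 \mid t_1)
```

In both forms the first factor is a translation probability and the second is a monolingual query generative probability, so either can serve as the cross-lingual term of the unified ranking function.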
Specification