Cross-lingual indexing and information retrieval
Abstract
Systems and methods are disclosed for searching across multi-lingual information. A user makes a query in a first language, and a group of documents that were previously machine-translated into the first language is searched for information responsive to the query. Contextual information can be used to improve the accuracy of the machine translation. Responsive documents are returned to the user. Alternatively, a query provided in a user's language may be translated into one or more other languages. Documents written in these languages can then be searched for information responsive to the appropriate translated query. Responsive documents can be translated into the user's language prior to providing them to the user.
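The two search modes described in the abstract can be illustrated with a minimal sketch. It is not the disclosed implementation: the word-by-word `translate` function, the `ES_TO_EN` lexicon, the sample documents, and the all-words-must-match `search` rule are hypothetical stand-ins for a real machine-translation system and search engine.

```python
from collections import defaultdict

# Hypothetical toy lexicon standing in for a machine-translation system.
ES_TO_EN = {"gato": "cat", "negro": "black", "perro": "dog", "casa": "house"}
EN_TO_ES = {en: es for es, en in ES_TO_EN.items()}


def translate(text, lexicon):
    """Word-by-word stand-in for machine translation; unknown words pass through."""
    return " ".join(lexicon.get(w, w) for w in text.lower().split())


def build_index(docs):
    """Map each word to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents containing every query word (an assumed matching rule)."""
    words = query.lower().split()
    if not words:
        return set()
    hits = [index.get(w, set()) for w in words]
    return set.intersection(*hits)


spanish_docs = {"es1": "gato negro", "es2": "perro casa"}

# Mode 1: documents are translated ahead of time, then searched in the user's language.
pretranslated = {d: translate(t, ES_TO_EN) for d, t in spanish_docs.items()}
mode1_hits = search(build_index(pretranslated), "black cat")

# Mode 2: the query is translated, the original documents are searched,
# and responsive documents are translated back for the user.
native_index = build_index(spanish_docs)
mode2_ids = search(native_index, translate("black cat", EN_TO_ES))
mode2_results = {d: translate(spanish_docs[d], ES_TO_EN) for d in mode2_ids}
```

In this sketch both modes find the same document. The difference is where the translation cost falls: the first mode translates every document once at indexing time, while the second translates only the query and the responsive documents at query time.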
21 Claims
1. A method implemented by one or more computers, the method comprising:
- translating anchor text of a collection of documents to a target language using a context-specific translation model, wherein one or more of the documents in the collection of documents include anchor text for links that point to one or more documents outside of the collection of documents, wherein the one or more documents outside the collection of documents are in the target language, and wherein anchor text in the one or more of the documents in the collection of documents is translated by the context-specific translation model using, at least in part, the one or more documents outside of the collection of documents that are linked to the respective anchor text;
- indexing the translated anchor text to create one or more anchor text indexes;
- indexing the one or more of the documents outside of the collection of documents that are in the target language to create one or more document indexes; and
- providing the anchor text indexes and the document indexes for use in responding to queries.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A system comprising:
one or more computers programmed to perform operations comprising:
- translating anchor text of a collection of documents to a target language using a context-specific translation model, wherein one or more of the documents in the collection of documents include anchor text for links that point to one or more documents outside of the collection of documents, wherein the one or more documents outside the collection of documents are in the target language, and wherein anchor text in the one or more of the documents in the collection of documents is translated by the context-specific translation model using, at least in part, the one or more documents outside of the collection of documents that are linked to the respective anchor text;
- indexing the translated anchor text to create one or more anchor text indexes;
- indexing the one or more of the documents outside of the collection of documents that are in the target language to create one or more document indexes; and
- providing the anchor text indexes and the document indexes for use in responding to queries.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A non-transitory computer readable media encoded with instructions that are operable, when executed by a computer, to cause the computer to perform operations comprising:
- translating anchor text of a collection of documents to a target language using a context-specific translation model, wherein one or more of the documents in the collection of documents include anchor text for links that point to one or more documents outside of the collection of documents, wherein the one or more documents outside the collection of documents are in the target language, and wherein anchor text in the one or more of the documents in the collection of documents is translated by the context-specific translation model using, at least in part, the one or more documents outside of the collection of documents that are linked to the respective anchor text;
- indexing the translated anchor text to create one or more anchor text indexes;
- indexing the one or more of the documents outside of the collection of documents that are in the target language to create one or more document indexes; and
- providing the anchor text indexes and the document indexes for use in responding to queries.
Dependent claims: 16, 17, 18, 19, 20, 21
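The four operations recited in the independent claims (translating anchor text with help from the linked target-language document, indexing the translated anchor text, indexing the linked documents, and providing both indexes for queries) can be sketched as follows. This is an illustration under stated assumptions, not the claimed context-specific translation model: the `CANDIDATES` table, the count-based disambiguation rule, the sample documents, the choice to key anchor text by the id of the linked document, and the any-word `respond` rule are all hypothetical.

```python
from collections import defaultdict

# Hypothetical candidate translations (source word -> possible target words).
CANDIDATES = {"banco": ["bank", "bench"], "de": ["of"], "madera": ["wood", "timber"]}


def tokens(text):
    return text.lower().split()


def translate_anchor(anchor_text, linked_doc_text):
    """Pick, per word, the candidate that occurs most often in the linked document."""
    context = tokens(linked_doc_text)
    out = []
    for word in tokens(anchor_text):
        options = CANDIDATES.get(word, [word])
        # max() keeps the first option on ties, so list order acts as a default.
        out.append(max(options, key=context.count))
    return " ".join(out)


def build_index(entries):
    """Build an inverted index from (doc_id, text) pairs."""
    index = defaultdict(set)
    for doc_id, text in entries:
        for word in tokens(text):
            index[word].add(doc_id)
    return index


# Target-language documents outside the collection (hypothetical data).
outside_docs = {
    "en1": "the bank offers savings accounts",
    "en2": "a bench made of wood in the park",
}
# (anchor text found in the collection, id of the linked outside document)
links = [("banco", "en1"), ("banco de madera", "en2")]

# Step 1: translate each anchor text using its linked document as context.
translated = [(target, translate_anchor(anchor, outside_docs[target]))
              for anchor, target in links]
# Steps 2 and 3: build the anchor text index and the document index.
anchor_index = build_index(translated)
document_index = build_index(outside_docs.items())


def respond(query):
    """Step 4: return ids matching any query word in either index."""
    hits = set()
    for word in tokens(query):
        hits |= anchor_index.get(word, set()) | document_index.get(word, set())
    return hits
```

Here the same source word, "banco", is rendered as "bank" for one link and "bench" for the other, because the linked target-language document supplies the disambiguating context.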
Specification