SEMANTIC REFINING OF CROSS-LINGUAL INFORMATION RETRIEVAL RESULTS
First Claim
1. A method for cross language information retrieval comprising:
- receiving an input query which includes at least one word in a source language;
translating the input query from the source language to a target language to provide a set of translated queries;
retrieving documents from a document collection based on the translated queries;
translating at least a part of the retrieved documents into the source language to generate a set of translated documents;
assessing an entailment relationship between each of the translated documents and the input query;
refining the set of translated documents based on the assessment of the entailment relationship; and
outputting at least a subset of the refined set of translated documents or the target documents to which the translated documents in the subset correspond;
wherein at least one of the translating the input query, retrieving documents, translating the retrieved documents, assessing the entailment relationship, and refining the set of translated documents is performed with a computer processor.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for cross language information retrieval includes receiving an input query which includes at least one word in a source language and translating the input query from the source language to a target language to provide a set of translated queries. A set of documents is retrieved from a document collection based on the translated queries. The retrieved documents are translated back into the source language to generate a set of translated documents. An entailment relationship between each of the translated documents and the input query is assessed. The set of translated documents is refined, based on the assessment of the entailment relationship. A subset (or all) of the refined set of translated documents, and/or the target documents to which the translated documents in the subset correspond, is output.
-
Citations
20 Claims
-
1. A method for cross language information retrieval comprising:
-
receiving an input query which includes at least one word in a source language; translating the input query from the source language to a target language to provide a set of translated queries; retrieving documents from a document collection based on the translated queries; translating at least a part of the retrieved documents into the source language to generate a set of translated documents; assessing an entailment relationship between each of the translated documents and the input query; refining the set of translated documents based on the assessment of the entailment relationship; and outputting at least a subset of the refined set of translated documents or the target documents to which the translated documents in the subset correspond; wherein at least one of the translating the input query, retrieving documents, translating the retrieved documents, assessing the entailment relationship, and refining the set of translated documents is performed with a computer processor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A system for cross language information retrieval comprising:
-
a first machine translation component for translating an input query from a source language to a target language to provide a set of translated queries; a retrieval component for retrieving documents from an associated document collection based on the translated queries; a second machine translation component for translating the retrieved documents into the source language to generate a set of translated documents; an entailment component for assessing an entailment relationship between each of the translated documents and the input query; a refinement component for refining the set of translated documents based on the assessment of the entailment relationship; and a processor which implements the first and second machine translation components, retrieval component, entailment component, and refinement component. - View Dependent Claims (17, 18)
-
-
19. A method for cross language information retrieval comprising:
-
receiving an input query which includes at least one word in a source language; translating the input query from the source language to a target language to provide a set of translated queries; retrieving documents from a document collection based on the translated queries; optionally, translating the retrieved documents into the source language to generate a set of translated documents; assessing an entailment relationship between each of the translated documents and the input query or between each of the untranslated retrieved documents and the input query; refining the set of translated or untranslated documents based on the assessment of the entailment relationship, the refining comprising at least one of; retaining only those documents for which an entailment relationship is found; and ranking the documents based on an entailment score; and providing for a user to review documents in the refined set of translated documents or corresponding untranslated documents, wherein at least one of the translating the input query, retrieving documents, assessing the entailment relationship, and refining the set of translated documents is performed with a computer processor. - View Dependent Claims (20)
-
Specification