Query transformation system and method enabling retrieval of multilingual web documents
First Claim
1. A query transformation system enabling retrieval of multilingual web documents comprising:
- a query input unit for imputing a query consisting of a source language, the query input unit comprising a keyboard;
an input query memory unit for storing the query, consisting of the source language, received from the query input unit;
a transformation control unit for controlling the entire query transformation operation of the system;
a translation generating/filtering unit for generating translations of the input query and filtering unnecessary ones of the generated translations;
a translation knowledge unit stored with at least one electronic dictionary to be used for a transformation of queries and a variety of information;
a transformed query memory unit for storing the query transformed from the source language into a target language; and
a result output unit for outputting the result of the transformation in the form of the target language on a screen;
wherein the translation generating/filtering unit comprises;
a translation generator for generating all possible translations of the source language input query by reference to a translation dictionary;
a semantic category verifier for receiving the generated translations from the translation generator, a eliminating translations having a low semantic similarity from the received translations, based on a semantic category tree; and
a collocation information verifier for receiving the translations, which includes no translation having a low semantic similarity, from the semantic category verifier, and eliminating translations having no collocation from the received translations, based on word collocation information.
3 Assignments
0 Petitions
Accused Products
Abstract
A query transformation system and method capable of not only solving an ambiguousness of words involved in the transformation of queries from one language to another language, but also executing its processing independently of the processing of an information retrieval system used, so that it can be applied to a variety of information retrieval systems, thereby enabling the information retrieval system used to function as a multilingual information retrieval system. The system includes a translation generator for generating all possible translations of an input query consisting of a source language by reference to a translation dictionary, a semantic category verifier for receiving the generated translations from the translation generator, and eliminating translations having a low semantic similarity from the received translations, based on a semantic category tree, and a collocation information verifier for receiving the translations, which includes no translation having a low semantic similarity, from the semantic category verifier, and eliminating translations having no collocation from the received translations, based on word collocation information.
-
Citations
7 Claims
-
1. A query transformation system enabling retrieval of multilingual web documents comprising:
-
a query input unit for imputing a query consisting of a source language, the query input unit comprising a keyboard; an input query memory unit for storing the query, consisting of the source language, received from the query input unit; a transformation control unit for controlling the entire query transformation operation of the system; a translation generating/filtering unit for generating translations of the input query and filtering unnecessary ones of the generated translations; a translation knowledge unit stored with at least one electronic dictionary to be used for a transformation of queries and a variety of information; a transformed query memory unit for storing the query transformed from the source language into a target language; and a result output unit for outputting the result of the transformation in the form of the target language on a screen; wherein the translation generating/filtering unit comprises; a translation generator for generating all possible translations of the source language input query by reference to a translation dictionary; a semantic category verifier for receiving the generated translations from the translation generator, a eliminating translations having a low semantic similarity from the received translations, based on a semantic category tree; and a collocation information verifier for receiving the translations, which includes no translation having a low semantic similarity, from the semantic category verifier, and eliminating translations having no collocation from the received translations, based on word collocation information. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A query transformation method enabling retrieval of multilingual web documents, comprising the step of:
-
generating a transformation start signal when a tool button on a screen associated with the start of a transformation is clicked; starting a query transformation in response to the transformation start signal; generating all possible translations of an input query from a user by reference to a translation dictionary; determining whether or not translations are generated; if there is no translation generated, informing the user of the fact that there is no translation generated, while if there are translations generated, executing a comparison processing for the generated translations, based on a semantic category tree, eliminating translations having a low semantic similarity, thereby eliminating unnecessary ones of the translations; analyzing a collocation of the resultant translations by reference to a collocation information dictionary, thereby eliminating unnecessary ones of the analyzed translations; determining whether or not there are translations left; and if there are translations left, outputting the translations left as a transformed query on the screen, while if there is no translation left, recovering the generated translations, and outputting the recovered translations as a transformed query on the screen.
-
Specification