Method and system for detecting user intentions in retrieval of hint sentences
First Claim
1. A method of constructing a confusion set database for use in detecting user query intentions in the retrieval of sentences from a sentence database in response to the query, the method comprising:
- a memory coupled to a processor;
obtaining a bilingual database having word pairs in first and second languages, wherein each of a plurality of word pairs in the second language in the bilingual database is aligned with a corresponding word pair in the first language in the bilingual database, the aligned word pair in the first language in the bilingual database being a correct translation word pair in the first language for the corresponding word pair in the second language;
using the processor to obtain, separately from the bilingual database, human translation word pairs in the first language corresponding to each of the plurality of word pairs in the second language in the bilingual database;
using the processor to align each human translation word pair in the first language with the correct translation word pair in the first language, both corresponding to the same word pair in the second language, to define first language set pairs in the confusion set database; and
using the processor to remove from the confusion set database first language set pairs in which the human translation word pair in the first language and the correct translation word pair in the first language, together defining the first language set pair, are the same and wherein removing from the confusion set database first language set pairs in which the human translation word pair in the first language and the correct translation word pair in the first language are the same comprises removing both the human translation word pair in tile first language and the corresponding correct translation word pair in the first language which together define the first language set pair.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of constructing a confusion set database for use in detecting user query intentions includes obtaining a bilingual database having aligned word pairs in first and second languages. Second language word pairs in the bilingual database are aligned with corresponding correct translation first language word pairs. First language human translation word pairs corresponding to each of the second language word pairs in the bilingual database are obtained. Each first language human translation word pair for a particular second language word pair in the bilingual database is aligned with the correct translation first language word pair to define first language set pairs in the confusion set database. Methods, systems and computer readable medium for constructing the confusion set database and for retrieving sentences using the confusion set database are also disclosed.
-
Citations
13 Claims
-
1. A method of constructing a confusion set database for use in detecting user query intentions in the retrieval of sentences from a sentence database in response to the query, the method comprising:
-
a memory coupled to a processor; obtaining a bilingual database having word pairs in first and second languages, wherein each of a plurality of word pairs in the second language in the bilingual database is aligned with a corresponding word pair in the first language in the bilingual database, the aligned word pair in the first language in the bilingual database being a correct translation word pair in the first language for the corresponding word pair in the second language; using the processor to obtain, separately from the bilingual database, human translation word pairs in the first language corresponding to each of the plurality of word pairs in the second language in the bilingual database; using the processor to align each human translation word pair in the first language with the correct translation word pair in the first language, both corresponding to the same word pair in the second language, to define first language set pairs in the confusion set database; and using the processor to remove from the confusion set database first language set pairs in which the human translation word pair in the first language and the correct translation word pair in the first language, together defining the first language set pair, are the same and wherein removing from the confusion set database first language set pairs in which the human translation word pair in the first language and the correct translation word pair in the first language are the same comprises removing both the human translation word pair in tile first language and the corresponding correct translation word pair in the first language which together define the first language set pair. - View Dependent Claims (2, 3, 4)
-
-
5. A tangible computer-storage medium having computer-executable instructions for performing steps comprising:
-
using a processor to obtain a bilingual database having word pairs in first and second languages, wherein each of a plurality of word pairs in the second language in the bilingual database is aligned with a corresponding word pair in the first language in the bilingual database, the aligned word pair in the first language in the bilingual database being a correct translation word pair in the first language for the corresponding word pair in the second language; using the processor to obtain human translation word pairs in the first language corresponding to each of the plurality of word pairs in the second language in the bilingual database; using the processor to align each human translation word pair in the first language with the correct translation word pair in the first language, both corresponding to the same word pair in the second language, to define first language set pairs in a confusion set database for use in detecting user query intentions in the retrieval of sentences from a sentence database in response to the query; and using the processor to remove from the confusion set database first language set pairs in which the human translation word pair in the first language and the correct translation word pair in the first language, together defining the first language set pair, are the same, and wherein removing from the confusion set database first language set pairs in which the human translation word pair in the first language and the correct translation word pair in the first language are the same comprises removing both the human translation word pair in the first language and the corresponding correct translation word pair in the first language which together define the first language set pair. - View Dependent Claims (6, 11, 12, 13)
-
-
7. A method of constructing a confusion set database for use in detecting user query intentions in the retrieval of sentences from a sentence database in response to the query, the method comprising:
-
a memory coupled to a processor; using the processor to obtain a bilingual database having word pairs in first and second languages, wherein each of a plurality of word pairs in the second language in the bilingual database is aligned with a corresponding word pair in the first language in the bilingual database, the aligned word pair in the first language in the bilingual database being a correct translation word pair in the first language for the corresponding word pair in the second language; using the processor to obtain, separately from the bilingual database, human translation word pairs in the first language corresponding to each of the plurality of word pairs in the second language in the bilingual database; and using the processor to align each human translation word pair in the first language for a particular second language word pair in the bilingual database with the correct translation word pair also aligned with the particular second language word pair in the bilingual database to define first language set pairs in the confusion set database, each first language set pair comprising an aligned human translation word pair and correct translation word pair both in the first language and both corresponding to the same second language word pair, wherein aligning further comprises; determining which first language set pairs have a human translation word pair in the first language which differs from its corresponding correct translation first word pair in the first language; and
inserting into the confusion set database only those first language set pairs in which the human translation word pair in the first language differs from its corresponding correct translation word pair in the first language. - View Dependent Claims (8, 9, 10)
-
Specification