System and method for multi-lingual information retrieval
Abstract
Multi-lingual information retrieval methods and systems are described.
68 Claims
1. A system for searching a set of documents in one or more target languages when search terms in a source language different from said one or more target languages are supplied, said system comprising:
- a document base storing a plurality of target language documents with corresponding language codes and/or tags representing respective target languages;
- a translation engine configured to translate a search query in the source language into a plurality of search queries in said one or more target languages;
- a search part configured to search the plurality of target language documents in the document base using each of the plurality of target language search queries, to return a plurality of matched documents in the target language,
- wherein the translation engine translates the plurality of matched documents in the target language into corresponding translated documents in the source language, and
- wherein said translation engine translates a source language term to plurality of target language translation options, translates the plurality of target language translation options to respective source language translation results, and stores the collection of source language translation results in a synonym database as synonyms for the source language term.
Dependent claims: 2-16
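The flow recited in claim 1 can be illustrated with a small sketch. This is not the patented implementation: the two dictionaries are hypothetical stand-ins for a real translation engine, the sample documents are invented, and all function names are assumptions made for illustration. Dropping the original term from its own synonym set is also an assumption; the claim only says the round-trip results are stored as synonyms.

```python
from collections import defaultdict

# Hypothetical toy dictionaries standing in for a real translation engine.
EN_TO_FR = {"car": ["voiture", "automobile"], "fast": ["rapide"]}
FR_TO_EN = {
    "voiture": ["car", "carriage"],
    "automobile": ["automobile", "car"],
    "rapide": ["fast", "quick"],
}

# Document base: every document carries a language code, as in the claim.
DOCUMENTS = [
    {"id": 1, "lang": "fr", "text": "une voiture rapide"},
    {"id": 2, "lang": "fr", "text": "une automobile ancienne"},
    {"id": 3, "lang": "de", "text": "ein schnelles auto"},
]


def translate_query(term):
    """Return every target-language option for a source-language term."""
    return EN_TO_FR.get(term, [])


def search(queries, lang):
    """Return documents tagged `lang` that contain any query as a whole word."""
    hits = []
    for doc in DOCUMENTS:
        if doc["lang"] != lang:
            continue
        words = doc["text"].split()
        if any(query in words for query in queries):
            hits.append(doc)
    return hits


def translate_document(doc):
    """Word-by-word back-translation; unknown words are left unchanged."""
    return " ".join(FR_TO_EN.get(word, [word])[0] for word in doc["text"].split())


def build_synonyms(term, synonym_db):
    """Round trip: source term -> target options -> source results."""
    for option in translate_query(term):
        for back in FR_TO_EN.get(option, []):
            if back != term:  # assumption: a term is not its own synonym
                synonym_db[term].add(back)
    return synonym_db[term]


synonym_db = defaultdict(set)
queries = translate_query("car")
matched = search(queries, "fr")
translated = [translate_document(doc) for doc in matched]
synonyms = build_synonyms("car", synonym_db)
```

In this toy data the query "car" expands to two French queries, both French documents match, and the round trip yields "carriage" and "automobile" as stored synonyms.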
17. A multi-lingual information retrieval system comprising:
- a vote store configured to store for each target language term vote counts for respective source language translation options corresponding to the target language term; and
- a translation engine configured to translate a target language document to a translated document in the source language,
- wherein said translation engine translates a selected term in the target language document by retrieving from said vote store a source language translation option corresponding to the selected term, said retrieved translation option having a highest vote count amongst source language translation options corresponding to the selected target language term, and
- wherein when a first translation option of a selected term is replaced by a second translation option of the selected term as the option with the most votes, each translated document wherein said first translation option is used as a primary translation result for the selected term is dynamically changed to reflect the second translation option as the translation of the selected term.
Dependent claims: 18-28, 57, 58
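One way to picture the vote store of claim 17 is sketched below. The class and method names are hypothetical, and resolving each term at render time is only one possible reading of "dynamically changed"; an implementation could equally keep an index of affected documents and rewrite them when the leading option changes.

```python
from collections import defaultdict


class VoteStore:
    """Per target-language term, vote counts for each source-language option."""

    def __init__(self):
        # target term -> {source-language option -> vote count}
        self._votes = defaultdict(lambda: defaultdict(int))

    def vote(self, target_term, option, count=1):
        self._votes[target_term][option] += count

    def best(self, target_term):
        """Return the option with the highest vote count.

        Falls back to the untranslated term when no votes exist (assumption).
        """
        options = self._votes.get(target_term)
        if not options:
            return target_term
        return max(options, key=options.get)


class TranslatedDocument:
    """Keeps the target-language tokens and resolves them when rendered,
    so a change in the leading option shows up without re-translation."""

    def __init__(self, target_tokens, store):
        self.target_tokens = target_tokens
        self.store = store

    def render(self):
        return " ".join(self.store.best(token) for token in self.target_tokens)


store = VoteStore()
store.vote("voiture", "car", 3)
store.vote("voiture", "carriage", 2)
store.vote("rapide", "fast")

doc = TranslatedDocument(["voiture", "rapide"], store)
before = doc.render()

# Two more votes make "carriage" the option with the most votes.
store.vote("voiture", "carriage", 2)
after = doc.render()
```

Here `before` renders with "car" and `after` with "carriage", because the document looks up the current leader each time it is rendered.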
29. A multi-lingual information retrieval method performed by a multi-lingual information retrieval system for searching a set of documents in one or more target languages when search terms in a source language different from said one or more target languages are supplied, the multi-lingual information retrieval method comprising:
- maintaining, by the multi-lingual information retrieval system, a vote store to store for each target language term vote counts for respective source language translation options corresponding to the target language term;
- translating, by the multi-lingual information retrieval system, a selected term in a target language document by retrieving from said vote store a source language translation option corresponding to the selected term, wherein said retrieved translation option has a highest vote count amongst source language translation options corresponding to the selected target language term,
- when a first translation option is replaced by a second translation option as the option with the most votes for a term, dynamically changing, by the multi-lingual information retrieval system, each translated document wherein said first translation option is used as the primary translation result for the term.
Dependent claims: 30-40
41. A multi-lingual information retrieval method performed by a multi-lingual information retrieval system for searching a set of documents in one or more target languages when search terms in a source language different from said one or more target languages are supplied, the multi-lingual information retrieval method comprising:
- maintaining, by the multi-lingual information retrieval system, a vote store to store for each target language term vote counts for respective source language translation options corresponding to the target language term;
- translating, by the multi-lingual information retrieval system, a selected term in a target language document by retrieving from said vote store a source language translation option corresponding to the selected term, wherein said retrieved translation option has a highest vote count amongst source language translation options corresponding to the selected target term;
- translating, by the multi-lingual information retrieval system, a source language term to a plurality of target language translation options;
- translating, by the multi-lingual information retrieval system, the plurality of target language translation options to respective source language translation results; and
- storing, by the multi-lingual information retrieval system, the collection of source language translation results in a synonym database as synonyms for the source language term.
Dependent claims: 42
43. A method performed by a multi-lingual information retrieval system for searching a set of documents in a target language when search terms in a source language different from the target language are supplied, said method comprising:
- translating, by the multi-lingual information retrieval system, a search query in the source language into a plurality of search queries in the target language;
- searching, by the multi-lingual information retrieval system, a target language document set in the target language using each of the plurality of target language search queries, to return a plurality of target language documents;
- translating, by the multi-lingual information retrieval system, the plurality of target language documents into corresponding translated documents in the source language;
- searching, by the multi-lingual information retrieval system, a source language document set in the source language using each of a plurality of synonyms of the source language search terms saved in a synonym database, to return a plurality of source language documents;
- translating, by the multi-lingual information retrieval system, the plurality of the source language documents into corresponding translated documents in the target language,
- maintaining, by the multi-lingual information retrieval system, a vote store, wherein for each term having a plurality of translation options, each translation option is associated with a corresponding number of votes and the numbers of votes associated with the translation options are stored in the vote store; and
- selecting, by the multi-lingual information retrieval system, the translation option with the most votes for the term as the primary translation result, wherein remaining translation options for the term are represented in the translated document as one of meta-data or hidden field.
Dependent claims: 44-56, 59-64
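The final step of claim 43 (a primary translation result, with the remaining options carried as meta-data or a hidden field) might look like the sketch below. The HTML `data-` attributes are an assumed representation chosen for illustration; the claim does not specify a markup format, and the function names and vote counts are hypothetical.

```python
import html
import json


def rank_options(votes):
    """Order options by descending vote count; ties broken alphabetically."""
    return sorted(votes, key=lambda option: (-votes[option], option))


def render_term(target_term, votes):
    """Show the top-voted option; keep the rest as non-visible metadata."""
    ranked = rank_options(votes)
    primary, alternates = ranked[0], ranked[1:]
    # Alternates are serialized as JSON and escaped for use in an attribute.
    meta = html.escape(json.dumps(alternates), quote=True)
    source = html.escape(target_term, quote=True)
    return (
        f'<span data-source="{source}" data-alternates="{meta}">'
        f"{html.escape(primary)}</span>"
    )


votes_for_voiture = {"car": 5, "carriage": 2, "wagon": 2}
ranked = rank_options(votes_for_voiture)
rendered = render_term("voiture", votes_for_voiture)
```

Only the primary option appears as visible text; a client could read the `data-alternates` attribute to offer the other options, for example when collecting further votes.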
65. A method performed by a multi-lingual information retrieval system for searching a set of documents in a target language when search terms in a source language different from the target language are supplied, said method comprising:
- translating, by the multi-lingual information retrieval system, a search query in the source language into a plurality of search queries in the target language;
- searching, by the multi-lingual information retrieval system, a target language document set in the target language using each of the plurality of target language search queries to return a plurality of target language documents;
- translating, by the multi-lingual information retrieval system, the plurality of target language documents into corresponding translated documents in the source language,
- searching, by the multi-lingual information retrieval system, a source language document set in the source language using each of the plurality of synonyms of the source language search terms saved in a synonym database, to return a plurality of source language documents;
- translating, by the multi-lingual information retrieval system, the plurality of the source language documents into corresponding translated documents in the target language;
- translating, by the multi-lingual information retrieval system, a source language term to a plurality of target language translation options;
- translating, by the multi-lingual information retrieval system, the plurality of target language translation options to respective source language translation results; and
- storing, by the multi-lingual information retrieval system, a collection of the source language translation results in the synonym database as synonyms for the source language term.
Dependent claims: 66-68
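The two search passes recited in claim 65 can be sketched as follows: translated queries run against the target-language document set, while stored synonyms run against the source-language document set. All data and names here are hypothetical, and the translation of the returned documents into the other language is left out to keep the example short.

```python
# Hypothetical lookup tables; a real system would populate these from a
# translation engine and from the synonym database described in the claim.
TRANSLATIONS = {"car": ["voiture", "automobile"]}
SYNONYMS = {"car": ["automobile", "carriage"]}

TARGET_DOCS = {"fr-1": "une voiture rapide", "fr-2": "le train est en retard"}
SOURCE_DOCS = {"en-1": "a vintage automobile", "en-2": "the train is late"}


def search(docs, queries):
    """Return ids of documents containing any query as a whole word."""
    wanted = set(queries)
    return [doc_id for doc_id, text in docs.items() if wanted & set(text.split())]


def bidirectional_search(term):
    """Search the target set with translations and the source set with synonyms."""
    target_hits = search(TARGET_DOCS, TRANSLATIONS.get(term, []))
    source_hits = search(SOURCE_DOCS, SYNONYMS.get(term, []))
    return {"target": target_hits, "source": source_hits}


results = bidirectional_search("car")
```

With this sample data the French document mentioning "voiture" is found through query translation, and the English document mentioning "automobile" is found through the synonym pass even though it never contains the word "car".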
Specification