Method and system for searching a multi-lingual database
First Claim
1. A method of creating and searching a database of multi-lingual documents, the method comprising the steps of:
- using a keyword dictionary to extract keywords;
translating the keywords into the languages supported by a bi-directional synonym listing;
creating a keyword listing in each supported language;
using the keywords in each supported language to create more than one inverted index of the documents including the keywords, one inverted index being in a first language and another inverted index being in a second language;
retrieving documents in a search mode comprising the steps of translating the search terms using the same bi-directional synonym listing, generating a translated query, submitting parallel inquiries to first and second different inverted indices, receiving first and second results of the search using the first and second inverted indices and selecting one of the first and second results for use.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method of searching a database in which documents of different languages are included. The system includes a synonym or keyword dictionary which is bidirectional and allows for translation of keywords between a first language and other languages. The translated words keywords for the document are stored in an inverted index which is then used for searching, either in a selected language, a second language or in all languages, as determined by the user. This use of multiple searching and a translated synonym dictionary avoids the need for translation of the entire document and avoids inaccuracies which may result from translations.
-
Citations
17 Claims
-
1. A method of creating and searching a database of multi-lingual documents, the method comprising the steps of:
-
using a keyword dictionary to extract keywords;
translating the keywords into the languages supported by a bi-directional synonym listing;
creating a keyword listing in each supported language;
using the keywords in each supported language to create more than one inverted index of the documents including the keywords, one inverted index being in a first language and another inverted index being in a second language;
retrieving documents in a search mode comprising the steps of translating the search terms using the same bi-directional synonym listing, generating a translated query, submitting parallel inquiries to first and second different inverted indices, receiving first and second results of the search using the first and second inverted indices and selecting one of the first and second results for use. - View Dependent Claims (2, 3)
-
-
4. A method of searching a database including documents written in more than one language, the steps of the method comprising:
-
identifying a search in one language;
converting the search into another language;
searching the database in both the one language and the another language and generating a first list of results associated with the one language and a second list of results from the another language; and
providing the first and second lists of search results as output. - View Dependent Claims (5, 6, 7)
-
-
8. A system for searching a database based on an input search, the system comprising:
-
a synonym list for keywords which includes corresponding words in at least a first and a second language;
an inverted index for the database including keywords and synonyms for keywords based on the synonym list;
an indicator of the language associated with the input search along with a selection of whether documents are desired in the natural language or in all languages;
a search engine responsive to the natural language indication and the selection of documents, said search engine using the inverted index to determine a first set of results which include the documents which correspond to the input search in the input language and a second set of results which correspond to the documents which include documents which are not in the input language. - View Dependent Claims (9, 10)
-
-
15. A database searching program stored on a medium comprising:
-
a first module which includes a bi-directional synonym listing of terms;
a second module which receives text and selects one or more keywords;
a third module which receives the one or more keywords and translates the one or more keywords into another language;
a fourth module which searches a database using the keywords in one language and the translations of keywords using the translations of the third module, said fourth module creating a first set of results using a first language and a second set of results which includes documents in a language other than the first language; and
a fifth module which receives an indication from the user as to whether results in the first language or results including documents in a second language are desired - View Dependent Claims (16, 17)
-
Specification