Method and system for searching a multi-lingual database
First Claim
1. A computerized method of searching documents written in a plurality of languages, the method comprising:
- identifying keywords from each of the plurality of documents;
translating each identified keyword into each of the plurality of languages;
creating an index in each of the plurality of languages;
receiving a first query that includes at least one keyword in a first language;
generating a second query by translating the at least one keyword into a second language;
applying the first query against documents written in both the first language and the second language;
applying the second query against documents written in the second language;
generating a first set of results that includes each document written in the first language that matches the first query; and
generating a second set of results that includes each document written in the second language that matches at least one of the first query or the second query.
0 Assignments
0 Petitions
Accused Products
Abstract
A system and method of searching a database in which documents of different languages are included. The system includes a synonym or keyword dictionary which is bi-directional and allows for translation of keywords between a first language and other languages. The translated words keywords for the document are stored in an inverted index which is then used for searching, either in a selected language, a second language or in all languages, as determined by the user. This use of multiple searching and a translated synonym dictionary avoids the need for translation of the entire document and avoids inaccuracies which may result from translations.
-
Citations
9 Claims
-
1. A computerized method of searching documents written in a plurality of languages, the method comprising:
-
identifying keywords from each of the plurality of documents; translating each identified keyword into each of the plurality of languages; creating an index in each of the plurality of languages; receiving a first query that includes at least one keyword in a first language; generating a second query by translating the at least one keyword into a second language; applying the first query against documents written in both the first language and the second language; applying the second query against documents written in the second language; generating a first set of results that includes each document written in the first language that matches the first query; and generating a second set of results that includes each document written in the second language that matches at least one of the first query or the second query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
Specification