Method and system for searching a multi-lingual database
Abstract
A system and method of searching a database in which documents of different languages are included. The system includes a synonym or keyword dictionary which is bi-directional and allows for translation of keywords between a first language and other languages. The translated keywords for the document are stored in an inverted index which is then used for searching, either in a selected language, in a second language, or in all languages, as determined by the user. This use of multiple searching and a translated synonym dictionary avoids the need for translation of the entire document and avoids inaccuracies which may result from translations.
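The indexing step described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the two-language dictionary `PAIRS`, the sample documents `DOCS`, the whitespace tokenizer, and all function names are assumptions introduced here.

```python
from collections import defaultdict

# Hypothetical English<->French keyword pairs; a real dictionary would be far larger.
PAIRS = [("dog", "chien"), ("house", "maison"), ("red", "rouge")]


def build_dictionary(pairs):
    """Build a bi-directional lookup: (source_lang, word) -> {target_lang: word}."""
    table = {}
    for en, fr in pairs:
        table[("en", en)] = {"fr": fr}
        table[("fr", fr)] = {"en": en}
    return table


def build_index(docs, table):
    """Return one inverted index per language: index[lang][keyword] -> set of doc ids."""
    index = defaultdict(lambda: defaultdict(set))
    for doc_id, (lang, text) in docs.items():
        for word in text.lower().split():
            if (lang, word) not in table:
                continue  # assumption: only dictionary words count as keywords
            # Index the keyword in the document's own language.
            index[lang][word].add(doc_id)
            # Index each translation of the keyword in the other language(s).
            for target_lang, translated in table[(lang, word)].items():
                index[target_lang][translated].add(doc_id)
    return index


# Hypothetical documents: doc id -> (language, text).
DOCS = {
    1: ("en", "The red house"),
    2: ("fr", "Le chien rouge"),
}
INDEX = build_index(DOCS, build_dictionary(PAIRS))
```

In this sketch, looking up `red` in the English index finds both the English and the French document, because only the keywords were translated and neither document was translated in full.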
12 Claims
1. A system for searching documents written in a plurality of languages, the system comprising at least one computer including:
a system for identifying keywords from each of the plurality of documents;
a system for translating each identified keyword into each of the plurality of languages;
a system for creating an index in each of the plurality of languages;
a system for receiving a first query that includes at least one keyword in a first language;
a system for generating a second query by translating the at least one keyword into a second language;
a system for applying the first query against documents written in both the first language and the second language;
a system for applying the second query against documents written in the second language;
a system for generating a first set of results that includes each document written in the first language that matches the first query; and
a system for generating a second set of results that includes each document written in the second language that matches at least one of the first query or the second query.
Dependent claims: 2, 3, 4
5. A document searching program stored on a tangible computer-useable medium, wherein the documents are written in a plurality of languages, the program comprising:
a module for identifying keywords from each of the plurality of documents;
a module for translating each identified keyword into each of the plurality of languages;
a module for creating an index in each of the plurality of languages;
a module for receiving a first query that includes at least one keyword in a first language;
a module for generating a second query by translating the at least one keyword into a second language;
a module for applying the first query against documents written in both the first language and the second language;
a module for applying the second query against documents written in the second language;
a module for generating a first set of results that includes each document written in the first language that matches the first query; and
a module for generating a second set of results that includes each document written in the second language that matches at least one of the first query or the second query.
Dependent claims: 6, 7, 8
9. A computer system comprising:
a set of computing devices for searching documents written in a plurality of languages by performing a method comprising:
identifying keywords from each of the plurality of documents;
translating each identified keyword into each of the plurality of languages;
creating an index in each of the plurality of languages;
receiving a first query that includes at least one keyword in a first language;
generating a second query by translating the at least one keyword into a second language;
applying the first query against documents written in both the first language and the second language;
applying the second query against documents written in the second language;
generating a first set of results that includes each document written in the first language that matches the first query; and
generating a second set of results that includes each document written in the second language that matches at least one of the first query or the second query.
Dependent claims: 10, 11, 12
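The query steps recited in claims 1, 5, and 9 can be read as the following sketch. It is one possible interpretation of the claim language, not the patentee's implementation. The per-language indexes `INDEX`, the language tags `DOC_LANG`, the translation table `TRANSLATE`, and the function `search` are hypothetical names and data introduced for illustration.

```python
# Hypothetical language tag for each document id.
DOC_LANG = {1: "en", 2: "fr", 3: "fr"}

# Hypothetical per-language indexes (keyword -> doc ids). Each index also holds
# translated keywords of documents written in the other language. Document 3 is
# French and was indexed in English under "hound", not "dog".
INDEX = {
    "en": {"dog": {1, 2}, "hound": {3}},
    "fr": {"chien": {1, 2, 3}},
}

# Hypothetical keyword translation table for (source, target) language pairs.
TRANSLATE = {("en", "fr"): {"dog": "chien"}}


def search(keyword, first_lang, second_lang):
    """Return (first_results, second_results) for a single-keyword query."""
    first_query = keyword
    # Generate the second query by translating the keyword.
    second_query = TRANSLATE[(first_lang, second_lang)].get(keyword)

    # Apply the first query; the first-language index covers documents
    # written in both languages because it stores translated keywords.
    first_hits = INDEX[first_lang].get(first_query, set())

    # Apply the second query against second-language documents only.
    second_hits = {
        d for d in INDEX[second_lang].get(second_query, set())
        if DOC_LANG[d] == second_lang
    }

    # First result set: first-language documents matching the first query.
    first_results = {d for d in first_hits if DOC_LANG[d] == first_lang}
    # Second result set: second-language documents matching either query.
    second_results = {d for d in first_hits if DOC_LANG[d] == second_lang} | second_hits
    return first_results, second_results
```

With this sample data, `search("dog", "en", "fr")` places document 1 in the first result set and documents 2 and 3 in the second. Document 3 is found only through the translated query, which shows why the second result set accepts a match on either query.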
Specification