Word phrase translation using a phrase index
Abstract
A system and method are provided for translating an input text from a natural source language to a natural target language. The system stores a database that contains a plurality of pairs of text fragments with each pair including a text fragment in the source language and a corresponding text fragment in the target language. Each text fragment contains at least one word phrase and represents a primary grammatical unit such as a sentence or a clause. For translating a word phrase, the database is queried using a phrase index of the database, where the phrase index indexes text fragments by word phrases. Word phrases are noun phrases or verb phrases. Alternatively, word phrases are predicates involving at least one verb and one noun or adjective used as a noun. The system further comprises a phrase extractor for extracting a word phrase from a text fragment of an input text.
29 Claims
1. A method for translating a word phrase from a first natural language to a second natural language, said word phrase being a group of two or more associated words, the method comprising the steps of:
inputting a text written in the first language;
extracting said word phrase from said text;
querying a database for said extracted word phrase using a phrase index of said database;
said phrase index indexing text fragments by word phrases;
said text fragments representing a primary grammatical unit including at least one clause;
the database containing pairs of text fragments, each pair including a text fragment of the first language and a corresponding text fragment of the second language; and
obtaining a translation of said extracted word phrase based on one of the pairs of text fragments revealed during the step of querying the database.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
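The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the sample fragment pairs, the hand-written `PHRASE_INDEX`, and the function names are all hypothetical, and the phrase extractor is a naive stand-in that treats every pair of adjacent words as a candidate word phrase rather than doing any linguistic analysis.

```python
# Hypothetical database: pairs of (first-language, second-language) text fragments.
FRAGMENT_PAIRS = [
    ("the phrase index speeds up retrieval",
     "der Phrasenindex beschleunigt die Suche"),
    ("we rebuilt the phrase index last night",
     "wir haben den Phrasenindex gestern neu aufgebaut"),
    ("the database stores sentence pairs",
     "die Datenbank speichert Satzpaare"),
]

# Hypothetical phrase index: word phrase -> positions of the pairs containing it.
PHRASE_INDEX = {
    "phrase index": [0, 1],
    "sentence pairs": [2],
}


def extract_candidate_phrases(text):
    """Naive extractor (assumption): every two adjacent words form a candidate."""
    words = text.lower().split()
    return [" ".join(words[i:i + 2]) for i in range(len(words) - 1)]


def query(phrase):
    """Query the database through the phrase index, not by scanning fragments."""
    return [FRAGMENT_PAIRS[i] for i in PHRASE_INDEX.get(phrase, [])]


def translate_phrases(text):
    """Map each indexed phrase in the input text to the fragment pairs found."""
    results = {}
    for phrase in extract_candidate_phrases(text):
        pairs = query(phrase)
        if pairs:
            results[phrase] = pairs
    return results


if __name__ == "__main__":
    for phrase, pairs in translate_phrases("Please update the phrase index").items():
        print(phrase)
        for source, target in pairs:
            print("  ", source, "->", target)
```

In this sketch the translation of the phrase is read off by a person from the target-language fragment of one of the returned pairs; the sketch does not attempt to locate the translated phrase inside that fragment.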
13. A computer-readable storage medium storing instructions for translating a word phrase from a first natural language to a second natural language by performing the steps of:
inputting a text written in the first language;
extracting said word phrase from said text, said word phrase being a group of two or more associated words;
querying a database for said extracted word phrase using a phrase index of said database;
said phrase index indexing text fragments by word phrases;
said text fragments representing a primary grammatical unit including at least one clause;
the database containing pairs of text fragments, each pair including a text fragment of the first language and a corresponding text fragment of the second language; and
obtaining a translation of said extracted word phrase based on one of the pairs of text fragments revealed during the step of querying the database.
14. A system for translating an input text from a natural source language to a natural target language, the system comprising:
storage means for storing a database containing a plurality of pairs of text fragments;
said text fragments representing a primary grammatical unit including at least one clause;
each pair including a text fragment in the source language and a corresponding text fragment in the target language, each text fragment containing at least one word phrase, said word phrase being a group of two or more associated words;
a phrase extractor for extracting a word phrase from a text fragment of said input text;
database retrieval means for retrieving, from said database, pairs of text fragments that contain the extracted word phrase, using a phrase index of said database, said phrase index indexing text fragments by word phrases; and
user interface means for allowing a user to select one of said retrieved pairs of text fragments to obtain a translation of the extracted word phrase.
Dependent claims: 15, 16, 17, 18, 19, 20, 21, 22
23. A method for generating a text fragment database for use in translating a word phrase from a first natural language into a second natural language, said word phrase being a group of two or more associated words, the method comprising the steps of:
inputting a first document containing a text written in the first language;
inputting a second document containing said text written in the second language;
aligning corresponding text fragments of the first and second documents;
said text fragments representing a primary grammatical unit including at least one clause;
extracting word phrases from the text fragments of the first document; and
generating index information on the extracted word phrases and the aligned text fragments holding the word phrases, to generate a phrase index indexing text fragments by word phrases.
Dependent claims: 24, 25, 26, 27, 28, 29
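The database-generation steps of claim 23 can be sketched in the same spirit. The code below rests on simplifying assumptions that the claim does not make: fragments are split at sentence-final punctuation, alignment is strictly one-to-one by position, and "word phrases" are approximated by adjacent word pairs. All function names are hypothetical.

```python
import re
from collections import defaultdict


def split_fragments(document):
    """Split a document into sentence-like fragments after '.', '!' or '?'."""
    parts = re.split(r"(?<=[.!?])\s+", document.strip())
    return [p for p in parts if p]


def align(first_doc, second_doc):
    """Positional alignment (assumption): the n-th fragments correspond."""
    first = split_fragments(first_doc)
    second = split_fragments(second_doc)
    if len(first) != len(second):
        raise ValueError("positional alignment needs equal fragment counts")
    return list(zip(first, second))


def word_bigrams(fragment):
    """Stand-in phrase extractor (assumption): adjacent word pairs."""
    words = re.findall(r"\w+", fragment.lower())
    return {" ".join(words[i:i + 2]) for i in range(len(words) - 1)}


def generate_phrase_index(first_doc, second_doc):
    """Return the aligned pairs and an index: phrase -> sorted fragment ids."""
    aligned = align(first_doc, second_doc)
    index = defaultdict(set)
    for fragment_id, (source, _target) in enumerate(aligned):
        for phrase in word_bigrams(source):
            index[phrase].add(fragment_id)
    return aligned, {phrase: sorted(ids) for phrase, ids in index.items()}


if __name__ == "__main__":
    pairs, index = generate_phrase_index(
        "The cat sleeps. The cat eats fish.",
        "Die Katze ruht. Die Katze frisst Fisch.",
    )
    print(index["the cat"])    # ids of the fragments containing "the cat"
    print(pairs[index["eats fish"][0]])
```

Real parallel documents rarely align one-to-one by position, so a practical system would replace `align` with a proper sentence-alignment step; the index structure itself would stay the same.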
Specification