System for Multilingual Machine Translation from English to Hindi and Other Indian Languages Using Pseudo-Interlingua and Hybridized Approach
Abstract
The present invention relates to a method and system for translating a source language into a target language, comprising the steps of: identifying the nature of text extracted from a source document; filtering and storing the text formatting and structure information of the extracted text; selecting an appropriate text translation engine based on the nature of the extracted text; using the text translation engine for analysing and translating the extracted text into an unformatted translated text; and using the stored text formatting and structure information to process the unformatted text for obtaining a structured translated text document in the target language.
77 Citations
60 Claims
1-40. (canceled)
41. A method for translating a source language into a target language comprising the steps of:
identifying the nature of text extracted from a source document;
filtering and storing the text formatting and structure information of the extracted text;
selecting an appropriate text translation engine based on the nature of the extracted text;
using the text translation engine for analyzing and translating the extracted text into an unformatted translated text; and
using the stored text formatting and structure information to process the unformatted text for obtaining a structured translated text document in the target language.
Dependent claims: 42-52.
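The five steps of claim 41 can be pictured with a minimal Python sketch. Everything named here is an assumption made for illustration and does not come from the patent: the input is taken to be HTML-like markup, the "nature" of the text is guessed from a tiny keyword set (`TECH_WORDS`), and the two engines (`general_engine`, `technical_engine`) are placeholders that only tag their input instead of translating it.

```python
import re

# Hypothetical placeholder engines: a real engine would return Hindi text.
def general_engine(text):
    return "HI(" + text + ")"

def technical_engine(text):
    return "HI-TECH(" + text + ")"

ENGINES = {"general": general_engine, "technical": technical_engine}

# Assumed keyword list used to guess the nature of the text.
TECH_WORDS = {"server", "database", "compiler"}

# Assumed formatting model: anything between '<' and '>' is markup.
TAG = re.compile(r"(<[^>]+>)")

def identify_nature(text):
    """Step 1: classify the extracted text (toy keyword heuristic)."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    return "technical" if words & TECH_WORDS else "general"

def filter_formatting(document):
    """Step 2: separate markup from text and remember where each piece was.

    Returns a skeleton (markup strings and integer slots) plus the list
    of text segments that the integer slots refer to.
    """
    skeleton, segments = [], []
    for part in TAG.split(document):
        if not part:
            continue
        if TAG.fullmatch(part) or not part.strip():
            skeleton.append(part)           # markup or whitespace, kept verbatim
        else:
            skeleton.append(len(segments))  # slot pointing at a text segment
            segments.append(part)
    return skeleton, segments

def translate_document(document):
    skeleton, segments = filter_formatting(document)
    engine = ENGINES[identify_nature(" ".join(segments))]  # step 3
    translated = [engine(s) for s in segments]             # step 4
    # Step 5: pour the translated segments back into the stored structure.
    return "".join(translated[p] if isinstance(p, int) else p for p in skeleton)

print(translate_document("<p>The <b>server</b> is down.</p>"))
```

Translating each markup-delimited segment on its own is a simplification chosen to keep the sketch short; a fuller treatment would translate whole sentences and then map the formatting back onto the translated words.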
53. A system for translating a source language into a target language comprising:
means for identifying the nature of text extracted from a source document wherein the source document includes a language specific knowledge base;
means for filtering and storing the text formatting and structure information of the extracted text;
means for selecting an appropriate text translation engine based on the nature of the extracted text;
means for analyzing and translating the extracted text into an unformatted translated text, using text-specific translating engines, said translating and analyzing means further comprising:
means for identifying the sentence unit delimiter of the extracted text for breaking the text into separate sentences;
means for performing the lexical analysis on each word of the sentence; and
means for storing the analyzed words (lexicons) along with their properties in an online-lexical and phrasal database and storing the unknown lexicons in a separate database for increasing the translation speed, and maintaining a history of nouns for resolving pronoun reference ambiguity; and
means for using the stored text formatting and structure information to process the unformatted text for obtaining a structured translated text document in the target language;
optionally comprising editing means for performing post editing on the structured translated text document for improving the accuracy of the translation and its presentation style.
Dependent claims: 54-58.
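The analysis sub-steps recited in claim 53 (sentence splitting on a delimiter, per-word lexical analysis, a separate store for unknown words, and a noun history used for pronoun resolution) can be sketched as follows. This is an illustrative toy under stated assumptions, not the claimed implementation: the `LEXICON` entries, the gender-agreement rule, and the history size of 10 are all hypothetical.

```python
import re
from collections import deque

# Hypothetical toy lexicon: word -> (part of speech, gender).
LEXICON = {
    "ram": ("noun", "m"), "sita": ("noun", "f"), "book": ("noun", "n"),
    "he": ("pronoun", "m"), "she": ("pronoun", "f"), "it": ("pronoun", "n"),
    "reads": ("verb", None), "likes": ("verb", None), "a": ("det", None),
}

def split_sentences(text):
    """Break text after a sentence delimiter ('.', '?', '!' or the danda)."""
    return [s for s in re.split(r"(?<=[.?!।])\s+", text.strip()) if s]

def analyse(text):
    """Return resolved pronoun references and the set of unknown words."""
    unknown = set()                   # stands in for the separate unknown-lexicon database
    noun_history = deque(maxlen=10)   # most recent noun last; size is an assumption
    resolved = []                     # (pronoun, antecedent) pairs
    for sentence in split_sentences(text):
        for token in re.findall(r"\w+", sentence.lower()):
            entry = LEXICON.get(token)
            if entry is None:
                unknown.add(token)
                continue
            pos, gender = entry
            if pos == "noun":
                noun_history.append((token, gender))
            elif pos == "pronoun":
                # Assumed rule: pick the most recent noun that agrees in gender.
                antecedent = next(
                    (n for n, g in reversed(noun_history) if g == gender), None)
                resolved.append((token, antecedent))
    return resolved, unknown

resolved, unknown = analyse("Ram reads a book. He likes it. Sita arrived. She smiled.")
print(resolved)
print(sorted(unknown))
```

Keeping unknown words in their own collection means they can be reviewed and added to the lexicon later without rescanning the text, which is one plausible reading of the speed benefit the claim mentions.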
59. A computer program product comprising computer readable program code stored on computer readable storage medium embodied therein for translating a source language into a target language, comprising:
computer readable program code means configured for identifying the nature of text extracted from a source document;
computer readable program code means configured for filtering and storing the text formatting and structure information of the extracted text;
computer readable program code means configured for selecting an appropriate text translation engine based on the nature of the extracted text;
computer readable program code means configured for analyzing and translating the extracted text into an unformatted translated text;
computer readable program code means configured for using the stored text formatting and structure information to process the unformatted text for obtaining a structured translated text document in the target language;
computer readable program code means configured to expand the example-base interactively; and
computer readable program code means configured to derive abstracted examples from the raw examples.
Dependent claims: 60.
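The last two elements of claim 59 (expanding the example-base and deriving abstracted examples from raw examples) are not detailed in this section. One common reading in example-based translation is that a raw sentence pair is generalised into a template by replacing aligned words with category slots. The sketch below illustrates that reading only; the `BILINGUAL` dictionary, the `FRUIT` category, and the `ExampleBase` class are hypothetical.

```python
# Hypothetical bilingual dictionary: English word -> (Hindi word, category).
BILINGUAL = {"apple": ("सेब", "FRUIT"), "mango": ("आम", "FRUIT")}

def abstract_example(src, tgt):
    """Turn a raw example pair into a template with category slots."""
    src_tokens, tgt_tokens = src.lower().split(), tgt.split()
    for i, word in enumerate(src_tokens):
        if word in BILINGUAL:
            hindi, category = BILINGUAL[word]
            if hindi in tgt_tokens:
                slot = "<" + category + ">"
                src_tokens[i] = slot
                tgt_tokens[tgt_tokens.index(hindi)] = slot
    return " ".join(src_tokens), " ".join(tgt_tokens)

class ExampleBase:
    def __init__(self):
        self.raw = []          # raw examples as entered
        self.abstracted = {}   # source template -> target template

    def add(self, src, tgt):
        """Expand the example-base with one new raw pair."""
        self.raw.append((src, tgt))
        a_src, a_tgt = abstract_example(src, tgt)
        self.abstracted[a_src] = a_tgt

    def translate(self, sentence):
        """Translate by matching an abstracted template, or return None."""
        tokens, fillers = sentence.lower().split(), {}
        for i, word in enumerate(tokens):
            if word in BILINGUAL:
                hindi, category = BILINGUAL[word]
                tokens[i] = "<" + category + ">"
                fillers[tokens[i]] = hindi
        template = self.abstracted.get(" ".join(tokens))
        if template is None:
            return None
        for slot, hindi in fillers.items():
            template = template.replace(slot, hindi)
        return template

base = ExampleBase()
base.add("I want the apple", "मुझे सेब चाहिए")
print(base.abstracted)
print(base.translate("I want the mango"))
```

A single raw example about an apple thus covers any sentence of the same shape whose slot word belongs to the same category, which is the practical benefit of abstraction.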
Specification