Third language text generating algorithm by multi-lingual text inputting and device and program therefor
First Claim
1. A third language text generating algorithm, for use in computer-based language processing, for generating anew third language text by using a plurality of multi-lingual texts, the algorithm including the steps of:
- inputting two or more multi-lingual texts written in different languages including a first language which serves as a source language and at least a second language into which the first language is translated;
performing language analysis including at least dependency analysis and semantic analysis, on each of the mufti-lingual texts in the form of each language or a combination of any two or more languages, thereby obtaining language information on at least a dependency structure and semantic representation; and
generating a third language text, wherein the generating step generates a third language text by using the language information obtained by the analyzing step, or the algorithm further including the step of performing language conversion based on the results of analysis obtained by the analyzing step or based on the results of analysis and conversion knowledge characteristic of a third language, the converting step following the analyzing step, wherein the generating step generates a third language text by using at least either the language information obtained by the analyzing step or the results of conversion obtained by the converting step.
1 Assignment
0 Petitions
Accused Products
Abstract
Provided is a technique which includes inputting a plurality of multi-lingual texts and using multi-lingual text corpora, thereby generating a higher-accuracy third language text as compared to the input of a unilingual text which has heretofore taken place. After inputting the texts, the processes for analyzing, converting and generating are performed, and a target language document text is outputted. The generation of target language document text does not require a large-scale corpus because information characteristic of the language can be automatically acquired.
25 Citations
13 Claims
-
1. A third language text generating algorithm, for use in computer-based language processing, for generating anew third language text by using a plurality of multi-lingual texts, the algorithm including the steps of:
-
inputting two or more multi-lingual texts written in different languages including a first language which serves as a source language and at least a second language into which the first language is translated;
performing language analysis including at least dependency analysis and semantic analysis, on each of the mufti-lingual texts in the form of each language or a combination of any two or more languages, thereby obtaining language information on at least a dependency structure and semantic representation; and
generating a third language text, wherein the generating step generates a third language text by using the language information obtained by the analyzing step, or the algorithm further including the step of performing language conversion based on the results of analysis obtained by the analyzing step or based on the results of analysis and conversion knowledge characteristic of a third language, the converting step following the analyzing step, wherein the generating step generates a third language text by using at least either the language information obtained by the analyzing step or the results of conversion obtained by the converting step. - View Dependent Claims (2, 3, 4)
-
-
5. A third language text generating device, for use in language processing, for generating a new third language text by using a plurality of languages, the device including:
-
inputting means for inputting two or more mufti-lingual texts written in different languages including a first language which serves as a source language and at least a second language into which the first language is translated;
analyzing means for performing language analysis including at least dependency analysis and semantic analysis, on each of the mufti-lingual texts in the form of each language or a combination of any two or more languages, thereby obtaining language information on at least a dependency structure and semantic representation;
generating means for generating a third language text; and
outputting means capable of outputting the third language text generated by the generating means, wherein the generating means generates the third language text by using the language information obtained by the analyzing means, or the device further including converting means for performing language conversion based on the results of analysis obtained by the analyzing means or based on the results of analysis and conversion knowledge characteristic of a third language, wherein the generating means generates the third language text by using at least either the language information obtained by the analyzing means or the results of conversion obtained by the converting means. - View Dependent Claims (6, 7, 8, 9)
-
-
10. A third language text generating program, for use in computer-based language processing, for generating anew third language text by using a plurality of multi-lingual texts, the program including:
-
an inputting portion which obtains two or more multi-lingual texts written in different languages including a first language which serves as a source language and at least a second language into which the first language is translated, from a storage device or an input device of a computer;
an analyzing portion which performs language analysis including at least dependency analysis and semantic analysis, on each of the obtained multi-lingual texts in the form of each language or a combination of any two or more languages, and obtains language information on at least a dependency structure and semantic representation by arithmetic operation using an arithmetic unit and a storage device of a computer;
a generating portion which generates a third language text by arithmetic operation using the arithmetic unit and the storage device of the computer; and
an outputting portion which outputs the third language text generated by the generating portion by using the storage device or an output device of the computer, wherein the generating portion generates the third language text by using the language information obtained by the analyzing portion, or the program further including a converting portion which performs language conversion based on the results of analysis obtained by the analyzing portion or based on the results of analysis and conversion knowledge characteristic of a third language, wherein the generating portion generates the third language text by using at least either the language information obtained by the analyzing portion or the results of conversion obtained by the converting portion. - View Dependent Claims (11, 12, 13)
-
Specification