Glossary construction tool
First Claim
1. A system for compiling a terminology list from a source document, said system comprising:
a processor for extracting multiple word noun phrases from said source document;
a predefined stop list stored in a memory;
means for extracting single words not appearing on the predefined stop list from said source document; and
a display for presenting said extracted multiple word noun phrases and said single words in a candidate terminology list to a user for evaluation in order to develop a final terminology list.
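As a rough illustration of the extraction recited in this claim, the Python sketch below builds a candidate terminology list from a source text. The claim does not say how noun phrases are detected, so the sketch substitutes a crude heuristic: any run of two or more consecutive words not on the stop list, within one clause, is treated as a multiple word phrase. That heuristic, the tiny `STOP_LIST`, and the names `tokenize` and `extract_candidates` are assumptions made for illustration, not the patented method.

```python
import re

# Hypothetical, deliberately tiny stop list of "noise" words (assumption).
STOP_LIST = {"a", "an", "the", "of", "to", "in", "is", "and", "for", "on"}

def tokenize(text):
    """Lowercase the text and split it into clauses of word tokens."""
    clauses = re.split(r"[.!?;:,]+", text.lower())
    return [re.findall(r"[a-z][a-z'-]*", clause) for clause in clauses]

def extract_candidates(text, stop_list=STOP_LIST):
    """Return candidate multiword phrases and single words not on the stop list.

    Assumption: a run of two or more consecutive non-stop-list words stands in
    for a noun phrase. A real system would use a part-of-speech tagger or parser.
    """
    phrases, singles = set(), set()
    for tokens in tokenize(text):
        run = []
        for tok in tokens + [None]:  # the trailing None flushes the last run
            if tok is not None and tok not in stop_list:
                run.append(tok)
                singles.add(tok)
                continue
            if len(run) >= 2:
                phrases.add(" ".join(run))
            run = []
    return {"phrases": sorted(phrases), "singles": sorted(singles)}

if __name__ == "__main__":
    sample = "The print head is mounted on the ink cartridge. Replace the ink cartridge."
    candidates = extract_candidates(sample)
    # The candidate list would then be shown to a user for evaluation.
    for term in candidates["phrases"] + candidates["singles"]:
        print(term)
```

The output is only a candidate list; as the claim states, a user still evaluates it to arrive at the final terminology list.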
Abstract
A glossary construction tool for generating and maintaining a translation glossary, consisting of a number of terms and their translations. The glossary construction tool includes a terminology list development tool for generating a terminology list in the source language and a glossary development tool for automatically obtaining candidate translations for the terms in the terminology list. The terminology list development tool will construct the terminology list in the source language by analyzing the source text document to be translated and automatically extracting a list of candidate terms, comprised of multiple word noun phrases and single words not appearing on a standard or predefined stop list of "noise" words. The glossary development tool will obtain candidate translations for terms in the final terminology list by searching the source text document of a word-aligned text pair for a term to be translated and then provide candidate translations based on the indicated alignment with the target text document of the aligned text pair. A concordance tool provides monolingual and bilingual concordances in order to facilitate the user's evaluation of the automatically generated list of candidate terms and candidate translations, respectively.
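To make the monolingual concordance mentioned in the abstract concrete, here is a minimal keyword-in-context sketch in Python. It lists every occurrence of a candidate term with a little surrounding text so a reviewer can judge whether the term is worth keeping. The function name `concordance`, the `width` parameter, and the bracketed output format are assumptions for illustration; the abstract does not describe how the patented concordance tool is implemented or displayed.

```python
import re

def concordance(text, term, width=30):
    """Return one line per occurrence of `term`, with up to `width`
    characters of context on each side (case-insensitive, whole words)."""
    pattern = r"\b" + re.escape(term) + r"\b"
    lines = []
    for match in re.finditer(pattern, text, flags=re.IGNORECASE):
        left = text[max(0, match.start() - width):match.start()]
        right = text[match.end():match.end() + width]
        # Right-align the left context so the matched terms line up in a column.
        lines.append(f"{left:>{width}}[{match.group(0)}]{right}")
    return lines

if __name__ == "__main__":
    sample = "Remove the ink cartridge. A new ink cartridge ships sealed."
    for line in concordance(sample, "ink cartridge", width=12):
        print(line)
```

A bilingual concordance would additionally show the aligned target-language passage next to each hit, which requires an aligned text pair and is not covered by this sketch.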
38 Claims
1. A system for compiling a terminology list from a source document, said system comprising:
a processor for extracting multiple word noun phrases from said source document;
a predefined stop list stored in a memory;
means for extracting single words not appearing on the predefined stop list from said source document; and
a display for presenting said extracted multiple word noun phrases and said single words in a candidate terminology list to a user for evaluation in order to develop a final terminology list.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A system for automatically compiling a terminology list from a source document, said system comprising:
a predefined stop list stored in a memory;
a processor for extracting multiple word noun phrases and single words not appearing on the predefined stop list from said source document; and
means for compiling said terminology list comprised of said extracted multiple word noun phrases and said single words.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
19. A system for automatically generating one or more candidate translations for a source language term to be translated into a target language, said system comprising:
a memory storing a previously translated source language document and a target language document which is a translation of said source language document;
a word level text alignment tool for automatically aligning said pair of previously translated documents; and
a processor for searching for said term to be translated in said source language document and obtaining a candidate translation based on the indicated alignment of the words of said term to be translated in the source language document with the words of the target language document of the aligned text pair.
Dependent claims: 20, 21, 22, 23, 24, 25
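The lookup recited in claim 19 can be sketched as follows, assuming the word level alignment has already been computed. The Python below searches aligned source sentences for the term and reads candidate translations off the alignment links. The data layout (token lists plus a set of `(source_index, target_index)` links), the function name `candidate_translations`, and the ranking by frequency are assumptions for illustration; the claim does not specify how the alignment tool represents its output or how candidates are ordered.

```python
from collections import Counter

def candidate_translations(term, aligned_pairs):
    """Collect candidate translations of `term` from word-aligned sentence pairs.

    aligned_pairs: iterable of (src_tokens, tgt_tokens, links), where `links`
    is a set of (source_index, target_index) pairs (hypothetical format).
    Returns (candidate, count) tuples, most frequent first.
    """
    term_tokens = term.lower().split()
    n = len(term_tokens)
    counts = Counter()
    for src, tgt, links in aligned_pairs:
        lowered = [word.lower() for word in src]
        for start in range(len(src) - n + 1):
            if lowered[start:start + n] != term_tokens:
                continue
            # Target positions linked to any word of this occurrence of the term.
            tgt_idx = sorted({j for i, j in links if start <= i < start + n})
            if tgt_idx:
                counts[" ".join(tgt[j] for j in tgt_idx)] += 1
    return counts.most_common()

if __name__ == "__main__":
    pairs = [
        (["Replace", "the", "ink", "cartridge"],
         ["Remplacez", "la", "cartouche", "d'encre"],
         {(0, 0), (1, 1), (2, 3), (3, 2)}),
        (["The", "ink", "cartridge", "is", "empty"],
         ["La", "cartouche", "d'encre", "est", "vide"],
         {(0, 0), (1, 2), (2, 1), (3, 3), (4, 4)}),
    ]
    print(candidate_translations("ink cartridge", pairs))
```

Joining the linked target words in target order is one simple choice; when the linked words are not contiguous, a real tool might instead return the whole spanned target segment.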
26. A method for compiling a terminology list from a source document, said method comprising the steps of:
extracting multiple word noun phrases from said source document;
storing a predefined stop list in a memory;
extracting single words not appearing on the predefined stop list from said source document; and
presenting said extracted multiple word noun phrases and said single words in a candidate terminology list on a display to a user for evaluation in order to develop a final terminology list.
Dependent claims: 27, 28, 29, 30, 31, 32, 33
34. A method for automatically generating one or more candidate translations for a source language term to be translated into a target language, said method comprising the steps of:
storing a previously translated source language document and a target language document which is a translation of said source language document in a memory;
automatically aligning said pair of previously translated documents at the word level; and
searching for said term to be translated in said source language document and obtaining a candidate translation based on the indicated alignment of the words of said term to be translated in the source language document with the words of the target language document of the aligned text pair.
Dependent claims: 35, 36, 37, 38
Specification