
Method for determining the semantic relatedness of lexical items in a text

  • US 5,128,865 A
  • Filed: 03/02/1990
  • Issued: 07/07/1992
  • Est. Priority Date: 03/10/1989
  • Status: Expired due to Fees
First Claim

1. A method of selecting automatically the most appropriate translation, in a given target language, of a given lexical item in a given context in a given source language, comprising the following steps:

    a) parsing, with the aid of a parsing system, an original sentence in which said given lexical item appears, in order to determine a syntactic structure of said sentence;

    b) identifying, in said syntactic structure, those contextual relations which said given lexical item has in said sentence including identifying other lexical items in said sentence to which the given lexical item is syntactically related, and the syntactic relations involved;

    c) retrieving said given lexical item, together with a set of alternative translations thereof, from a bilingual lexicon stored in electronic form, in which each of said alternative translations is associated with at least one contextual relation of said given lexical item in said given source language, with each said at least one contextual relation comprising a further lexical item and a syntactic relation;

    d) comparing each of said contextual relations identified in step b) in said original sentence, with each contextual relation associated with one of said alternative translations retrieved in step c), including comparison of the syntactic relations involved;

    e) for each of the comparisons performed in step d) in which said syntactic relations are found to be identical, determining a degree of semantic proximity between the given lexical item involved in the contextual relation identified in step b), and the further lexical item involved in the contextual relation retrieved in step c) by means of the following procedure;

        1) identifying in a predefined text corpus in said source language a set of sentences in which at least one of said lexical items appear, and retrieving said set of sentences from said text corpus,

        2) parsing, with the aid of said parsing system, each of said retrieved sentences in order to determine a syntactic structure of each of said sentences,

        3) for each said sentence retrieved, determining from the obtained syntactic structure those contextual relations which said lexical items have in that sentence,

        4) determining, for each of said lexical items, a total number of contextual relations found in step 3),

        5) determining a number of contextual relations which said lexical items have in common, and

        6) determining, on the basis of the results obtained in steps 4) and 5), a degree of overlap between the contextual relations of said lexical items, and thereby a degree of semantic proximity, or a degree of similarity, between said lexical items;

    f) for each combination of a contextual relation identified in said original sentence in step b), and a contextual relation retrieved from said bilingual lexicon in step c) together with an associated translation, adding the result obtained in step e) to obtain a score representing the appropriateness of that translation; and

    g) selecting from said set of alternative translations retrieved in step c) that translation to which the highest score is attached at the conclusion of step f).
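
The scoring part of the claim (sub-steps 4) to 6) of step e), then steps f) and g)) can be illustrated with a minimal Python sketch. It is not the patented implementation, and several things in it are assumptions. The claim does not give a formula for the "degree of overlap", so a Dice-style ratio (twice the shared relations divided by the sum of both totals) is used as a stand-in. Parsing and corpus retrieval (steps a), b), and sub-steps 1) to 3)) are assumed to have been done already, and their output is represented by hand-written dictionaries. The sketch compares the context word found in the sentence with the context word stored in the lexicon, which is one reading of step e). All names (`semantic_proximity`, `select_translation`, the relation labels, and the English-to-French "pen" data) are hypothetical.

```python
from typing import Dict, List, Set, Tuple

# A contextual relation is modelled as (syntactic relation, other lexical item).
Relation = Tuple[str, str]


def semantic_proximity(relations_a: Set[Relation], relations_b: Set[Relation]) -> float:
    """Sub-steps 4) to 6): overlap between two items' corpus relations.

    The Dice-style formula is an assumption; the claim only requires that
    the result be derived from the totals and the number in common.
    """
    total = len(relations_a) + len(relations_b)   # sub-step 4): totals per item
    common = len(relations_a & relations_b)       # sub-step 5): shared relations
    return 2 * common / total if total else 0.0   # sub-step 6): degree of overlap


def select_translation(
    sentence_relations: List[Relation],
    lexicon_entry: Dict[str, List[Relation]],
    corpus_relations: Dict[str, Set[Relation]],
) -> Tuple[str, Dict[str, float]]:
    """Steps d) to g): score every alternative translation and pick the best.

    Assumes lexicon_entry contains at least one translation.
    """
    scores: Dict[str, float] = {}
    for translation, lexicon_relations in lexicon_entry.items():
        score = 0.0
        for syn_rel, context_word in sentence_relations:
            for lex_rel, lex_word in lexicon_relations:
                if syn_rel != lex_rel:
                    continue  # step e): only identical syntactic relations count
                score += semantic_proximity(
                    corpus_relations.get(context_word, set()),
                    corpus_relations.get(lex_word, set()),
                )
        scores[translation] = score              # step f): summed score
    best = max(scores, key=scores.get)           # step g): highest score wins
    return best, scores


# Hypothetical data: translating "pen" in "the pig pen" into French.
SENTENCE_RELATIONS: List[Relation] = [("modifier", "pig")]
LEXICON_ENTRY: Dict[str, List[Relation]] = {
    "stylo": [("modifier", "ink")],     # writing instrument
    "enclos": [("modifier", "sheep")],  # animal enclosure
}
CORPUS_RELATIONS: Dict[str, Set[Relation]] = {
    "pig": {("subject_of", "eat"), ("object_of", "feed"), ("modifier_of", "farm")},
    "sheep": {("subject_of", "eat"), ("object_of", "feed"), ("object_of", "shear")},
    "ink": {("object_of", "spill"), ("modifier_of", "bottle")},
}

if __name__ == "__main__":
    best, scores = select_translation(SENTENCE_RELATIONS, LEXICON_ENTRY, CORPUS_RELATIONS)
    print(best)  # prints "enclos"
```

In this toy data "pig" and "sheep" share two of their three corpus relations each, so "enclos" scores 2·2/6 ≈ 0.67, while "pig" and "ink" share none and "stylo" scores 0. Any other overlap measure built from the same two counts could be substituted in `semantic_proximity` without changing the rest of the sketch.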
