×

System and method for cross-language knowledge searching

  • US 7,672,831 B2
  • Filed: 10/24/2005
  • Issued: 03/02/2010
  • Est. Priority Date: 10/24/2005
  • Status: Active Grant
First Claim
Patent Images

1. A computer-based method for cross-language knowledge searching, the method implemented by at least one computer processor accessing at least one knowledge base comprising sources in a first language and sources in a second language and a bilingual dictionary stored in at least one storage device, the method comprising:

  • building the bilingual dictionary using parallel corpora, including;

    for each sentence in a first source in the first language, generating a first source semantic index in the first language;

    for each sentence in a second source in the second language, generating a second source semantic index in the second language, where the second source is a translation of the first source and each first source semantic index and corresponding second source semantic index form parallel semantic indexes having parallel eSAO component pairs; and

    recognizing semantic components in an input expression in the first language;

    generating a first semantic index in the first language from the semantic components, wherein the first semantic index includes first lexical units, at least one first lexical unit comprising a word with a part of speech (POS) tag;

    translating the first semantic index into a second semantic index in the second language using a bilingual dictionary of actions and concepts, including translating the first lexical units into second lexical units in the second language, and translating a first word from the first semantic index into corresponding words in the second language and tagging each of the corresponding words with a POS tag of the first word; and

    retrieving information relevant to the input expression from a knowledge base, which includes semantically indexed information in the second language, when the first and second semantic indexes match a subset of semantic indexes of the knowledge base associated with the information.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×