×

Method and apparatus for multi-language indexing

  • US 6,389,387 B1
  • Filed: 05/27/1999
  • Issued: 05/14/2002
  • Est. Priority Date: 06/02/1998
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for forming an index comprising indexing features for a plurality of documents, comprising data-processing apparatus implemented steps of:

  • identifying each of at least some of the terms present in the documents;

    generating from each identified term at least one equivalent term which is different from but linguistically related to the identified term;

    forming for each of the identified terms a first indexing feature comprising the identified term and an identifier of the or each document in which the identified term occurs;

    forming for each of the equivalent terms a second indexing feature comprising the equivalent term and an identifier of the or each document in which the identified term to which the equivalent term is equivalent occurs; and

    forming an index comprising the first and second indexing features, wherein the documents are natural language documents in a source language, the at least one equivalent term is a natural language translation of the corresponding identified term in the source language to the equivalent term in a target language different from the source language, and the forming steps include forming the first and second indexing features for the identified term in the source language and the equivalent term in the target language, respectively.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×