Content filtering for electronic documents generated in multiple foreign languages

  • US 6,542,888 B2
  • Filed: 11/26/1997
  • Issued: 04/01/2003
  • Est. Priority Date: 11/26/1997
  • Status: Expired due to Term
First Claim
1. A method for categorizing documents generated in one or more languages comprising the steps of:

  • providing topic categories representing the terms from all of said languages for topic subject matter from documents;

  • assigning topic token IDs to said topic categories regardless of language of generation;

  • for each document to be categorized, assigning document token IDs representing the terms from all of said languages for the document subject matter, consistent with said topic categories;

  • replacing document content with at least one replacement document token ID for each of said topic categories; and

  • matching topic token IDs to said at least one replacement document token ID.
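
The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the topic names, the term lists, the integer ID scheme, and the function names `tokenize_document` and `categorize` are all hypothetical, and the simple whitespace tokenizer is an assumption made only to keep the example short.

```python
from collections import Counter

# Hypothetical topic categories: each one gathers terms from several
# languages (English, French, Spanish here) for the same subject matter.
TOPIC_TERMS = {
    "FINANCE": {"bank", "loan", "banque", "prêt", "banco", "préstamo"},
    "WEATHER": {"rain", "storm", "pluie", "orage", "lluvia", "tormenta"},
}

# Assign one topic token ID per category, regardless of language.
TOPIC_TOKEN_IDS = {name: i for i, name in enumerate(sorted(TOPIC_TERMS), start=1)}

# Reverse index from any known term, in any language, to its token ID.
TERM_TO_TOKEN = {
    term: TOPIC_TOKEN_IDS[name]
    for name, terms in TOPIC_TERMS.items()
    for term in terms
}


def tokenize_document(text):
    """Replace document content with token IDs of the recognised terms."""
    words = (w.strip(".,;:!?\"'()").lower() for w in text.split())
    return [TERM_TO_TOKEN[w] for w in words if w in TERM_TO_TOKEN]


def categorize(text):
    """Match topic token IDs against the document's replacement token IDs.

    Returns a mapping of topic name to the number of matching tokens.
    """
    counts = Counter(tokenize_document(text))
    return {name: counts[tid] for name, tid in TOPIC_TOKEN_IDS.items() if counts[tid]}
```

Because matching happens on token IDs rather than on words, a French sentence such as "La banque a refusé le prêt." and an English sentence mentioning "bank" and "loan" fall into the same hypothetical FINANCE category without any translation step.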
