Multi-language document search and retrieval system

  • US 7,369,987 B2
  • Filed: 12/29/2006
  • Issued: 05/06/2008
  • Est. Priority Date: 11/30/1998
  • Status: Expired due to Fees
First Claim

1. A system for indexing textual content in any of a plurality of languages for searching purposes, comprising:

  • a processing device, comprising:

    a tokenizer which separates a string of text into individual word tokens,
    a stemmer which reduces the word tokens to grammatical stems by removing word endings which are associated with any one or more of the languages, without regard to whether the remaining stem is a recognized word in any combination of the plurality of languages, and
    an indexer which creates an index from the stems; and

    a computer-readable medium which stores the created index.
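
The elements recited in the claim (tokenizer, stemmer, indexer, stored index) can be illustrated with a minimal Python sketch. This is not the patented implementation: the suffix list, the minimum stem length, the function names, and the JSON storage format are all assumptions chosen for illustration. The stemmer strips at most one ending drawn from a mixed-language list and never checks whether the resulting stem is a real word, which is the behavior the claim describes.

```python
import json
import re
from collections import defaultdict

# Hypothetical endings drawn from several languages (English, German,
# Spanish); the claim does not enumerate any specific endings.
SUFFIXES = sorted(
    ["ing", "ed", "es", "s", "en", "er", "ar", "os", "as"],
    key=len,
    reverse=True,  # try longer endings first
)
MIN_STEM = 3  # assumed lower bound so short words are not stripped to nothing


def tokenize(text):
    """Separate a string of text into lowercase word tokens."""
    return re.findall(r"\w+", text.lower())


def stem(token):
    """Remove one known ending, with no dictionary check on the result."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= MIN_STEM:
            return token[: -len(suffix)]
    return token


def build_index(docs):
    """Map each stem to the sorted list of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[stem(token)].add(doc_id)
    return {s: sorted(ids) for s, ids in index.items()}


def save_index(index, path):
    """Store the index on a computer-readable medium (here, a JSON file)."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(index, fh, ensure_ascii=False)
```

For example, `build_index({"d1": "Searching documents", "d2": "searched document"})` files both documents under the stems `search` and `document`, while `stem("hablamos")` yields `hablam`, which is not a recognized word in any language but still works as an index key.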
