×

Apparatus and method for generating data useful in indexing and searching

  • US 7,152,056 B2
  • Filed: 04/16/2003
  • Issued: 12/19/2006
  • Est. Priority Date: 04/19/2002
  • Status: Active Grant
First Claim
Patent Images

1. A method for computerized processing of document data, comprising:

  • receiving the document data;

    retrieving tokenization rules for the document data;

    applying the tokenization rules to the document data to generate a plurality of tokens, each having one or more concordable characters from the document data;

    receiving a user query;

    re-retrieving the tokenization rules;

    applying the re-retrieved tokenization rules to the user query to generate one or more search tokens, each having one or more concordable characters from the user query and wherein the user query is derived from a query language and comprises at least one reserved character from a set of concordable characters for use in the query language; and

    searching an index comprising the tokens with the one or more search tokens.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×