×

Selection of a set of optimal n-grams for indexing string data in a DBMS system under space constraints introduced by the system

  • US 20060101000A1
  • Filed: 11/05/2004
  • Published: 05/11/2006
  • Est. Priority Date: 11/05/2004
  • Status: Active Grant
First Claim
Patent Images

1. A method for selecting a set of n-grams for indexing string data in a DBMS system, comprising:

  • providing a set of candidate n-grams, each n-gram comprising a sequence of characters;

    identifying sample queries having character strings containing the candidate n-grams; and

    based on the set of candidate n-grams, the sample queries, database records, and an n-gram space constraints, automatically selecting, given the space constraint, a minimal set of n-grams from the set of candidate n-grams that minimizes the number of false hits for the set of sample queries had the sample queries been executed against the database records.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×