INDEX SERVER ARCHITECTURE USING TIERED AND SHARDED PHRASE POSTING LISTS

  • US 20100161617A1
  • Filed: 03/02/2010
  • Published: 06/24/2010
  • Est. Priority Date: 03/30/2007
  • Status: Active Grant
First Claim

1. A method of indexing documents based on phrases occurring in the documents, the method comprising:

    selecting a phrase posting list associated with a phrase, and identifying a plurality of documents having at least one occurrence of the phrase;

    determining a length of the phrase posting list;

    responsive to the length of the phrase posting list being less than a first predetermined length, associating the phrase posting list with one of a plurality of first tier index servers;

    responsive to the length of the phrase posting list being greater than the first predetermined length:

        dividing the phrase posting list into a plurality of shards, each shard including a subset of the plurality of the documents; and

        associating each phrase posting list shard with a corresponding selected second tier index server, wherein the number of shards corresponds to the number of second tier index servers.
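
The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The names IndexServer and assign_posting_list are hypothetical, and so are three choices the claim leaves open: document identifiers are integers, the first tier server is picked by a CRC32 hash of the phrase, and documents are split into shards by identifier modulo the number of second tier servers. The claim does not say what happens when the length equals the predetermined length; the sketch shards in that case, which is also an assumption.

```python
import zlib
from dataclasses import dataclass, field


@dataclass
class IndexServer:
    """Hypothetical index server holding posting lists (or shards) by phrase."""
    name: str
    lists: dict = field(default_factory=dict)


def assign_posting_list(phrase, doc_ids, threshold, first_tier, second_tier):
    """Place one phrase posting list on the tiers; return the server names used."""
    # Determine the length of the phrase posting list.
    length = len(doc_ids)

    if length < threshold:
        # Short list: store it whole on one first tier server.
        # Choosing the server by hash is an assumption of this sketch.
        index = zlib.crc32(phrase.encode("utf-8")) % len(first_tier)
        server = first_tier[index]
        server.lists[phrase] = list(doc_ids)
        return [server.name]

    # Long list: one shard per second tier server, so the number of
    # shards matches the number of second tier servers.
    # (length == threshold also lands here; the claim leaves it unspecified.)
    shard_count = len(second_tier)
    shards = [[] for _ in range(shard_count)]
    for doc_id in doc_ids:
        # Modulo partitioning is an assumption of this sketch.
        shards[doc_id % shard_count].append(doc_id)

    for server, shard in zip(second_tier, shards):
        server.lists[phrase] = shard
    return [server.name for server in second_tier]
```

With two first tier servers, three second tier servers, and a threshold of 5, a three-document list stays whole on a single first tier server, while a ten-document list is split into three shards, one per second tier server.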
