×

Index server architecture using tiered and sharded phrase posting lists

  • US 8,090,723 B2
  • Filed: 03/02/2010
  • Issued: 01/03/2012
  • Est. Priority Date: 03/30/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method of indexing documents based on phrases occurring in the documents, the method comprising:

  • selecting a phrase posting list associated with a phrase, the phrase posting list identifying a number of documents having at least one occurrence of the phrase;

    determining, with one or more processors, a length of the phrase posting list, the length of the phrase posting list being based on the number of documents having at least one occurrence of the phrase;

    when the length of the phrase posting list is less than a first predetermined length, associating the phrase posting list with one of a plurality of first tier index servers;

    when the length of the phrase posting list is greater than the first predetermined length;

    dividing the phrase posting list into a plurality of shards, each shard identifying a subset of the plurality of the documents identified by the posting list; and

    associating each phrase posting list shard with a corresponding selected second tier index server.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×