×

Index server architecture using tiered and sharded phrase posting lists

  • US 8,682,901 B1
  • Filed: 12/20/2011
  • Issued: 03/25/2014
  • Est. Priority Date: 03/30/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method of indexing documents of a document collection in an indexing system that includes a plurality of index servers, the method comprising:

  • determining a phrase posting list associated with a first phrase, the phrase posting list identifying documents of the document collection associated with the first phrase;

    dividing the phrase posting list for the first phrase into a plurality of different shards, each shard identifying a subset of the documents identified by the posting list;

    storing each different shard of the phrase posting list for the first phrase on a corresponding different index server;

    storing a plurality of shards of different phrase posting lists on a first index server;

    storing a plurality of shards of different phrase posting lists on a second index server; and

    within each of the first and second index servers, for each shard of a phrase posting list, ordering the shard according to document identifiers of the documents included in the shard.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×