×

Realtime indexing and search in large, rapidly changing document collections

  • US 7,634,466 B2
  • Filed: 08/02/2006
  • Issued: 12/15/2009
  • Est. Priority Date: 06/28/2005
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computerized method for indexing content items, the method comprising:

  • generating, using a processor, an inverted index of word location pairs that identifies the location of one or more words in one or more content items available on a network;

    storing the inverted index in an index data store;

    dynamically receiving one or more additional content items over the network;

    prior to elapsing of a predetermined time threshold, storing the one or more additional content items in a stream search queue, the stream search queue operative to allow for a stream search of the one or more additional content items;

    once the time threshold elapses, indexing, using the processor, the one or more additional content items in the stream search queue and then writing the indexed content from the stream search queue into the inverted index;

    receiving a query from a user, the query comprising one or more query executing a stream search of the stream search queue to identify a given one of the query terms and to generate a stream search result set;

    executing an index search of the inverted index of word location pairs to identify a given one of the query terms and generate an index result set; and

    generating a merge result set on the basis of the stream result set and the index result set.

View all claims
  • 9 Assignments
Timeline View
Assignment View
    ×
    ×