×

Methods and systems for indexing references to documents of a database and for locating documents in the database

  • US 10,169,388 B2
  • Filed: 11/10/2015
  • Issued: 01/01/2019
  • Est. Priority Date: 10/10/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method for locating documents of a database that contain search terms, the method comprising:

  • receiving a search term, at the database, from a client, the search term being associated with a posting list, the posting list being arranged in blocks, each block comprising a header and M truncated references, the M being an integer, each block having been compressed into a compressed block by encoding content of each block using an encoding pattern, the encoding pattern for each block having been determined based on values of the M truncated references in each block;

    reading a pointer from a header of a current block of the posting list;

    using the pointer to extract a decoding protocol from a decoding protocol table, wherein the decoding protocol defines the encoding pattern for the current block, the encoding pattern of the current block comprises;

    a base length b of M truncated references in the current block;

    a number n of patches in the current block;

    if n>

    0, one or more patch values vk of the current block, wherein k is in a range from 1 to n;

    if n>

    0, one or more patch positions pk in the current block, wherein pk is in a range of 0 to M−

    1;

    decompressing the current block based on the decoding protocol by;

    reading, in the current block, b·

    M bits comprising the M truncated references, wherein M−

    n of the M truncated references are database references; and

    if n>

    0, for each patch from k=1 to n, calculating an expanded patch value as vk·

    2b and adding the expanded patch value to a pkth of the M truncated reference numbered from 0 to M−

    1 for providing n additional database references; and

    the decompressing comprises identifying M database references based on the decoding protocol and the M truncated references within the current block.

View all claims
  • 3 Assignments
Timeline View
Assignment View
    ×
    ×