×

Systems, methods and computer program products for fast and scalable proximal search for search queries

  • US 8,805,848 B2
  • Filed: 05/24/2012
  • Issued: 08/12/2014
  • Est. Priority Date: 05/24/2012
  • Status: Expired due to Fees
First Claim
Patent Images

1. A computer program product for information retrieval from multiple documents, the computer program product comprising a tangible storage medium readable by a computer system and storing instructions for execution by the computer system for performing a method comprising:

  • splitting each document into multiple snippets of words;

    indexing each snippet as a separate document;

    receiving an input search query including at least one sentence;

    processing the search query against the indexes of each of the multiple snippets by searching query terms over each of the multiple snippets to implicitly introduce term proximity information in the information retrieval;

    decomposing the search query into sub-queries;

    processing each sub-query against the indexes of each of the multiple snippets, sentence by sentence, using all words in each sentence of the sub-query to create an OR-Query of all non-stopwords in the sentence;

    returning a fit value for each OR-Query; and

    aggregating the fit values to provide a score for every document returned by the OR-Queries.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×