×

Systems, methods, and computer program products for fast and scalable proximal search for search queries

  • US 8,745,062 B2
  • Filed: 08/16/2012
  • Issued: 06/03/2014
  • Est. Priority Date: 05/24/2012
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of information retrieval from multiple documents, comprising:

  • splitting each document into multiple snippets of words;

    generating a separate index for each snippet;

    receiving an input search query including at least one sentence; and

    processing the search query against each separate index of each snippet of the multiple snippets by searching query terms over each of the multiple snippets to implicitly introduce term proximity information in the information retrieval, wherein processing the search query further comprises;

    creating an OR-Query of all non-stopwords in each sentence;

    returning a fit value for each OR-Query, wherein a fit value represents a similarity metric that measures the amount of word content overlap between two text units; and

    aggregating the fit values to provide a score for every document returned by the OR-Queries.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×