Substitute term identification based on over-represented terms identification

  • US 9,152,698 B1
  • Filed: 01/03/2012
  • Issued: 10/06/2015
  • Est. Priority Date: 01/03/2012
  • Status: Active Grant
First Claim

1. A computer-implemented method comprising:

  • receiving an original query that includes one or more query terms;

    determining, by one or more computers, not to apply a weak query term substitution rule to the original query, wherein the weak query term substitution rule identifies a particular term as a substitute for one or more of the query terms;

    after determining not to apply the weak query term substitution rule to the original query, obtaining an initial set of search results from a text corpus of indexed resources;

    determining, using the particular term's frequency-inverse document frequency (tf-idf) weight and by one or more computers, that the particular term occurs in text associated with a subset of the initial set of search results at a higher rate than the particular term occurs in the text corpus as a whole;

    in response to determining that the particular term occurs in text associated with the subset of the initial set of search results at the higher rate than the particular term occurs in the text corpus as a whole, applying the weak query term substitution rule to the original query, to revise the original query to include the particular term; and

    obtaining a subsequent set of search results in response to the revised query.
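
The flow recited in the claim can be illustrated with a minimal Python sketch. It is not the patented implementation: the toy corpus, the `search` function, and the names `term_rate`, `is_over_represented`, and `revise_query` are all hypothetical, and a plain relative-frequency comparison stands in for the tf-idf weighting named in the claim. The weak rule is held back at first, the original query is run, and the rule is applied only if the substitute term is over-represented in the top results.

```python
# Illustrative sketch only; all names and data below are assumptions.
CORPUS = [
    "cheap car rental downtown",
    "car rental and automobile hire",
    "automobile rental car deals",
    "pasta recipes for dinner",
    "weather forecast today",
    "garden tools sale",
]


def term_rate(term, docs):
    """Fraction of all tokens in `docs` that equal `term`."""
    tokens = [tok for doc in docs for tok in doc.lower().split()]
    return tokens.count(term) / len(tokens) if tokens else 0.0


def search(query_terms, corpus):
    """Toy retrieval: documents that contain every query term."""
    return [
        doc for doc in corpus
        if all(term in doc.lower().split() for term in query_terms)
    ]


def is_over_represented(term, results, corpus, top_k=3):
    """True if `term` is more frequent in the top results than in the corpus.

    Simplification: raw token rates replace the tf-idf weight.
    """
    return term_rate(term, results[:top_k]) > term_rate(term, corpus)


def revise_query(query_terms, weak_rule, corpus, top_k=3):
    """Apply a weak (original -> substitute) rule only when the evidence supports it."""
    original, substitute = weak_rule
    if original not in query_terms:
        return list(query_terms)
    # The weak rule is not applied up front; run the original query first.
    initial_results = search(query_terms, corpus)
    if is_over_represented(substitute, initial_results, corpus, top_k):
        # Revise the query to include the substitute term.
        return list(query_terms) + [substitute]
    return list(query_terms)


revised = revise_query(["car", "rental"], ("car", "automobile"), CORPUS)
unchanged = revise_query(["car", "rental"], ("car", "pasta"), CORPUS)
```

In this toy data, "automobile" appears in two of the three documents returned for the original query but is rarer across the corpus as a whole, so the rule is applied; "pasta" never appears in those results, so the query is left alone. A subsequent search would then be issued with the revised query.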
