Adaptive evaluation of text search queries with blackbox scoring functions
First Claim
Patent Images
1. A computer program product embodied on a computer readable medium, the computer program product comprising computer instructions that implement the following operations:
- performing the following in response to a query comprising at least two terms, the query having a first portion comprising at least one term of the at least two terms and a second portion comprising at least one other term of the at least two terms;
selecting a document from a plurality of documents;
determining a score range of possible scores for the selected document by using a scoring function,where the score range comprises a lower bound and an upper bound, where the upper bounds and lower bounds are not equal,the determining the score range causing the scoring function to perform a partial evaluation, using only the first portion of the query, for the selected document in order to determine the score range,a score of the selected document residing within the score range and determinable using the first and second portion from the query; and
determining, based at least on the score range, whether the selected document is relevant to the query, and if the selected document is relevant to the query, retaining the selected document as being relevant to the query,where the selected document is determined as relevant to the query in response to determining that the lower bound of the score range exceeds a threshold andthe selected document is determined as not relevant to the query in response to determining that the upper bound of the score range does not exceeds the threshold.
0 Assignments
0 Petitions
Accused Products
Abstract
Disclosed is an evaluation technique for text search with black-box scoring functions, where it is unnecessary for the evaluation engine to maintain details of the scoring function. Included is a description of a system for dealing with blackbox searching, proofs of correctness, as well experimental evidence showing that the performance of the technique is comparable in efficiency to those techniques used in custom-built engines.
-
Citations
25 Claims
-
1. A computer program product embodied on a computer readable medium, the computer program product comprising computer instructions that implement the following operations:
-
performing the following in response to a query comprising at least two terms, the query having a first portion comprising at least one term of the at least two terms and a second portion comprising at least one other term of the at least two terms; selecting a document from a plurality of documents; determining a score range of possible scores for the selected document by using a scoring function, where the score range comprises a lower bound and an upper bound, where the upper bounds and lower bounds are not equal, the determining the score range causing the scoring function to perform a partial evaluation, using only the first portion of the query, for the selected document in order to determine the score range, a score of the selected document residing within the score range and determinable using the first and second portion from the query; and determining, based at least on the score range, whether the selected document is relevant to the query, and if the selected document is relevant to the query, retaining the selected document as being relevant to the query, where the selected document is determined as relevant to the query in response to determining that the lower bound of the score range exceeds a threshold and the selected document is determined as not relevant to the query in response to determining that the upper bound of the score range does not exceeds the threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A system comprising a processor configured to perform operations of:
-
in response to a query comprising at least two terms, selecting a document from a plurality of documents, the query having a first portion comprising at least one term of the at least two terms and a second portion comprising at least one other term of the at least two terms; determining a score range of possible scores for the selected document by using a scoring function, where the score range comprises a lower bound and an upper bound, where the upper bounds and lower bounds are not equal, the determining the score range causing the scoring function to perform a partial evaluation, using only the first portion of the query, for the selected document in order to determine the score range, a score of the selected document residing within the score range and the score determinable using the first and second portion from the query, and determining, based at least on the score range, whether the selected document is relevant to the query, and if the selected document is relevant to the query, retaining the selected document as being relevant to the query, where the selected document is determined as relevant to the query in response to determining that the lower bound of the score range exceeds a threshold and the selected document is determined as not relevant to the query in response to determining that the upper bound of the score range does not exceeds the threshold. - View Dependent Claims (21, 22)
-
-
23. A method comprising:
-
performing the following in response to a query comprising at least two terms, the query having a first portion comprising at least one term of the at least two terms and a second portion comprising at least one other term of the at least two terms; selecting a document from a plurality of documents; determining a score range of possible scores for the selected document by using a scoring function, where the score range comprises a lower bound and an upper bound, where the upper bounds and lower bounds are not equal, the determining the score range causing the scoring function to perform a partial evaluation, using only the first portion of the query, for the selected document in order to determine the score range, a score of the selected document residing within the score range and the score determinable using the first and second portion from the query; and determining, based at least on the score range, whether the selected document is relevant to the query, and if the selected document is relevant to the query, retaining the selected document as being relevant to the query, where the selected document is determined as relevant to the query in response to determining that the lower bound of the score range exceeds a threshold and the selected document is determined as not relevant to the query in response to determining that the upper bound of the score range does not exceeds the threshold. - View Dependent Claims (24, 25)
-
Specification