System and method of information retrieval engine evaluation using human judgment input
First Claim
Patent Images
1. An information retrieval engine evaluation method, comprising:
- identifying a query benchmark comprising a plurality of queries, the queries having corresponding search results;
obtaining judgment input from one or more judges, the judgment input corresponding to the set of search results;
determining, using the judgment input obtained from the one or more judges, at least one metric corresponding to an indicator of performance, a first value of the at least one metric corresponding to a first information retrieval engine and a second value of the at least one metric corresponding to a second information retrieval engine; and
comparing the first and second values of the at least one metric to evaluate the performance indicator so as to evaluate performance of the first information retrieval engine relative to the second information retrieval engine based on the query benchmark.
3 Assignments
0 Petitions
Accused Products
Abstract
An information retrieval engine evaluation system and method is disclosed, which uses judgment input, or feedback, received from one or more individuals, or judges. Judgment input is provided by the one or judges, each of whom review at least one aspect of performance of a software application, and provide judgment input in the form of responses to questions. The judgment input is received and analyzed, and can be used to generate the one or more metrics. The metrics can be examined to evaluate at least one indicator in order to determine performance of the software application.
-
Citations
17 Claims
-
1. An information retrieval engine evaluation method, comprising:
-
identifying a query benchmark comprising a plurality of queries, the queries having corresponding search results; obtaining judgment input from one or more judges, the judgment input corresponding to the set of search results; determining, using the judgment input obtained from the one or more judges, at least one metric corresponding to an indicator of performance, a first value of the at least one metric corresponding to a first information retrieval engine and a second value of the at least one metric corresponding to a second information retrieval engine; and comparing the first and second values of the at least one metric to evaluate the performance indicator so as to evaluate performance of the first information retrieval engine relative to the second information retrieval engine based on the query benchmark. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method of measuring search engine performance using a set of stored queries, results and judgments, the method comprising:
-
generating a query benchmark from the set of stored queries, the query benchmark includes one or more queries; obtaining query results using the one or more benchmark queries; retrieving one or more stored results associated with the plurality of queries; retrieving a judgment associated with at least one stored result; predicting a judgment associated with one or more of the obtained results based on the at least one stored result; and determining a performance measure using the retrieved and predicted judgments. - View Dependent Claims (12, 13, 14, 15)
-
-
16. An information retrieval engine evaluation system, comprising:
-
program memory for storing process steps executable to;
identify a query benchmark comprising a plurality of queries, the queries having corresponding search results, obtain judgment input from one or more judges, the judgment input corresponding to the set of search results determine, using the judgment input obtained from the one or more judges, at least one metric corresponding to an indicator of performance, a first value of the at least one metric corresponding to a first information retrieval engine and a second value of the at least one metric corresponding to a second information retrieval engine; and
compare the first and second values of the at least one metric to evaluate the performance indicator so as to evaluate performance of the first information retrieval engine relative to the second information retrieval engine; andat least one processor for executing the process steps stored in said program memory.
-
-
17. A system for measuring search engine performance using a set of stored queries, results and judgments, the system comprising:
-
program memory for storing process steps executable to;
generate a query benchmark from the set of stored queries, the query benchmark includes one or more queries;
obtain query results using the one or more benchmark queries;
retrieve one or more stored results associated with the plurality of queries;
retrieve a judgment associated with at least one stored result;
predict a judgment associated with one or more of the obtained results based on the at least one stored result; and
determine a performance measure using the retrieved and predicted judgments; andat least one processor for executing the process steps stored in said program memory.
-
Specification