System and method for improving the ranking of information retrieval results for short queries
First Claim
1. A method for a computer system, having a CPU and RAM, to retrieve information in response to a query, comprising the steps of:
- issuing a query on a database;
identifying a retrieved document based on the query;
receiving into said RAM said retrieved document and an accompanying variable s having a value corresponding to a relevance-ranking score of said retrieved document;
receiving into said RAM a variable q having a value corresponding to the number of words in the query and a variable v having a value corresponding to the overlap between the words in said retrieved document and in the query; and
using said CPU and said variables s, q and v to generate an adjusted score s1 corresponding to the value of said variable s increased by an amount proportional to the value of said variable v, said amount decreasing as the value of said variable q increases.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for retrieving information in response to a query by a user. The method includes the steps of receiving a signal s having a value corresponding to a relevance-ranking algorithm score of a retrieved document, receiving a signal q having a value corresponding to the number of words in the query and a signal v having a value corresponding to the coordination level of the retrieved document and query (i.e., the degree of overlap between the document terms and the query terms), and generating an adjusted score s1 dependent on the signal s, the signal q and the signal v. The adjusted score s1 takes the coordination level into account for small values of q and gradually decreases the importance of the coordination level as q increases. The system of this invention includes a computer-based system for carrying out the method of this invention.
-
Citations
45 Claims
-
1. A method for a computer system, having a CPU and RAM, to retrieve information in response to a query, comprising the steps of:
-
issuing a query on a database; identifying a retrieved document based on the query; receiving into said RAM said retrieved document and an accompanying variable s having a value corresponding to a relevance-ranking score of said retrieved document; receiving into said RAM a variable q having a value corresponding to the number of words in the query and a variable v having a value corresponding to the overlap between the words in said retrieved document and in the query; and using said CPU and said variables s, q and v to generate an adjusted score s1 corresponding to the value of said variable s increased by an amount proportional to the value of said variable v, said amount decreasing as the value of said variable q increases. - View Dependent Claims (2, 3, 4)
-
-
5. A computer system for assigning an adjusted relevancy score to information retrieved in response to a query, comprising:
-
a CPU, RAM and a database; means for issuing a query on the database; means for identifying a retrieved document based on the query; means for receiving into said RAM a retrieved document and an accompanying variable s having a value corresponding to a relevance-ranking score of said retrieved document; means for receiving into said RAM a variable q having a value corresponding to the number of words in the query and a variable v having a value corresponding to the overlap between the words in said retrieved document and in the query; and means for generating, dependent on said variables s, q and v, an adjusted score s1 equal to the value of said variable s increased by an amount dependent on the value of said variable v, said amount decreasing as the value of said variable q increases. - View Dependent Claims (6, 7, 8)
-
-
9. A computer system for assigning an adjusted relevancy score to information retrieved in response to a query, comprising:
-
a CPU, RAM and a database; means for issuing a query on the database; means for identifying a retrieved document based on the query; means for receiving into said RAM said retrieved document; means responsive to a variable s having a value corresponding to a relevance-ranking score of said retrieved document; means responsive to a variable q having a value corresponding to the number of words in the query and to a variable v having a value corresponding to the overlap between the words in said retrieved document and in the query; and a function generator for receiving said variables s, q and v and responsively generating an adjusted score s1 equal to the value of said variable s increased by an amount dependent on the value of said variable v, said amount decreasing as the value of said variable q increases. - View Dependent Claims (10, 11, 12)
-
-
13. A method for a computer system having a CPU, RAM, and a database to assign a relevancy score to information retrieved in response to a query, comprising the steps of:
-
issuing a query on said database; identifying a retrieved document based on the query; receiving into said RAM said retrieved document and an accompanying variable s having a value corresponding to a relevance-ranking score of said retrieved document; receiving into said RAM said variable f1 having a value dependent on the number of words in the query and on a value corresponding to the overlap between words in said retrieved document and in the query; and using said CPU to add said variable f1 to said variable s to generate a function B equal to the value of said variable s increased by an amount dependent on the value of said variable f1, said amount decreasing as the number of words in the query increases. - View Dependent Claims (14, 15, 16, 17)
-
-
18. A computer system for assigning an adjusted relevancy score to information retrieved in response to a query, comprising:
-
means for issuing a query on a database; means for identifying a retrieved document based on said query; means for receiving into a retrieved document and an accompanying variable s having a value corresponding to a relevance-ranking score of said retrieved document; means for receiving a variable f1 having a value dependent on the number of words in the query and on a value corresponding to the overlap between the words in said retrieved document and in the query; and means for adding said variable s to said variable f1 to generate a function B corresponding to the value of said variable s increased by an amount dependent on the value of said variable f1. - View Dependent Claims (19, 20, 21, 22, 23)
-
-
24. A computer system for assigning an adjusted relevancy score to information retrieved in response to a query, comprising:
-
a function generator for producing a function s having a value corresponding to a relevance-ranking score of a retrieved document; a variable f1 having a value dependent on the number of words in the query and on a value corresponding to the overlap between the words in said retrieved document and in the query; and an adder, coupled to said function generator, for adding said variable f1 to said variable s to generate a function B corresponding to the value of said variable s increased by an amount dependent on the value of said variable f1, said amount decreasing as the number of words in the query increases. - View Dependent Claims (25, 26)
-
-
27. A method for a computer system having a CPU and RAM to assign an adjusted relevancy score to information identified in response to a query, comprising the steps of:
-
receiving into said RAM a variable s having a value corresponding to a relevance-ranking score of an identified document; receiving into said RAM a variable q having a value corresponding to the number of words in the query and a variable v having a value corresponding to the overlap between the number of words in said identified document and in the query; and using said CPU and said variables s, q and v to generate an adjusted score s1 corresponding to the value of said variable s increased by an amount proportional to the value of said variable v, said amount decreasing as the value of said variable q increases. - View Dependent Claims (28, 29, 30)
-
-
31. A computer system for identifying information in response to a query, comprising:
-
means for receiving a variable s having a value corresponding to a relevance-ranking score of an identified document; means for receiving a variable q having a value corresponding to the number of words in the query and a variable v having a value corresponding to the overlap between the words in said identified document and in the query; and means for generating, dependent on said variables s, q and v, an adjusted score s1 corresponding to the value of said variable s increased by an amount dependent on the value of said variable v, said amount decreasing as the value of said variable q increases. - View Dependent Claims (32, 33, 34)
-
-
35. A method for a computer system to identify information in response to a query, comprising the steps of:
-
receiving a variable s having a value corresponding to a relevance-ranking score of an identified document; receiving a variable f1 having a value dependent on the number of words in the query and on a value corresponding to the overlap between words in said identified document and in the query; and adding said variable f1 to said variable s to generate a function B corresponding the value of said variable s increased by an amount dependent on the value of said variable f1, said amount decreasing as the number of words in the query increases. - View Dependent Claims (36, 37, 38)
-
-
39. A computer system for identifying information in response to a query, comprising:
-
means for receiving a variable s having a value corresponding to a relevance-ranking score of an identified document; means for receiving a variable f1 having a value dependent on the number of words in the query and on a value corresponding to the overlap between the words in said identified document and in the query; and means for adding said variable s to said variable f1 to generate a function B corresponding to the value of said variable s increased by an amount dependent on the value of said variable f1. - View Dependent Claims (40, 41, 42, 43)
-
-
44. A program recorded in a computer-readable medium for causing a computer to perform the steps of:
-
receiving a variable s having a value corresponding to a relevance-ranking score of an identified document; receiving a variable q having a value corresponding to the number of words in the query and a variable v having a value corresponding to the overlap between the words in said identified document and in the query; and using said variables s, q and v to generate an adjusted score s1 corresponding to the value of said variable s increased by an amount dependent on the value of said variable v, said amount decreasing as the value of said variable q increases.
-
-
45. A program recorded in a computer-readable medium for causing a computer to perform the steps of:
-
receiving a variable s having a value corresponding to a relevance-ranking score of an identified document; receiving a variable f1 having a value dependent on the number of words in the query and on a value corresponding to the overlap between words in said identified document and in the query; and adding said variable f1 to said variable s to generate a function B corresponding the value of said variable s increased by an amount dependent on the value of said variable f1, said amount decreasing as the number of words in the query increases.
-
Specification