Ranking functions using an incrementally-updatable, modified naive bayesian query classifier
First Claim
1. A computer readable medium having stored thereon computer-executable instructions for ranking documents on a network in response to a user inputted search query comprising one or more search query terms, said computer-executable instructions utilizing an incrementally-updatable query classifier model that can be updated by updating count values #(Asset), #(wi, Asset) and Σ
- #(wi, Asset), wherein #(Asset) represents a number of times that a given document on the network is selected for viewing by any user, #(wi, Asset) represents a number of times that a given document on the network and a search query term, wi, of the search query are matched by any user, and Σ
#(wi, Asset) represents a sum of the number of times that a given document on the network and any search query term, wi, of the search query are matched by any user.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods of ranking documents on a network using an incrementally-updatable system are disclosed. Computer readable medium having stored thereon computer-executable instructions for performing a method of ranking documents on a network using an incrementally-updatable system are also disclosed. Further, computing systems containing at least one application module, wherein the at least one application module comprises application code for performing methods of ranking documents on a network using an incrementally-updatable system are disclosed.
49 Citations
20 Claims
-
1. A computer readable medium having stored thereon computer-executable instructions for ranking documents on a network in response to a user inputted search query comprising one or more search query terms, said computer-executable instructions utilizing an incrementally-updatable query classifier model that can be updated by updating count values #(Asset), #(wi, Asset) and Σ
- #(wi, Asset), wherein #(Asset) represents a number of times that a given document on the network is selected for viewing by any user, #(wi, Asset) represents a number of times that a given document on the network and a search query term, wi, of the search query are matched by any user, and Σ
#(wi, Asset) represents a sum of the number of times that a given document on the network and any search query term, wi, of the search query are matched by any user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
- #(wi, Asset), wherein #(Asset) represents a number of times that a given document on the network is selected for viewing by any user, #(wi, Asset) represents a number of times that a given document on the network and a search query term, wi, of the search query are matched by any user, and Σ
-
10. A method of incrementally updating a query classifier model suitable for use as a ranking function component in a search engine, said method comprising:
-
determining count values #(Asset), #(wi, Asset) and Σ
#(wi, Asset), wherein #(Asset) represents a number of times that a given document on the network is selected for viewing by any user, #(wi, Asset) represents a number of times that a given document on the network and a search query term, wi, of the search query are matched by any user, and Σ
#(wi, Asset) represents a sum of the number of times that a given document on the network and any search query term, wi, of the search query are matched by any user;storing the count values #(Asset), #(wi, Asset) and Σ
#(wi, Asset); andupdating the stored count values by adding any new data collected during a time period to the previously stored count values #(Asset), #(wi, Asset) and Σ
#(wi, Asset). - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
11. The method of claim 11, wherein the time period is equal to or less than 24 hours in length.
-
18. A computing system containing at least one application module usable on the computing system, wherein the at least one application module comprises application code for performing a method of ranking documents on a network based on document relevance to a user inputted search query, said method comprising the steps of:
-
utilizing formula (I) to determine a document relevance score for each document; and ranking documents in descending order based on the document relevance score for each document; wherein formula (I) comprises wherein; P(Asset|Query) represents a probability of returning a given document, Asset, given a particular user inputted search query, Query; NQ is the number of terms in the search query; V is the size of the vocabulary of the network; #(T) is the total number of search queries that have been processed by any user; #(Asset) represents a number of times that a given document on the network is selected for viewing by any user; #(wi, Asset) represents a number of times that a given document on the network and a search query term, wi, of the search query are matched by any user; and Σ
#(wi, Asset) represents a sum of the number of times that a given document on the network and any search query term, wi, of the search query are matched by any user.- View Dependent Claims (19, 20)
wherein; λ
is a weighing multiplier having a value equal to or less than 1.0; andt is an integer representing an age of a count value component.
-
-
20. The computing system of claim 19, wherein λ
- is less than 1.0.
Specification