Method and system for ranking documents of a search result to improve diversity and information richness
First Claim
1. A computing device for ranking documents of a search result, each document having words, comprising:
- a memory having computer-executable instructions and a processor for executing the computer-executable instructions stored in the memory, the executed computer-executable instructions perform steps comprising;
receiving from a user a query;
identifying documents as a search result for the received query;
identifying keywords of the identified documents;
for each pair of identified documents of the search result, calculating an affinity measurement indicating affinity that a document of the pair has to the other document of the pair, wherein affinity measurement is calculated based on the following;
2 Assignments
0 Petitions
Accused Products
Abstract
A method and system for ranking documents of search results based on information richness and diversity of topics. A ranking system determines the information richness of each document within a search result. The ranking system groups documents of a search result based on their relatedness, meaning that they are directed to similar topics. The ranking system ranks the documents to ensure that the highest ranking documents may include at least one document covering each topic, that is, one document from each of the groups. The ranking system selects the document from each group that has the highest information richness of the documents within the group. When the documents are presented to a user in rank order, the user will likely find on the first page of the search result documents that cover a variety of topics, rather than just a single popular topic.
-
Citations
8 Claims
-
1. A computing device for ranking documents of a search result, each document having words, comprising:
-
a memory having computer-executable instructions and a processor for executing the computer-executable instructions stored in the memory, the executed computer-executable instructions perform steps comprising; receiving from a user a query; identifying documents as a search result for the received query; identifying keywords of the identified documents; for each pair of identified documents of the search result, calculating an affinity measurement indicating affinity that a document of the pair has to the other document of the pair, wherein affinity measurement is calculated based on the following; - View Dependent Claims (2, 3)
-
-
4. A method in a computer system with a processor and a memory for calculating information richness of a document within a collection of documents, the documents in the collection having words, the method comprising:
-
identifying by the processor an affinity each document in the collection has to another document in the collection, wherein the affinity is identified for each pair of documents with one document of the pair being the document and the other document of the pair being another document in the collection of documents, affinity indicating to what extent the information of one document is subsumed by the information of another document; determining by the processor information richness for each document in the collection based on an element of a normalized affinity matrix that is derived from an affinity matrix; storing in the memory indications of the determined information richnesses of the documents in the collection; and ranking the documents based on the stored indications of information richnesses, wherein the determined information richness for each document is defined as
-
-
5. A computing device for calculating information richness of a document within a collection of web pages, the web pages having words, comprising:
-
a memory having computer-executable instructions and a processor for executing the computer-executable instructions stored in the memory, the executed computer-executable instructions perform steps comprising; identifying an affinity of each web page in the collection has to the another web page in the collection, wherein the affinity is identified for each pair of web pages with one web page of the pair being the web page and the other web page of the pair being another web page in the collection of web pages, affinity indicating to what extent the information of one web page is subsumed by the information of another web page; determining information richness for each web page in the collection based on an element of a normalized affinity matrix that is derived from an affinity matrix; storing in the memory indications of the determined information richnesses of the web pages in the collection; and ranking the web pages based on the stored indications of information richnesses, wherein the determined information richness for each web page is defined as - View Dependent Claims (6, 7, 8)
-
Specification