Determining quality of linked documents
First Claim
Patent Images
1. A method performed by a device, the method comprising:
- identifying, by a processor of the device, a plurality of documents, where a first one of the identified documents is linked by a second one of the identified documents and the second document is one of a plurality of documents in an affiliated set of documents;
calculating, by the processor, a first value for each document in the affiliated set of documents based on a ranking score of the document and a number of outbound links from the document;
calculating, by the processor, a second value as a maximum of the first values for the documents in the affiliated set of documents;
assigning, by the processor, a ranking score to the first document based the second value, where assigning the ranking score includes;
determining whether the documents in the affiliated set of documents are weakly affiliated or strongly affiliated, andsetting the amount that the second document contributes to the ranking score of the first document as a function that acts as a summation operator over the affiliated set of documents when the affiliated set is weakly affiliated and as a maximum operator over the affiliated set of documents when the affiliated set is strongly affiliated, where the function is defined as;
(CONTRIB(D1)a+CONTRIB(D2)a+ . . . +CONTRIB(Dk)a)1/a,where CONTRIB for document Dk represents an individual ranking score contribution for document k in the affiliated set, and a is defined as where e is a constant and γ
represents a continuous measure of the affiliation of the documents in the affiliated set; and
storing, by the processor, the ranking score.
2 Assignments
0 Petitions
Accused Products
Abstract
A ranking component ranks documents, such as web pages or web sites, to obtain a ranking score that defines a quality judgment of the document. The ranking score of a particular document is based on the ranking score of the documents which link to it and based on affiliation among the documents.
21 Citations
21 Claims
-
1. A method performed by a device, the method comprising:
-
identifying, by a processor of the device, a plurality of documents, where a first one of the identified documents is linked by a second one of the identified documents and the second document is one of a plurality of documents in an affiliated set of documents; calculating, by the processor, a first value for each document in the affiliated set of documents based on a ranking score of the document and a number of outbound links from the document; calculating, by the processor, a second value as a maximum of the first values for the documents in the affiliated set of documents; assigning, by the processor, a ranking score to the first document based the second value, where assigning the ranking score includes; determining whether the documents in the affiliated set of documents are weakly affiliated or strongly affiliated, and setting the amount that the second document contributes to the ranking score of the first document as a function that acts as a summation operator over the affiliated set of documents when the affiliated set is weakly affiliated and as a maximum operator over the affiliated set of documents when the affiliated set is strongly affiliated, where the function is defined as;
(CONTRIB(D1)a+CONTRIB(D2)a+ . . . +CONTRIB(Dk)a)1/a,where CONTRIB for document Dk represents an individual ranking score contribution for document k in the affiliated set, and a is defined as where e is a constant and γ
represents a continuous measure of the affiliation of the documents in the affiliated set; andstoring, by the processor, the ranking score. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
where α and
β
are predetermined constants, the sum is taken over m affiliated sets of documents that link to the particular document, and SETCONTRIB is the function (CONTRIB(D1)ai +CONTRIB(D2)ai + . . . +CONTRIB(Dk)ai )1/ai for each of the m sets of affiliated documents.
-
-
13. The method of claim 1, wherein the function is defined as:
-
where THRESHOLD is set to a predetermined value, RankingScore(D) represents the ranking score of a document D, and OutD(D) represents a number of outbound links from document D.
-
-
14. A device comprising:
-
a set location component to identify documents, where a first one of the documents is linked by a second one of the documents and the second document is one of a plurality of documents in an affiliated set of documents; and a ranking component that assigns a ranking score to the first document by; calculating a first value for each document in the affiliated set of documents based on a ranking score of the document and a number of outbound links from the document; calculating a second value as one of the first values for the documents in the affiliated set of documents; assigning a ranking score to the first document based the second value, where the ranking component assigns the ranking score by; determining whether the documents in the affiliated set of documents are weakly affiliated or strongly affiliated, and setting the amount that the second document contributes to the ranking score of the first document as a function that acts as a summation operator over the affiliated set of documents when the affiliated set is weakly affiliated and as a maximum operator over the affiliated set of documents when the affiliated set is strongly affiliated, where the function is defined as;
(CONTRIB(D1)a+CONTRIB(D2)a+ . . . +CONTRIB(Dk)a)1/a,where CONTRIB for document Dk represents an individual ranking score contribution for document k in the affiliated set, and a is defined as where e is a predetermined constant and γ
represents a continuous measure of the affiliation of the documents in the affiliated set; andstoring the ranking score. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21)
-
Specification