Ranking similar passages
First Claim
1. A computer-implemented method for calculating a score for a passage having a plurality of instances occurring in a digital corpus, comprising:
- calculating at least one score based at least in part on characteristics of instances of the passage occurring in the digital corpus;
generating a ranking score associated with the passage based at least in part on the calculated at least one score; and
storing the ranking score in association with the passage in a computer-readable medium.
2 Assignments
0 Petitions
Accused Products
Abstract
Passages in a digital corpus are scored and ranked based at least in part on characteristics of instances of the passages occurring in the corpus. Such characteristics include the popularity of the author, the characteristics of the words introducing and following the similar passage, frequency of appearance of the passage in the digital corpus, the length of the similar passage, the words of the similar passage, the usage of punctuation with the similar passage, and the diffusion of the similar passage within the digital corpus. The characteristics are scored and weighted to produce ranking scores for the associated passages. The ranking scores are used for purposes including selecting passages to display in association with a document and ranking passages displayed in response to a search.
54 Citations
24 Claims
-
1. A computer-implemented method for calculating a score for a passage having a plurality of instances occurring in a digital corpus, comprising:
-
calculating at least one score based at least in part on characteristics of instances of the passage occurring in the digital corpus; generating a ranking score associated with the passage based at least in part on the calculated at least one score; and storing the ranking score in association with the passage in a computer-readable medium. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer-readable storage medium containing executable program code for calculating a score for a passage having multiple occurrences in a digital corpus, the program code comprising code for:
-
calculating at least one score based at least in part on characteristics of instances of the passage occurring in the digital corpus; generating a ranking score associated with the passage based at least in part on the calculated at least one score; and storing the ranking score in association with the passage in a computer-readable medium. - View Dependent Claims (18, 19, 20)
-
-
21. A computer system for calculating a score for a passage having multiple occurrences in a digital corpus, the system comprising:
a computer-readable storage medium containing executable program code for calculating a score for a passage having multiple occurrences in a digital corpus, the program code comprising code for; calculating at least one score based at least in part on characteristics of instances of the passage occurring in the digital corpus; generating a ranking score associated with the passage based at least in part on the calculated at least one score; and storing the ranking score in association with the passage in a computer-readable medium. - View Dependent Claims (22, 23, 24)
Specification