USING SPECIFICITY MEASURES TO RANK DOCUMENTS
First Claim
1. A method of ranking documents by specificity values, comprising:
- specifying a reference set of documents, each document including one or more terms;
specifying a first document that includes one or more terms that are included in the reference set of documents;
determining, from the reference set of documents, one or more term-specificity values for the one or more terms of the first document by calculating frequencies of terms within the reference set of documents, wherein a larger term-specificity value corresponds to a lower likelihood relative to the reference set of documents;
determining a document-specificity value for the first document by combining the one or more term-specificity values for the first document, wherein larger term-specificity values correspond to a larger document-specificity value; and
saving one or more values for the document-specificity value of the first document in a computer-readable medium.
3 Assignments
0 Petitions
Accused Products
Abstract
A method of ranking documents by specificity values includes specifying a reference set of documents, each document including one or more terms, and specifying a first document that includes one or more terms that are included in the reference set of documents. The method includes determining, from the reference set of documents, one or more term-specificity values for the one or more terms of the first document by calculating frequencies of terms within the reference set of documents, wherein a larger term-specificity value corresponds to a lower likelihood relative to the reference set of documents, and determining a document-specificity value for the first document by combining the one or more term-specificity values for the first document, wherein larger term-specificity values correspond to a larger document-specificity value.
9 Citations
20 Claims
-
1. A method of ranking documents by specificity values, comprising:
-
specifying a reference set of documents, each document including one or more terms; specifying a first document that includes one or more terms that are included in the reference set of documents; determining, from the reference set of documents, one or more term-specificity values for the one or more terms of the first document by calculating frequencies of terms within the reference set of documents, wherein a larger term-specificity value corresponds to a lower likelihood relative to the reference set of documents; determining a document-specificity value for the first document by combining the one or more term-specificity values for the first document, wherein larger term-specificity values correspond to a larger document-specificity value; and saving one or more values for the document-specificity value of the first document in a computer-readable medium. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-readable medium that stores a computer program for ranking documents by specificity values, wherein the computer program includes instructions for:
-
specifying a reference set of documents, each document including one or more terms; specifying a first document that includes one or more terms that are included in the reference set of documents; determining, from the reference set of documents, one or more term-specificity values for the one or more terms of the first document by calculating frequencies of terms within the reference set of documents, wherein a larger term-specificity value corresponds to a lower likelihood relative to the reference set of documents; determining a document-specificity value for the first document by combining the one or more term-specificity values for the first document, wherein larger term-specificity values correspond to a larger document-specificity value; and saving one or more values for the document-specificity value of the first document. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. An apparatus for ranking documents by specificity values, the apparatus comprising a computer for executing computer instructions, wherein the computer includes computer instructions for:
-
specifying a reference set of documents, each document including one or more terms; specifying a first document that includes one or more terms that are included in the reference set of documents; determining, from the reference set of documents, one or more term-specificity values for the one or more terms of the first document by calculating frequencies of terms within the reference set of documents, wherein a larger term-specificity value corresponds to a lower likelihood relative to the reference set of documents; determining a document-specificity value for the first document by combining the one or more term-specificity values for the first document, wherein larger term-specificity values correspond to a larger document-specificity value; and saving one or more values for the document-specificity value of the first document. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20)
-
Specification