Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance
First Claim
1. A method for retrieving similarity values from a data structure for a set of documents, comprising:
- accessing the data structure to determine whether the data structure contains an explicit entry for a similarity value for a given pair of documents;
if the data structure contains an explicit entry of the similarity value for the given pair of documents, retrieving the similarity value; and
if the data structure does not contain an explicit entry of the similarity value for the given pair of documents, retrieving a default similarity value from the data structure or from another memory location,wherein the data structure comprises a plurality of entries of the similarity values, and wherein the plurality of entries of the similarity values are fewer than N2−
N in number if the similarity values are asymmetric with regard to document pairing and wherein the plurality of entries of the similarity values are fewer than
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-readable medium comprises data structure for providing information about levels of similarity between pairs of N documents. The data structure comprises a plurality of entries of similarity values representing levels of similarity for a plurality of pairs of the documents. Each of the similarity values represents a level of similarity of one document of a given pair relative to the other document of the given pair. The similarity value of each entry is greater than a threshold similarity value that is greater than zero. The plurality of similarity-value entries are fewer than N2−N in number if the similarity values are asymmetric with regard to document pairing, and the plurality of similarity-value entries are fewer than
in number if the similarity values are symmetric with regard to document pairing. A method and apparatus for generating the data structure are described.
12 Citations
9 Claims
-
1. A method for retrieving similarity values from a data structure for a set of documents, comprising:
-
accessing the data structure to determine whether the data structure contains an explicit entry for a similarity value for a given pair of documents; if the data structure contains an explicit entry of the similarity value for the given pair of documents, retrieving the similarity value; and if the data structure does not contain an explicit entry of the similarity value for the given pair of documents, retrieving a default similarity value from the data structure or from another memory location, wherein the data structure comprises a plurality of entries of the similarity values, and wherein the plurality of entries of the similarity values are fewer than N2−
N in number if the similarity values are asymmetric with regard to document pairing and wherein the plurality of entries of the similarity values are fewer than - View Dependent Claims (2, 3)
-
-
4. A computer readable medium storing processing instructions adapted to cause a processing unit to execute a method, comprising:
-
accessing the data structure to determine whether the data structure contains an explicit entry for a similarity value for a given pair of documents; if the data structure contains an explicit entry of the similarity value for the given pair of documents, retrieving the similarity value; and if the data structure does not contain an explicit entry of the similarity value for the given pair of documents, retrieving a default similarity value from the data structure or from another memory location, wherein the data structure comprises a plurality of entries of the similarity values, and wherein the plurality of entries of the similarity values are fewer than N2−
N in number if the similarity values are asymmetric with regard to document pairing and wherein the plurality of entries of the similarity values are fewer than
-
-
5. An apparatus for retrieving similarity values from a data structure for a set of documents, comprising:
-
a memory; and a processing unit coupled to the memory, wherein the processing unit is configured to execute the steps of; accessing the data structure to determine whether the data structure contains an explicit entry for a similarity value for a given pair of documents; if the data structure contains an explicit entry of the similarity value for the given pair of documents, retrieving the similarity value; and if the data structure does not contain an explicit entry of the similarity value for the given pair of documents, retrieving a default similarity value from the data structure or from another memory location, wherein the data structure comprises a plurality of entries of the similarity values, and wherein the plurality of entries of the similarity values are fewer than N2−
N in number if the similarity values are asymmetric with regard to document pairing and wherein the plurality of entries of the similarity values are fewer than
-
-
6. A computer-readable medium having stored thereon a data structure for providing information about levels of similarity between pairs of documents of a set of documents, the documents being N in number, the data structure comprising:
-
a plurality of entries of similarity values representing levels of similarity for a plurality of pairs of said documents, each of said similarity values representing a level of similarity of one document of a given pair relative to the other document of the given pair, wherein the similarity value of each entry is greater than a threshold similarity value that is greater than zero, and wherein the plurality of entries of similarity values are fewer than N2−
N in number if the similarity values are asymmetric with regard to document pairing and wherein the plurality of entries of similarity values are fewer than - View Dependent Claims (7, 8, 9)
-
Specification