Method of searching text to find relevant content
First Claim
1. A method of locating relevant documents within a universe of documents, the documents of said universe having been classified so that each document in the universe has a fingerprint, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, the method having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0 for a universe of text on the world wide web and comprising;
a computer processor creating a fingerprint for a piece of text, the fingerprint comprising a list of weights associated with particular topic categories in the classification system, each of the weights in the fingerprint for said first piece of text representing a degree to which the first piece of text relates to the particular topic category that the weight in the fingerprint for said first piece of text is associated with, the weights in the fingerprint for said first piece of text obtained automatically from a computer program,a computer processor searching all or a portion of the universe of documents by comparing the fingerprint for the first piece of text with the fingerprint for each document in the all or a portion of the universe of documents, andranking the all or a portion of the universe of documents based on a degree to which a document has a mathematical overlap with the fingerprint of the first piece of text, the method configured to locate the relevant documents within the universe of documents whether the universe of documents includes text written in one language or in more than one language.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of locating relevant documents wherein documents are given a fingerprint comprising weights associated with particular topic categories of a classification system, each weight representing a degree to which the document relates to the particular topic category, a first piece of text is identified and given a fingerprint comprising a list of other weights associated with similar topic categories, the other weights representing a degree to which the first piece of text relates to the particular topic category. All or a portion of the universe of documents is searched by comparing the fingerprint for the first piece of text with the fingerprint for each document. You select those documents whose fingerprints have a predetermined degree of mathematical overlap with the fingerprint of the first piece of text. A user fingerprint of the user'"'"'s recently accessed texts can be used in place of the first piece of text.
-
Citations
73 Claims
-
1. A method of locating relevant documents within a universe of documents, the documents of said universe having been classified so that each document in the universe has a fingerprint, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, the method having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0 for a universe of text on the world wide web and comprising;
a computer processor creating a fingerprint for a piece of text, the fingerprint comprising a list of weights associated with particular topic categories in the classification system, each of the weights in the fingerprint for said first piece of text representing a degree to which the first piece of text relates to the particular topic category that the weight in the fingerprint for said first piece of text is associated with, the weights in the fingerprint for said first piece of text obtained automatically from a computer program,a computer processor searching all or a portion of the universe of documents by comparing the fingerprint for the first piece of text with the fingerprint for each document in the all or a portion of the universe of documents, and ranking the all or a portion of the universe of documents based on a degree to which a document has a mathematical overlap with the fingerprint of the first piece of text, the method configured to locate the relevant documents within the universe of documents whether the universe of documents includes text written in one language or in more than one language. - View Dependent Claims (2, 3, 4, 5)
- =X<
-
6. A method of locating relevant documents within a universe of documents, the documents of said universe having been classified so that each document in the universe has a fingerprint, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, the method having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0 for a universe of text on the world wide web and comprising;a computer processor creating a fingerprint for a first piece of text, the fingerprint comprising a list of weights associated with particular topic categories in the classification system, each of the weights in the fingerprint for said first piece of text representing a degree to which the first piece of text relates to the particular topic category that the weight in the fingerprint for said first piece of text is associated with, the weights in the fingerprint for said first piece of text obtained automatically from a computer program, a computer processor searching all or a portion of the universe of documents by comparing the fingerprint for the first piece of text with the fingerprint for each document in that all or a portion of the universe of documents, and selecting those documents whose fingerprints have a predetermined degree of mathematical overlap with the fingerprint of the first piece of text, the method configured to locate the relevant documents within the universe of documents whether the universe of documents includes text written in one language or in more than one language. - View Dependent Claims (7, 8, 9)
- =X<
-
10. A method of locating relevant documents within a universe of documents, the method having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0 for a universe of text on the world wide web and comprising;a computer processor creating a fingerprint for each document in the universe of documents, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, a computer processor creating a fingerprint for a first piece of text, the fingerprint comprising a list of weights associated with particular topic categories in the classified system, each of the weights in the fingerprint for said first piece of text representing a degree to which the first piece of text relates to the particular topic category that the weight in the fingerprint for said first piece of text is associated with, the weights in the fingerprint for said first piece of text obtained automatically from a computer program, a computer processor searching all or a portion of the universe of documents by comparing the fingerprint for the first piece of text with the fingerprint for each document in that all or a portion of the universe of documents, and ranking the all or a portion of the universe of documents based on a degree to which a document has a mathematical overlap with the fingerprint of the first piece of text, the method configured to locate the relevant documents within the universe of documents whether the universe of documents includes text written in one language or in more than one language. - View Dependent Claims (11, 12, 13, 72, 73)
- =X<
-
14. A method of locating relevant documents within a universe of documents, the method having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0 for a universe of text on the world wide web and comprising;creating a fingerprint for each document in the universe of documents, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, a computer processor creating a fingerprint for a first piece of text, the fingerprint comprising a list of weights associated with particular topic categories in the classification system, each of the weights in the fingerprint for said first piece of text representing a degree to which the first piece of text relates to the particular topic category that the weight in the fingerprint for said first piece of text is associated with, the weights in the fingerprint for said first piece of text obtained automatically from a computer program, a computer processor searching all or a portion of the universe of documents by comparing the fingerprint for the first piece of text with the fingerprint for each document in that all or a portion of the universe of documents, and selecting those documents whose fingerprints have a predetermined degree of mathematical overlap with the fingerprint of the first piece of text, the method configured to locate the relevant documents within the universe of documents whether the universe of documents includes text written in one language or in more than one language. - View Dependent Claims (15, 16, 17)
- =X<
-
18. A method of re-ranking a list of documents obtained from a search wherein a ranking of a document in the list is determined by a relevance of the document to a search text, the method having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0 for a universe of text on the world wide web and comprising;a computer processor classifying the list of documents so that each document in the list has a fingerprint, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, a computer processor creating a fingerprint for the first piece of text, the fingerprint comprising a list of weights associated with particular topic categories in the classification system, each of the weights in the fingerprint for said first piece of text representing a degree to which the first piece of text relates to the particular topic category that the weight in the fingerprint for said first piece of text is associated with, the weights in the fingerprint for said first piece of text obtained automatically from a computer program, a computer processor searching the list of documents by comparing the fingerprint for the first piece of text with the fingerprint for each document in the list of documents, re-ranking the list of documents based on a degree to which a document in the list has a mathematical overlap with the fingerprint of the first piece of text, the method configured to re-rank the list of documents based on relevance to the search text whether the list of documents includes text written in one language or in more than one language. - View Dependent Claims (19, 20, 21)
- =X<
-
22. A method of re-ranking a list of documents obtained from a search wherein a ranking of a document in the list is determined by a relevance of the document to a search text, and wherein each document in the list has a fingerprint, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document related to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, the method having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0 for a universe of text on the world wide web and comprising;a computer processor creating a fingerprint for a first piece of text, the fingerprint comprising a list of weights associated with particular topic categories in the classification system, each of the weights in the fingerprint for said first piece of text representing a degree to which the first piece of text relates to the particular topic category that the weight in the fingerprint for the first piece of text is associated with, the weights in the fingerprint for said first piece of text obtained automatically from a computer program, a computer processor searching the list of documents by comparing the fingerprint for the first piece of text with the fingerprint for each document in the list of documents, re-ranking the list of documents based on a degree to which a fingerprint of a document in the list has a mathematically overlap with the fingerprint of the first piece of text, the method configured to re-rank the list of documents based on relevance to the search text whether the list of documents includes text written in one language or in more than one language. - View Dependent Claims (23, 24, 25)
- =X<
-
26. A method of re-ranking a list of document obtained from a search wherein a ranking of a document in the list of documents is determined by a relevance of the document to a search text, wherein the list of documents has been classified and appears in an inverted list, said inverted list comprising for each topic category of a classification system a weight associated with a particular document of the list of documents, the weight representing a degree to which the particular document relates to said each topic category, the weights obtained automatically from a computer program, the method having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0 for a universe of text on the world wide web and comprising;creating a fingerprint for a first piece of text, the fingerprint comprising a list of weights associated with particular topic categories in the classification system, each of the weights in the fingerprint for said first piece of text representing a degree to which the first piece of text relates to the particular topic category that the weight in the fingerprint for said first piece of text is associated with, the weights in the fingerprint for said first piece of text obtained automatically from a computer program, a computer processor searching the list of documents by comparing the fingerprint for the first piece of text with the fingerprint for each document in the list of documents, and a computer processor re-ranking the list of documents based on a degree to which a fingerprint of a document in the list has a mathematical overlap with the fingerprint of the first piece of text, the method configured to re-rank the list of documents based on relevance to the search text whether the list of documents includes text written in one language or in more than one language. - View Dependent Claims (27, 28, 29, 30)
- =X<
-
31. A method of re-ranking a list of documents obtained from a search wherein a ranking of a document in the list is determined by a relevance of the document to a search text, the method performed by a computer processor, having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0, and comprising;classifying the list of documents so that each document in the list has a fingerprint, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, providing a user fingerprint, the user fingerprint comprising a list of cumulative weights associated with particular topic categories in the classification system, each of the cumulative weights representing a degree to which text or texts in a link recently accessed by a user relates to the particular topic category that the cumulative weight is associated with, the cumulative weights obtained from weights that in turn were obtained automatically from a computer program, searching the list of documents by comparing the user fingerprint with the fingerprint for each document in the list of documents, and re-ranking the list of documents based on a degree to which a fingerprint of the document in the list has a mathematical overlap with the user fingerprint, the method configured to re-rank the list of documents based on relevance to the search text whether the list of documents includes text written in one language or in more than one language. - View Dependent Claims (32, 33, 34, 35, 71)
- =X<
-
36. A method of re-ranking a list of documents obtained from a search wherein a ranking of a document in the list is determined by a relevance of the document to a search text, and wherein each document in the list has a fingerprint, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, the method performed by a computer processor, having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0, and comprising;providing a user fingerprint, the user fingerprint comprising a list of cumulative weights associated with particular topic categories in the classification system, each of the cumulative weights representing a degree to which text or texts in a link recently accessed by a user relate to the particular topic category that the cumulative weight is associated with, the cumulative weights obtained from weights that in turn were obtained automatically from a computer program, searching the list of documents by comparing the user fingerprint with the fingerprint for each document in the list of documents, and re-ranking the list of documents based on a degree to which a fingerprint of a document in the list has a mathematical overlap with the user fingerprint, the method configured to re-rank the list of documents based on relevance to the search text whether the list of documents includes text written in one language or in more than one language. - View Dependent Claims (37, 38, 39)
- =X<
-
40. A method of re-ranking a list of documents obtained from a search wherein a ranking of a document in the list is determined by a relevance of the document to a search text, wherein the list of documents has been classified and appears in an inverted list, said inverted list comprising for each topic category of a classification system a weight associated with a particular document of the list of documents, the weight representing a degree to which the particular document relates to said each topic category, the weights obtained automatically from a computer program, the method performed by a computer processor, having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0, and comprising;providing a user fingerprint, the user fingerprint comprising a list of cumulative weights associated with particular topic categories in the classification system, each of the cumulative weights representing a degree to which text or texts in a link recently accessed by a user relates to the particular topic category that the cumulative weight is associated with, the cumulative weights obtained from weights that in turn were obtained automatically from a computer program, searching the inverted list by comparing the user fingerprint with the fingerprint for each document in the inverted list, and re-ranking the list of documents by making use of a degree to which a fingerprint of a document in the inverted list has a mathematical overlap with the user fingerprint the method configured to re-rank the list of documents based on relevance to the search text whether the list of documents includes text written in one language or in more than one language. - View Dependent Claims (41, 42, 43, 44)
- =X<
-
45. A method of locating relevant documents within a universe of documents, the documents of said universe having been classified and appears in an inverted list,
said inverted list comprising for each category of a classification system a weight associated with a particular document of the list of documents, each of the weights representing a degree to which the particular document relates to said each topic category, the weights obtained automatically from a computer program, the method having a scalable time complexity of O(Nx) where 0< - =X<
=1.0 for a universe of text on the world wide web and comprising;creating a fingerprint for a first piece of text, the fingerprint comprising a list of weights associated with particular topic categories in the classification system, each of the weights in the fingerprint for said first piece of text representing a degree to which the first piece of text relates to the particular topic category that the weight in the fingerprint for said piece of text is associated with, the weights in the fingerprint for said first piece of text obtained automatically from a computer program, a computer processor searching all or a portion of the universe of documents by comparing the fingerprint for the first piece of text with the fingerprint for each document in that all or a portion of the universe of documents, and ranking the all or a portion of the universe of documents by making use of a degree to which a fingerprint of a document in the inverted list has a mathematical overlap with the fingerprint of the first piece of text, the method configured to locate the relevant documents within the universe of documents whether the universe of documents includes text written in one language or in more than one language. - View Dependent Claims (46, 47, 48, 49)
- =X<
-
50. A method of locating relevant documents within a universe of documents, the documents of said universe having been classified and appears in an inverted list,
said inverted list comprising for each topic category of a classification system a weight associated with a particular document of the list of documents, each of the weights representing a degree to which the particular document relates to said each topic category, the weights obtained automatically from a computer program, the method having a scalable time complexity of O(Nx) where 0< - =X<
=1.0 for a universe of text on the world wide web and comprising;a computer processor creating a fingerprint for a first piece of text, the fingerprint comprising a list of weights associated with particular topic categories in the classification system, each of the weights in the fingerprint for said first piece of text representing a degree to which the first piece of text relates to the particular topic category that the weight in the fingerprint for said first piece of text is associated with, the weights in the fingerprint for said first piece of text obtained automatically from a computer program, a computer processor searching all or a portion of the universe of documents by comparing the fingerprint for the first piece of text with the fingerprint for each document in that all or a portion of the universe of documents, and selecting those documents whose fingerprints have a predetermined degree of mathematical overlap with the fingerprint of the first piece of text, the method configured to locate the relevant documents within the universe of documents whether the universe of documents includes text written in one language or in more than one language. - View Dependent Claims (51, 52, 53, 54)
- =X<
-
55. A method of locating relevant documents within a universe of documents, the documents of said universe having been classified so that each document in the universe has a fingerprint, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, the method having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0 for a universe of text on the world wide web and performed by a computer processor and comprising;providing a user fingerprint, the user fingerprint comprising a list of cumulative weights associated with particular topic categories in the classification system, each of the cumulative weights representing a degree to which text of texts in a link recently accessed by a user related to the particular topic category that the cumulative weight is associated with, the cumulative weights obtained from weights that in turn were obtained automatically from a computer program, searching all or a portion of the universe of documents by comparing the user fingerprint with the fingerprint for each document in that all or a portion of the universe of documents, and ranking the all or a portion of the universe of documents based on a degree to which a document has a mathematical overlap with the user fingerprint, the method configured to locate the relevant documents within the universe of documents whether the universe of documents includes text written in one language or in more than one language. - View Dependent Claims (56, 57, 58)
- =X<
-
59. A method of locating relevant documents within a universe of documents, the documents of said universe having been classified so that each document in the universe has a fingerprint, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, the method having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0 for a universe of text on the world wide web and performed by a computer processor comprising;providing a user fingerprint, the user fingerprint comprising a list of cumulative weights associated with particular topic categories in the classification system, each of the cumulative weights representing a degree to which text or texts in a link recently accessed by a user relates to the particular topic category that the cumulative weight is associated with, the cumulative weights obtained from weights that in turn were obtained automatically from a computer program, searching all or a portion of the universe of documents by comparing the user fingerprint with the fingerprint for each document in that all or a portion of the universe of documents, and selecting those documents whose fingerprints have a predetermined degree of mathematical overlap with the user fingerprint, the method configured to locate the relevant documents within the universe of documents whether the universe of documents includes text written in one language or in more than one language. - View Dependent Claims (60, 61, 62)
- =X<
-
63. A method performed by a computer processor of locating relevant documents within a universe of documents, comprising:
-
creating a fingerprint for each document in the universe of documents, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, the method having a scalable time complexity of O(Nx) where 0<
=X<
=1.0 for a universe of text on the world wide web and performed by a computer processor comprising;providing a user fingerprint, the user fingerprint comprising a list of cumulative weights associated with particular topic categories in the classification system, each of the cumulative weights representing a degree to which text or texts in a link recently accessed by a user relates to the particular topic category that the cumulative weight is associated with, the cumulative weights obtained from weights that in turn were obtained automatically from a computer program, searching all or a portion of the universe of documents by comparing the user fingerprint with the fingerprint for each document in that all or a portion of the universe of documents, and ranking the all or a portion of the universe of documents based on a degree to which a document has a mathematical overlap with the user fingerprint, the method configured to locate the relevant documents within the universe of documents whether the universe of documents includes text written in one language or in more than one language. - View Dependent Claims (64, 65, 66)
-
-
67. A method of locating relevant documents within a universe of documents, the method performed by a computer processor and having a scalable time complexity of O(Nx) where 0<
- =X<
=1.0 for a universe of text on the world wide web, the method comprising;creating a fingerprint for each document in the universe of documents, said fingerprint comprising a list of weights associated with particular topic categories in a classification system, each of the weights representing a degree to which the document relates to the particular topic category that the weight is associated with, the weights obtained automatically from a computer program, providing a user fingerprint, the user fingerprint comprising a list of cumulative weights associated with particular topic categories in the classification system, each of the cumulative weights representing a degree to which text or texts in a link recently accessed by a user relates to the particular topic category that the cumulative weight is associated with, the cumulative weights obtained from weights that in turn were obtained automatically from a computer program, searching all or a portion of the universe of documents by comparing the user fingerprint with the fingerprint for each document in that all or a portion of the universe of documents, and selecting those documents whose fingerprints have a predetermined degree of mathematical overlap with the user fingerprint, the method configured to locate the relevant documents within the universe of documents whether the universe of documents includes text written in one language or in more than one language. - View Dependent Claims (68, 69, 70)
- =X<
Specification