Automatically identifying similar purchasing opportunities
First Claim
1. A method in a computer system for identifying documents in a set of documents relevant to a distinguished document, comprising the steps of:
- determining, in the computer system, an inverse document frequency for each one of a plurality of key words within the distinguished document with respect to the set of documents;
identifying, in the computer system, the key words within the distinguished document having the highest ones of the inverse document frequencies;
for each of the identified key words, conducting, in the computer system, a search for at least a subset of the documents containing at least one of the identified key words;
generating, in the computer system, a score associated with each of the at least a subset of the documents, the score calculated by summing the inverse document frequency of each identified key word contained within each of the at least a subset of the documents;
ranking, in the computer system, the at least a subset of the documents of the set based at least upon the score.
2 Assignments
0 Petitions
Accused Products
Abstract
A facility for identifying purchasing opportunities within a set of purchasing opportunities that are similar to a distinguished purchasing opportunity is described. The distinguished purchasing opportunity has descriptive information associated with it. For each of several terms occurring in this descriptive information, the facility generates a term score. Each term score reflects the extent to which the occurrence of the term and the descriptive information associated with the distinguished purchasing opportunity differentiates the distinguished purchasing opportunity from auto purchasing opportunities in the set. The facility then selects as key words the terms occurring in the descriptive information associated with the distinguished purchasing opportunity that have the highest term scores. The facility identifies purchasing opportunities of the set containing the selected key words, and establishes a purchasing opportunity score for each identified purchasing opportunity by summing the term score of the key words occurring in information associated with the identified purchasing opportunities.
36 Citations
19 Claims
-
1. A method in a computer system for identifying documents in a set of documents relevant to a distinguished document, comprising the steps of:
-
determining, in the computer system, an inverse document frequency for each one of a plurality of key words within the distinguished document with respect to the set of documents; identifying, in the computer system, the key words within the distinguished document having the highest ones of the inverse document frequencies; for each of the identified key words, conducting, in the computer system, a search for at least a subset of the documents containing at least one of the identified key words; generating, in the computer system, a score associated with each of the at least a subset of the documents, the score calculated by summing the inverse document frequency of each identified key word contained within each of the at least a subset of the documents; ranking, in the computer system, the at least a subset of the documents of the set based at least upon the score. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for identifying documents in a set of documents relevant to a distinguished document, comprising:
-
a server computer system; logic executed in the server computer system that determines an inverse document frequency for each one of a plurality of key words within the distinguished document with respect to the set of documents; logic executed in the server computer system that identifies the key words within the distinguished document having the highest ones of the inverse document frequencies; logic that, for each of the identified key words, conducts a search for documents from the set of documents containing at least one of the identified key words; and logic that, for each document identified in the search, identifies a relevant document where a sum of the inverse document frequencies of the identified keywords contained in the document exceeds a threshold. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. A system for identifying documents in a set of documents relevant to a distinguished document, comprising:
-
means for determining an inverse document frequency for each one of a plurality of key words within the distinguished document with respect to the set of documents; means for identifying the key words within the distinguished document having the highest ones of the inverse document frequencies; means for conducting, for each of the identified key words, a search for at least a subset of the documents containing at least one of the identified key words; means for generating a score associated with each of the at least a subset of the documents, the score calculated by summing the inverse document frequency of each identified key word contained within each of the at least a subset of the documents; means for ranking the at least a subset of the documents of the set based at least upon the score. - View Dependent Claims (17, 18, 19)
-
Specification