Fast adaptive document filtering
First Claim
Patent Images
1. A computer-readable storage medium having instructions that, when executed by a processor, cause the processor to:
- receive a new document;
create a document statistics file that indicates a frequency of a first set of terms in the new document;
determine a profile score using the document statistics file and information associated with a user profile;
notify a user associated with the user profile if the profile score exceeds a profile threshold value;
update a reference dictionary file to include the first set of terms from the new document, the reference dictionary file including a second set of terms located in a plurality of documents stored in a document repository and a third set of terms not located in the plurality of documents stored in the document repository;
update a document dictionary index to include the first set of terms from the new document;
create a parsed term data structure based on the reference dictionary file; and
update the user profile based on the parsed term data structure.
1 Assignment
0 Petitions
Accused Products
Abstract
Data structures, stored on various types of computer-readable media, include information related to user profiles and/or to various documents. The information included in these data structures is arranged and stored in manner that allows for rapid user profile updating to be performed as new or changed documents are processed in a document filtering system.
16 Citations
19 Claims
-
1. A computer-readable storage medium having instructions that, when executed by a processor, cause the processor to:
-
receive a new document; create a document statistics file that indicates a frequency of a first set of terms in the new document; determine a profile score using the document statistics file and information associated with a user profile; notify a user associated with the user profile if the profile score exceeds a profile threshold value; update a reference dictionary file to include the first set of terms from the new document, the reference dictionary file including a second set of terms located in a plurality of documents stored in a document repository and a third set of terms not located in the plurality of documents stored in the document repository; update a document dictionary index to include the first set of terms from the new document; create a parsed term data structure based on the reference dictionary file; and update the user profile based on the parsed term data structure. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising:
-
receiving a new document at a document filtering system; determining a frequency of first terms in the new document; storing the frequency of the first terms in a document statistics file; determining a profile score based on the document statistics file and based on a user profile; in response to determining that the profile score does not satisfy a profile threshold, notifying a user associated with the user profile; updating a reference dictionary file to include the first terms from the new document, the reference dictionary file including second terms located in a plurality of documents stored in a document repository and including third terms not located in the plurality of documents; updating a document dictionary index to include the first terms from the new document, the document dictionary index including the second terms located in the plurality of documents; creating a parsed term data structure including the first terms and the second terms in the reference dictionary file; and updating the user profile based on the parsed term data structure. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
a processor; a memory; and an adaptive document filter module stored in the memory and executable by the processor to; receive a new document; create a document statistics file comprising a frequency of a first set of terms included in the new document; retrieve a user profile; calculate a profile score using the document statistics file and the user profile; notify a user associated with the user profile when the profile score satisfies a profile threshold; update a reference dictionary file to include the first set of terms from the new document, the reference dictionary file including a second set of terms located in a plurality of documents stored in a document repository and a third set of terms not located in the plurality of documents stored in the document repository; update a document dictionary index to include the first set of terms from the document; create a parsed term data structure including the first set of terms and the second set of terms; and update the user profile based on the parsed term data structure. - View Dependent Claims (16, 17, 18, 19)
-
Specification