INBOUND CONTENT FILTERING VIA AUTOMATED INFERENCE DETECTION
Abstract
A system is provided to detect email spam. During operation, the system receives an email, extracts a set of keywords from the email body, and constructs a first search query based on a keyword extracted from the email body. The system further constructs a second search query based on the keyword in the first query and one additional word, which is either a known spam word or a word from the subject of the email. Next, the system receives a first number of hits and a second number of hits in response to the first and second search queries, respectively. The system then determines whether the email is spam based on the first number and the second number. The system can also perform Website filtering using inference detection, which is based on search results received in response to search queries formulated with keywords extracted from Websites.
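The two-query comparison described in the abstract can be illustrated with a short Python sketch. The abstract does not say how the two hit counts are combined, so the ratio test and the `threshold` value below are assumptions, as are the function names. The `fake_hit_count` lookup stands in for a real search engine and its numbers are invented for illustration.

```python
from typing import Callable

def cooccurrence_ratio(keyword: str, extra_word: str,
                       hit_count: Callable[[str], int]) -> float:
    """Ratio of hits for 'keyword extra_word' to hits for 'keyword' alone."""
    first_hits = hit_count(keyword)                     # first search query
    second_hits = hit_count(f"{keyword} {extra_word}")  # second search query
    if first_hits == 0:
        return 0.0  # no evidence either way
    return second_hits / first_hits

def looks_like_spam(keyword: str, spam_word: str,
                    hit_count: Callable[[str], int],
                    threshold: float = 0.5) -> bool:
    """Assumed decision rule: flag the email when the body keyword
    co-occurs with a known spam word in a large share of the hits."""
    return cooccurrence_ratio(keyword, spam_word, hit_count) >= threshold

# Hypothetical hit counts standing in for a search engine.
FAKE_HITS = {
    "cheap pills": 1000,
    "cheap pills lottery": 800,
    "quarterly report": 1000,
    "quarterly report lottery": 5,
}

def fake_hit_count(query: str) -> int:
    return FAKE_HITS.get(query, 0)

spam_flag = looks_like_spam("cheap pills", "lottery", fake_hit_count)
ham_flag = looks_like_spam("quarterly report", "lottery", fake_hit_count)
```

Passing the hit-count function as a parameter keeps the decision logic separate from whichever search backend is used.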
20 Claims
1. A computer-executed method for filtering outbound content via inference detection, the method comprising:
identifying previously sent content with a common first destination identifier;
extracting a set of keywords from the identified previously sent content;
issuing a number of Web search queries based on the keywords extracted from the identified content;
receiving a number of hits in response to the queries;
extracting a set of expected-content keywords from the hits;
associating the expected-content keywords with the first destination identifier;
receiving a piece of outbound content;
determining the number of expected-content keywords present in the outbound content; and
producing a result which indicates the destination identifier for the outbound content and at least one expected-content keyword present in the outbound content, thereby allowing a user to determine whether the outbound content has the proper content and destination identifier.
Dependent claims: 2, 3, 4, 5.
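The steps of claim 1 can be sketched in Python as a profile-building pass followed by a check on new outbound content. This is a minimal illustration, not the patented implementation: the frequency-based keyword extractor, the stopword list, the `search` callable, and all names below are assumptions, and `fake_search` returns canned snippets in place of real Web hits.

```python
import re
from collections import Counter
from typing import Callable, Dict, Iterable, List, Set

STOPWORDS = {"the", "for", "and", "with", "please", "this", "that"}

def extract_keywords(text: str, top_n: int = 5) -> List[str]:
    """Assumed extractor: the most frequent non-stopword tokens."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS and len(w) > 2)
    return [w for w, _ in counts.most_common(top_n)]

def build_expected_keywords(previous_messages: Iterable[str],
                            search: Callable[[str], List[str]],
                            top_n: int = 5) -> Set[str]:
    """Extract keywords from previously sent content, query on each one,
    and collect expected-content keywords from the returned hits."""
    expected: Set[str] = set()
    for keyword in extract_keywords(" ".join(previous_messages), top_n):
        for hit_text in search(keyword):
            expected.update(extract_keywords(hit_text, top_n))
    return expected

def check_outbound(destination: str, content: str,
                   profiles: Dict[str, Set[str]]) -> dict:
    """Count expected-content keywords present in the outbound content."""
    words = set(re.findall(r"[a-z]+", content.lower()))
    matched = sorted(profiles.get(destination, set()) & words)
    return {"destination": destination,
            "matched_keywords": matched,
            "match_count": len(matched)}

# Hypothetical search results keyed by query.
def fake_search(query: str) -> List[str]:
    canned = {"kernel": ["linux kernel module driver"],
              "scheduler": ["process scheduler latency"]}
    return canned.get(query, [])

sent = ["Kernel patch for the scheduler", "Scheduler kernel regression"]
profiles = {"dev@example.org": build_expected_keywords(sent, fake_search)}
on_topic = check_outbound("dev@example.org",
                          "Please review the scheduler latency fix", profiles)
off_topic = check_outbound("dev@example.org", "Lunch menu for Friday", profiles)
```

A match count of zero, as in the `off_topic` case, is the kind of result a user could read as a sign that the content or the destination may be wrong.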
6. A computer-executed method for filtering outbound content via inference detection, the method comprising:
receiving a piece of outbound content;
extracting keywords from the outbound content;
issuing Web queries based on the keywords;
extracting keywords from documents corresponding to the hits returned by a search engine in response to the queries;
ranking the keywords extracted from the hits;
selecting one or more ranked keywords to represent the outbound content; and
producing a result which indicates a destination identifier for the outbound content and at least one keyword which is selected to represent the outbound content, thereby allowing a user to determine whether the outbound content has the proper content and destination identifier.
Dependent claims: 7, 8, 9, 10.
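Claim 6 works from the outbound content alone, with no stored per-destination profile. A minimal Python sketch follows. The claim does not specify the ranking criterion, so ranking by the number of hit documents that contain a word is an assumption, as are the tokenizer, the `search` callable, and every name used here. `fake_search` returns canned documents for illustration.

```python
import re
from collections import Counter
from typing import Callable, List

def tokenize(text: str) -> List[str]:
    """Assumed tokenizer: lowercase alphabetic words longer than 3 letters."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if len(w) > 3]

def represent_outbound(destination: str, content: str,
                       search: Callable[[str], List[str]],
                       top_k: int = 3) -> dict:
    """Query on each content keyword, rank words found in the hit
    documents, and keep the top-ranked words to represent the content."""
    doc_freq: Counter = Counter()
    for keyword in sorted(set(tokenize(content))):
        for doc in search(keyword):
            doc_freq.update(set(tokenize(doc)))  # count each word once per document
    # Rank by document frequency, breaking ties alphabetically.
    ranked = sorted(doc_freq.items(), key=lambda item: (-item[1], item[0]))
    return {"destination": destination,
            "representative_keywords": [word for word, _ in ranked[:top_k]]}

# Hypothetical hit documents keyed by query.
def fake_search(query: str) -> List[str]:
    canned = {"merger": ["merger acquisition finance", "merger finance deal"],
              "terms": ["finance terms sheet"]}
    return canned.get(query, [])

result = represent_outbound("legal@example.org", "Draft merger terms", fake_search)
```

In this example the top-ranked word, "finance", never appears in the message itself, which is the inference the Web hits contribute.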
11. A computer system for filtering outbound content via inference detection, the computer system comprising:
a processor;
a memory coupled to the processor;
a content-selection mechanism configured to identify previously sent content with a common first destination identifier;
a keyword-extraction mechanism configured to extract a set of keywords from the identified previously sent content;
a query mechanism configured to:
  issue a number of Web search queries based on the keywords extracted from the identified content;
  receive a number of hits in response to the queries;
  extract a set of expected-content keywords from the hits; and
  associate the expected-content keywords with the first destination identifier;
a receiving mechanism configured to receive a piece of outbound content;
a decision mechanism configured to determine the number of expected-content keywords present in the outbound content; and
a display mechanism configured to produce a result which indicates the destination identifier for the outbound content and at least one expected-content keyword present in the outbound content, thereby allowing a user to determine whether the outbound content has the proper content and destination identifier.
Dependent claims: 12, 13, 14, 15.
16. A computer system for filtering outbound content via inference detection, the computer system comprising:
a processor;
a memory coupled to the processor;
a receiving mechanism configured to receive a piece of outbound content;
a first keyword-extraction mechanism configured to extract keywords from the outbound content;
a query formulation mechanism configured to issue Web queries based on the keywords;
a second keyword-extraction mechanism configured to extract keywords from documents corresponding to the hits returned by a search engine in response to the queries;
a ranking mechanism configured to rank the keywords extracted from the hits;
a selection mechanism configured to select one or more ranked keywords to represent the outbound content; and
a display mechanism configured to produce a result which indicates a destination identifier for the outbound content and at least one keyword which is selected to represent the outbound content, thereby allowing a user to determine whether the outbound content has the proper content and destination identifier.
Dependent claims: 17, 18, 19, 20.
Specification