Method and apparatus for filtering email
First Claim
1. A method for filtering messages with a data processing system having a processor and memory, the method comprising:
- storing in the memory a first semantic representation corresponding to a first group of messages and a second semantic representation corresponding to a second group of messages using at least one matrix stored in the memory, wherein the first group of messages are defined as legitimate messages and the second group of messages are defined as unsolicited messages;
determining a mathematical representation corresponding to an incoming message;
comparing the mathematical representation corresponding to the incoming message with at least one of the first semantic representation and the second semantic representation to obtain at least one of a first comparison value and a second comparison value;
designating the incoming message as ambiguous through classifying the incoming message between the first and second groups based on at least one of the first comparison value and the second comparison value; and
prompting a user to determine whether the ambiguous message is legitimate or unsolicited in response to the designating, wherein the user is permitted to discard the ambiguous message, and wherein the message is included into a training email corpus in response to the user'"'"'s determination, and wherein the first semantic representation and the second representation are calculated using latent semantic analysis.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for filtering messages comprising determining a first semantic anchor corresponding to a first group of messages, for example, legitimate messages and a second semantic anchor corresponding to a second group of messages, for example, unsolicited messages. Determining a vector corresponding to an incoming message; comparing the vector corresponding to the incoming message with at least one of the first semantic anchor and the second semantic anchor to obtain a first comparison value and a second comparison value; and filtering the incoming message based on the first comparison value and the second comparison value.
-
Citations
35 Claims
-
1. A method for filtering messages with a data processing system having a processor and memory, the method comprising:
-
storing in the memory a first semantic representation corresponding to a first group of messages and a second semantic representation corresponding to a second group of messages using at least one matrix stored in the memory, wherein the first group of messages are defined as legitimate messages and the second group of messages are defined as unsolicited messages; determining a mathematical representation corresponding to an incoming message; comparing the mathematical representation corresponding to the incoming message with at least one of the first semantic representation and the second semantic representation to obtain at least one of a first comparison value and a second comparison value; designating the incoming message as ambiguous through classifying the incoming message between the first and second groups based on at least one of the first comparison value and the second comparison value; and prompting a user to determine whether the ambiguous message is legitimate or unsolicited in response to the designating, wherein the user is permitted to discard the ambiguous message, and wherein the message is included into a training email corpus in response to the user'"'"'s determination, and wherein the first semantic representation and the second representation are calculated using latent semantic analysis. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. An article of manufacture comprising:
-
a machine-accessible medium including instructions that, when executed by a machine, causes the machine to perform operations comprising; storing a first semantic representation corresponding to a first group of messages and a second semantic representation corresponding to a second group of messages using at least one matrix, wherein the first group of messages are defined as legitimate messages and the second group of messages are defined as unsolicited messages; determining a mathematical representation corresponding to an incoming message; comparing the mathematical representation corresponding to the incoming message with at least one of the first semantic representation and the second semantic representation to obtain at least one of a first comparison value and a second comparison value; designating the incoming message as ambiguous through classifying the incoming message between the first and second groups based on at least one of the first comparison value and the second comparison value; and prompting a user to determine whether the ambiguous message is legitimate or unsolicited in response to the designating, wherein the user is permitted to discard the ambiguous message, and wherein the message is included into a training email corpus in response to the user'"'"'s determination, and wherein the first semantic representation and the second representation are calculated using latent semantic analysis. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A computer system comprising:
-
a bus; a data storage device coupled to said bus; a processor coupled to said data storage device; a singular value decomposition unit communicatively coupled to the processor; and
wherein the data storage device stores a first semantic representation corresponding to a first group of messages and a second semantic representation corresponding to a second group of messages using at least one matrix, wherein the first group of messages are defined as legitimate messages and the second group of messages are defined as unsolicited messages;an incoming email conversion unit communicatively coupled to the singular value decomposition unit to determine a mathematical representation corresponding to an incoming message; a logic unit communicatively coupled to the incoming email conversion unit and the singular value decomposition unit to compare the mathematical representation corresponding to the incoming message with at least one of the first semantic representation and the second semantic representation to obtain at least one of a first comparison value and a second comparison value, and to designate the incoming message as ambiguous through classifying the incoming message between the first and second groups based on at least one of the first comparison value and the second comparison value, and wherein the logic unit is configured to prompt a user to determine whether the ambiguous message is legitimate or unsolicited in response to the designating, wherein the user is permitted to discard the ambiguous message, and wherein the message is included into a training email corpus in response to the user'"'"'s determination, and wherein the first semantic representation and the second representation are calculated using latent semantic analysis. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34, 35)
-
Specification