SYSTEM AND METHOD FOR EVALUATING A STRUCTURED MESSAGE STORE FOR MESSAGE REDUNDANCY
First Claim
1. A system for evaluating a structured message store for message redundancy, comprising:
- a parser extracting a header and a message body from each of a plurality of messages maintained in a structured message store;
a digester calculating a substantially unique hash code over at least part of the header and over the message body of each message; and
a message deduper grouping the messages by the hash codes and identifying one such message as a unique message within each group.
12 Assignments
0 Petitions
Accused Products
Abstract
A system and method for evaluating a structured message store for message redundancy is described. A header and a message body are extracted from each of a plurality of messages maintained in a structured message store. A substantially unique hash code is calculated over at least part of the header and over the message body of each message. The messages are grouped by the hash codes. One such message is identified as a unique message within each group. In a further embodiment, the messages are grouped by conversation thread. The message body for each message within each conversation thread group is compared. At least one such message within each conversation thread group is identified as a unique message.
-
Citations
38 Claims
-
1. A system for evaluating a structured message store for message redundancy, comprising:
-
a parser extracting a header and a message body from each of a plurality of messages maintained in a structured message store;
a digester calculating a substantially unique hash code over at least part of the header and over the message body of each message; and
a message deduper grouping the messages by the hash codes and identifying one such message as a unique message within each group. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method for evaluating a structured message store for message redundancy, comprising:
-
extracting a header and a message body from each of a plurality of messages maintained in a structured message store;
calculating a substantially unique hash code over at least part of the header and over the message body of each message; and
grouping the messages by the hash codes and identifying one such message as a unique message within each group. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. An apparatus for evaluating a structured message store for message redundancy, comprising:
-
means for extracting a header and a message body from each of a plurality of messages maintained in a structured message store;
means for calculating a substantially unique hash code over at least part of the header and over the message body of each message; and
means for grouping the messages by the hash codes and identifying one such message as a unique message within each group.
-
-
27. A system for culling duplicative messages maintained in a structured message store, comprising:
-
a message extractor retrieving a plurality of messages maintained in a structured message store with each message comprising a header and a message body;
a digester calculating a substantially unique hash code over at least part of the header and over the message body;
a message deduper, comprising;
a comparer comparing the hash codes for each message within each group; and
a culling module culling each message having an hash code matching the hash code for at least one other message within the group and retaining one such non-culled message as a unique message. - View Dependent Claims (28, 29, 30, 31)
-
-
32. A method for culling duplicative messages maintained in a structured message store, comprising:
-
retrieving a plurality of messages maintained in a structured message store with each message comprising a header and a message body;
calculating a substantially unique hash code over at least part of the header and over the message body;
comparing the hash codes for each message within each group; and
culling each message having an hash code matching the hash code for at least one other message within the group; and
retaining one such non-culled message as a unique message. - View Dependent Claims (33, 34, 35, 36, 37)
-
-
38. An apparatus for culling duplicative messages maintained in a structured message store, comprising:
-
means for retrieving a plurality of messages maintained in a structured message store with each message comprising a header and a message body;
means for calculating a substantially unique hash code over at least part of the header and over the message body;
means for comparing the hash codes for each message within each group; and
means for culling each message having an hash code matching the hash code for at least one other message within the group; and
means for retaining one such non-culled message as a unique message.
-
Specification