System and method for processing message threads
First Claim
1. A system for processing message threads, comprising:
a thread module to group a plurality of messages, each comprising a message body, by conversation thread;
a message body comparer module to compare the message bodies of the messages;
a near duplicate module to identify each message body recursively contained in at least one other message as a near duplicate message and to identify each message body matching at least one other message as an exact duplicate message;
an attachment module to generate an attachment sequence for at least part of each attachment associated with one or more of the messages;
a message sequence module to generate a message sequence for each of the messages;
a concatenator module to concatenate the message sequence and the attachment sequence into a compound digest for each message;
a digest comparer module to compare the compound digests of each message;
a marker module to mark each near duplicate message and exact duplicate message having a compound digest not matching the compound digest of any other message as a unique message; and
a processor to execute each of the modules, which are stored on a computer-readable storage medium.
Abstract
A system and method for processing message threads is provided. A plurality of messages, each comprising a message body, is grouped by conversation thread. The message bodies of the messages are compared. Each message recursively contained in at least one other message is identified as a near duplicate message. An attachment sequence is generated for at least part of each attachment associated with one or more of the messages. The attachment sequences associated with the near duplicate messages are compared. Each near duplicate message having an attachment sequence not matching the attachment sequence of any other near duplicate message is identified as a unique message.
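The flow summarized in the abstract can be illustrated with a minimal Python sketch. Everything named here is a hypothetical stand-in and not taken from the patent: the `Message` record, the `normalize` helper, the use of plain substring containment as a proxy for "recursively contained", and SHA-256 as the attachment hash are all assumptions made only for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass, field
import hashlib


@dataclass
class Message:
    # Hypothetical record; field names are illustrative only.
    msg_id: str
    thread_id: str
    body: str
    attachments: list[bytes] = field(default_factory=list)


def normalize(body: str) -> str:
    """Drop leading quote markers and collapse whitespace (assumed preprocessing)."""
    lines = (line.lstrip("> ").strip() for line in body.splitlines())
    return " ".join(" ".join(lines).split())


def attachment_sequence(msg: Message) -> tuple[str, ...]:
    """One SHA-256 hash per attachment, sorted so attachment order is ignored."""
    return tuple(sorted(hashlib.sha256(a).hexdigest() for a in msg.attachments))


def find_unique_near_duplicates(messages):
    """Return (near_duplicate_ids, unique_ids) following the abstract's steps."""
    # Step 1: group messages by conversation thread.
    threads = defaultdict(list)
    for m in messages:
        threads[m.thread_id].append(m)

    near, unique = set(), set()
    for thread in threads.values():
        bodies = {m.msg_id: normalize(m.body) for m in thread}
        # Step 2: a message whose body appears inside another message's body
        # is treated as a near duplicate (identical bodies also qualify here).
        near_msgs = [
            m for m in thread
            if bodies[m.msg_id] and any(
                o.msg_id != m.msg_id and bodies[m.msg_id] in bodies[o.msg_id]
                for o in thread
            )
        ]
        # Step 3: a near duplicate whose attachment sequence matches no other
        # near duplicate's attachment sequence is kept as unique.
        for m in near_msgs:
            near.add(m.msg_id)
            seq = attachment_sequence(m)
            if all(attachment_sequence(o) != seq for o in near_msgs if o is not m):
                unique.add(m.msg_id)
    return near, unique
```

In this sketch, a reply that quotes an earlier message makes the earlier message a near duplicate, but the earlier message is still kept as unique if it carries an attachment that no other near duplicate carries.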
20 Claims
11. A method processed by a computer processor for processing message threads, comprising:
grouping a plurality of messages, each comprising a message body, by conversation thread;
comparing the message bodies of the messages;
identifying each message body recursively contained in at least one other message as a near duplicate message and each message body matching at least one other message as an exact duplicate message;
generating an attachment sequence for at least part of each attachment associated with one or more of the messages;
generating a message sequence for each of the messages;
concatenating the message sequence and the attachment sequence into a compound digest for each message;
comparing the compound digest of each message; and
marking each near duplicate message and exact duplicate message having a compound digest not matching the compound digest of any other message as a unique message.
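The compound digest steps of the method claim can be sketched as follows. This is only an illustration under stated assumptions: the claim does not name a hash function, so SHA-256 is assumed here, and the function names `message_sequence`, `attachment_sequence`, `compound_digest`, and `mark_unique`, along with the dictionary layout of the input, are hypothetical.

```python
import hashlib
from collections import Counter


def _sha(data: bytes) -> str:
    # Assumed hash; the claim does not specify one.
    return hashlib.sha256(data).hexdigest()


def message_sequence(body: str) -> str:
    """Sequence derived from the message body."""
    return _sha(body.encode("utf-8"))


def attachment_sequence(attachments: list[bytes]) -> str:
    """Sequence derived from the attachments (empty string if there are none)."""
    return "".join(_sha(a) for a in attachments)


def compound_digest(body: str, attachments: list[bytes]) -> str:
    """Concatenate the message sequence and the attachment sequence."""
    return message_sequence(body) + attachment_sequence(attachments)


def mark_unique(messages: dict, duplicate_ids: set) -> set:
    """Return the duplicate messages whose compound digest matches no other message.

    messages: msg_id -> (body, attachments) for every message.
    duplicate_ids: ids already identified as near or exact duplicates.
    """
    digests = {mid: compound_digest(b, a) for mid, (b, a) in messages.items()}
    counts = Counter(digests.values())
    return {mid for mid in duplicate_ids if counts[digests[mid]] == 1}
```

For example, given three messages with the same body where only one carries an attachment, the two without attachments share a digest and are not marked, while the third is marked as unique.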
Specification