Electronic mail duplicate detection
First Claim
Patent Images
1. A method for identifying linked electronic mails, the method comprising:
- receiving a query from a user, wherein the query comprises at least a segment of an electronic mail;
based on the segment received, identifying linked electronic mails via rendering to the user at least one of related subsets or related supersets of electronic mails related to the received segment, wherein the related subsets and related supersets are threads of the segment received and arranged in hierarchy;
said rendering comprising;
detecting at least one match between a query root segment signature of one the received segment and a root segment signature index, the root segment signature index comprising at least one of a word index and a metadata index, wherein the word index comprises at least one of a keyword and, and wherein the metadata index comprises at least one of temporal information and navigation information;
said detecting comprising;
receiving a pre-defined token match threshold;
identifying a set of root segment signatures of the root segment signature index;
comparing the query root segment signature with each root segment signature of the set of root segment signatures of the root segment signature index;
identifying a subset of the root segment signature index, wherein a match between the root segment signature and the query root segment signature is at least the pre-defined token match threshold; and
building an electronic mail thread hierarchy based on the at least one detected match.
2 Assignments
0 Petitions
Accused Products
Abstract
Embodiments of the invention are related to a method and system for identifying linked electronic mails by receiving a query from a user, wherein the query comprises at least a segment of an electronic mail; and based on the segment received, rendering to the user at least one of related subsets or a related supersets of electronic mails related to the received segment, wherein the related subsets and related supersets are threads of the segment received and arranged in a hierarchical manner.
-
Citations
21 Claims
-
1. A method for identifying linked electronic mails, the method comprising:
-
receiving a query from a user, wherein the query comprises at least a segment of an electronic mail; based on the segment received, identifying linked electronic mails via rendering to the user at least one of related subsets or related supersets of electronic mails related to the received segment, wherein the related subsets and related supersets are threads of the segment received and arranged in hierarchy; said rendering comprising; detecting at least one match between a query root segment signature of one the received segment and a root segment signature index, the root segment signature index comprising at least one of a word index and a metadata index, wherein the word index comprises at least one of a keyword and, and wherein the metadata index comprises at least one of temporal information and navigation information;
said detecting comprising;receiving a pre-defined token match threshold; identifying a set of root segment signatures of the root segment signature index; comparing the query root segment signature with each root segment signature of the set of root segment signatures of the root segment signature index; identifying a subset of the root segment signature index, wherein a match between the root segment signature and the query root segment signature is at least the pre-defined token match threshold; and building an electronic mail thread hierarchy based on the at least one detected match. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A data processing system comprising at least a processor and a memory, the data processing system configured for identifying linked electronic mails, the system configured to perform:
-
receiving a query from a user, wherein the query comprises at least a segment of an electronic mail; based on the segment received, identifying linked electronic mails via rendering to the user at least one of related subsets or related supersets of electronic mails related to the received segment, wherein the related subsets and related supersets are threads of the segment received and arranged in hierarchy; said rendering comprising; detecting at least one match between a query root segment signature of the received segment and a root segment signature index, the root segment signature index comprising at least one of a word index and a metadata index, wherein the word index comprises at least one of a keyword and subject information, and wherein the metadata index comprises at least one of temporal information and navigation information; said detecting comprising; receiving a pre-defined token match threshold; identifying a set of root segment signatures of the root segment signature index; comparing the query root segment signature with each root segment signature of the set of root segment signatures of the root segment signature index; identifying a subset of the root segment signature index, wherein a match between the root segment signature and the query root segment signature is at least the pre-defined token match threshold; and building an electronic mail thread hierarchy based on the at least one detected match. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
creating a thread hierarchy based on the weights assigned to each of the child node.
-
-
20. The system as claimed in claim 11, wherein the electronic mail and the segments are stored in a repository.
-
21. The system as claimed in claim 20, wherein the repository comprise at least one of structured data and unstructured data.
Specification