Method and apparatus for normalizing quoting styles in electronic mail messages
First Claim
1. ) A method for normalizing quoting styles in electronic mail messages comprising:
- a) receiving a first text having a first issue and a first length and a second text having a second issue and a second length, b) determining one or more quoting text portions of the first text which comprise at least a portion of the second text, c) deleting from the first text any quoting text portions of the first text which consist of the substantially all of the second text, d) determining whether the first text does not contain any remaining quoting portions, and whether the first issue and the second issue are likely to be the same, and e) responsive to the determination in step d, adding at least a portion of the second text to the first text.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus is provided for making message adjustments to normalize quoting styles in message content for applications such as finding messages dealing with a particular topic, or finding inter-conversation topic groupings via centroid-based clustering methods. First, message is analyzed into its essential text, plus inessential material such as entire prefixed or suffixed. Then, the essential text of the node-associated message is adjusted to avoid vector distance distortions based on differences in quoting styles. Initially, selective quotes are included, as these are reasonably considered to form a logical part of the message, and entire prefixed or suffixed messages are omitted. Finally, if the message does not contain selective quotes, an analysis is done to determine whether all or part of the parent message constitutes a logical, albeit implicit, references. If so, all or some parts of the essential text of the parent are included in the adjusted message.
-
Citations
3 Claims
-
1. ) A method for normalizing quoting styles in electronic mail messages comprising:
-
a) receiving a first text having a first issue and a first length and a second text having a second issue and a second length, b) determining one or more quoting text portions of the first text which comprise at least a portion of the second text, c) deleting from the first text any quoting text portions of the first text which consist of the substantially all of the second text, d) determining whether the first text does not contain any remaining quoting portions, and whether the first issue and the second issue are likely to be the same, and e) responsive to the determination in step d, adding at least a portion of the second text to the first text. - View Dependent Claims (2, 3)
-
Specification