METHODS AND SYSTEMS FOR ENABLING ANALYSIS OF COMMUNICATION CONTENT WHILE PRESERVING CONFIDENTIALITY
Abstract
Disclosed are methods and systems for enabling analysis of communication content while preserving confidentiality. In one embodiment, communication content is processed to increase the similarity of superficially dissimilar instances of communication content and/or to increase the distinctiveness of superficially similar instances of communication content. In this embodiment, at least part of the processed communication content is hashed to obscure the actual communication content. In one embodiment, social network analysis is performed on the communication content after hashing, and visualization of the social network analysis includes thread graphs and/or circular graphs.
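The processing-then-hashing idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the suffix-stripping `crude_stem` function is a toy stand-in for a real stemmer, the choice of HMAC-SHA256 as the hash is an assumption, and the key and all function names are hypothetical.

```python
import hashlib
import hmac
import re

# Toy suffix list; a real system would use a proper stemming algorithm.
SUFFIXES = ("ing", "ed", "es", "s")


def crude_stem(word):
    """Strip one common suffix so related word forms share a root (toy stemmer)."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def normalize(text):
    """Lowercase, split into alphanumeric tokens, and reduce each token to a crude stem."""
    return [crude_stem(w) for w in re.findall(r"[a-z0-9]+", text.lower())]


def hash_token(token, key):
    """Keyed hash of one token; the 16-character truncation is an arbitrary choice."""
    return hmac.new(key, token.encode("utf-8"), hashlib.sha256).hexdigest()[:16]


def hash_message(text, key):
    """Produce one hashed token per natural language token."""
    return [hash_token(t, key) for t in normalize(text)]


key = b"hypothetical-secret-key"
a = hash_message("Reviewing the contracts", key)
b = hash_message("reviewed the CONTRACT", key)
print(a == b)  # the two superficially different messages yield the same hashed tokens
```

Because normalization happens before hashing, word forms that differ only in case or inflection collapse to the same hashed token, while the analyst who sees only the hashes cannot read the original words without the key.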
51 Claims
1-25. (canceled)
26. A system for enabling analysis of communication content while preserving confidentiality, comprising:
means for capturing communication content including instances of communication content that can be rendered into text;
means for processing said captured communication content into natural language tokens to adjust a level of similarity between separate instances of communication content, wherein each natural language token represents a root stem; and
means for hashing at least part of said processed communication content to obscure the actual communication content and to produce hashed tokens corresponding to each natural language token, wherein the level of similarity between separate instances of communication content is adjusted to improve hashing results.
Dependent claims: 27-38.
39. A method of enabling analysis of similarity of instances of communication content while preserving confidentiality, comprising:
capturing communication content including instances of communication content that can be rendered into text;
processing said captured communication content into natural language tokens to adjust a level of similarity between separate instances of communication content, wherein each natural language token represents a root stem; and
hashing at least part of said processed communication content to obscure the actual communication content and to produce hashed tokens corresponding to each natural language token, wherein the level of similarity between separate instances of communication content is adjusted to improve hashing results.
Dependent claims: 40-48.
49. A method of analyzing the similarity of communications while preserving the confidentiality of the communications, comprising:
capturing at least two entire communications;
processing the at least two entire communications into natural language tokens to improve the similarity of any similar content within the at least two entire communications and to reduce the similarity of any dissimilar content within the at least two entire communications, wherein each natural language token represents a root stem;
encrypting the at least two processed communications to generate tokens which obscure the actual content and are similar in nature for similar content, wherein each generated token corresponds to a natural language token; and
comparing the tokens to identify similar content within the at least two processed communications without determining the actual content of the at least two processed communications.
Dependent claims: 50, 51.
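The comparing step of claim 49 can be illustrated with a short Python sketch that measures overlap between sets of opaque tokens without ever looking at the underlying words. The assumptions here are significant: a keyed hash (HMAC-SHA256) stands in for the claimed encrypting step, Jaccard overlap is just one possible similarity measure, the inputs are taken to be already reduced to root stems, and the key and function names are hypothetical.

```python
import hashlib
import hmac


def opaque(stems, key):
    """Map already-stemmed tokens to opaque tokens (keyed hash as a stand-in for encryption)."""
    return {hmac.new(key, s.encode("utf-8"), hashlib.sha256).hexdigest() for s in stems}


def jaccard(a, b):
    """Share of opaque tokens the two sets have in common; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


key = b"hypothetical-shared-key"
msg1 = opaque(["review", "the", "contract", "today"], key)
msg2 = opaque(["review", "the", "contract", "tomorrow"], key)
msg3 = opaque(["lunch", "menu"], key)

print(jaccard(msg1, msg2))  # 0.6: three shared tokens out of five distinct tokens
print(jaccard(msg1, msg3))  # 0.0: no tokens in common
```

The comparison works only because identical stems always produce identical opaque tokens under the same key; the party running `jaccard` needs the token sets but not the key or the original text.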
Specification