Differential privacy with cloud data
First Claim
1. A computer-implemented method comprising:
- receiving, by a service, clear text of an email of a client device;
processing, by the service, the clear text of the email into a processed email, wherein processing the clear text of the email includes parsing the clear text into one or more tokenized sentence segments including one or more tags, and wherein the one or more tags privatize personally identifiable information within the clear text of the email;
enriching the one or more tokenized sentence segments in the processed email using data from an application specific to a user of the client device and the one or more tags, wherein enriching the tokenized sentence segments comprises;
accessing, from the application on the client device, the data specific to the user of the client device;
comparing the clear text associated with the one or more tags with the data associated with the user of the client device; and
in response to determining a match between the clear text and the data associated with the user of the client device;
replacing, by the service, at least one of the one or more tags with an enriched tag, wherein the enriched tag is created by modifying the at least one or more tags to include the data from the application specific to the user of the client device.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments described herein enable data associated with a large plurality of users to be analyzed without compromising the privacy of the user data. In one embodiment, a user can opt-in to allow analysis of clear text of the user'"'"'s emails. An analysis process can then be performed in which an analysis service receives clear text of an email of a client device; processes the clear text of the email into one or more tokens having one or more tags; enriches one or more tokens in the processed email using data associated with a user of the client device and the one or more tags; and processes the clear text and one or more enriched tokens to generate a data set of one or more feature vectors.
58 Citations
21 Claims
-
1. A computer-implemented method comprising:
-
receiving, by a service, clear text of an email of a client device; processing, by the service, the clear text of the email into a processed email, wherein processing the clear text of the email includes parsing the clear text into one or more tokenized sentence segments including one or more tags, and wherein the one or more tags privatize personally identifiable information within the clear text of the email; enriching the one or more tokenized sentence segments in the processed email using data from an application specific to a user of the client device and the one or more tags, wherein enriching the tokenized sentence segments comprises; accessing, from the application on the client device, the data specific to the user of the client device; comparing the clear text associated with the one or more tags with the data associated with the user of the client device; and in response to determining a match between the clear text and the data associated with the user of the client device; replacing, by the service, at least one of the one or more tags with an enriched tag, wherein the enriched tag is created by modifying the at least one or more tags to include the data from the application specific to the user of the client device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer readable medium programmed with instructions that, when executed by a processing system, cause the processing system to perform operations comprising:
-
receiving, by a service, clear text of an email of a client device; processing, by the service, the clear text of the email into a processed email, wherein processing the clear text of the email includes parsing the clear text into one or more tokenized sentence segments including one or more tags, wherein the one or more tags privatize personally identifiable information within the clear text of the email; enriching the one or more tokenized sentence segments in the processed email using data from an application specific to a user of the client device and the one or more tags, wherein enriching the tokenized sentence segments comprises; accessing, from the application on the client device, the data specific to the user of the client device; comparing the clear text associated with the one or more tags with the data associated with the user of the client device; and in response to determining a match between the clear text and the data associated with the user of the client device, replacing, by the service, at least one of the one or more tags with an enriched tag, wherein the enriched tag is created by modifying the at least one or more tags to include the data from the application specific to the user of the client device. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A system comprising:
-
a non-transitory machine-readable medium storing instructions; a processing system to execute instructions stored on the non-transitory machine-readable medium, the instructions to cause the processing system to; receive clear text of an email of a client device; process the clear text of the email into a processed email, the processed email including one or more tokens having one or more tags, wherein to process the clear text of the email includes to parse the clear text into one or more tokenized sentence segments including one or more tags, wherein the one or more tags privatize personally identifiable information within the clear text of the email; enrich the one or more tokenized sentence segments in the processed email using data from an application specific to a user of the client device and the one or more tags, wherein to enrich the tokenized sentence segments comprises; accessing, from the application on the client device, the data specific to the user of the client device; comparing the clear text associated with the one or more tags with the data associated with the user of the client device; and in response to determining a match between the clear text and the data associated with the user of the client device, replacing at least one of the one or more tags with an enriched tag, wherein the enriched tag is created by modifying the at least one or more tags to include the data from the application specific to the user of the client device. - View Dependent Claims (17, 18, 19, 20, 21)
-
Specification