Learning new words
First Claim
1. A computer implemented method executed on a term learning server, the method comprising:
- receiving, by a term learning server, a batch of differentially private sketches of n-grams, each n-gram a sequence of characters forming a subset of one term in a plurality of terms unknown to the term learning server, the batch of differentially private sketches received from one or more client devices;
determining a randomization value based at least on the batch of differentially private sketches;
determining a randomization tolerance based on a set of differentially private sketch data stored on the term learning server; and
in response to determining that the randomization value differs from the randomization tolerance by greater than a threshold value, generating a privacy randomization warning to the term learning server, wherein the privacy randomization warning indicates to the term learning server to perform one or more operations to maintain privacy of the differentially private sketch data stored on the term learning server.
0 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are disclosed for a server learning new words generated by user client devices in a crowdsourced manner while maintaining local differential privacy of client devices. A client device can determine that a word typed on the client device is a new word that is not contained in a dictionary or asset catalog on the client device. New words can be grouped in classifications such as entertainment, health, finance, etc. A differential privacy system on the client device can comprise a privacy budget for each classification of new words. If there is privacy budget available for the classification, then one or more new terms in a classification can be sent to new term learning server, and the privacy budget for the classification reduced. The privacy budget can be periodically replenished.
26 Citations
18 Claims
-
1. A computer implemented method executed on a term learning server, the method comprising:
-
receiving, by a term learning server, a batch of differentially private sketches of n-grams, each n-gram a sequence of characters forming a subset of one term in a plurality of terms unknown to the term learning server, the batch of differentially private sketches received from one or more client devices; determining a randomization value based at least on the batch of differentially private sketches; determining a randomization tolerance based on a set of differentially private sketch data stored on the term learning server; and in response to determining that the randomization value differs from the randomization tolerance by greater than a threshold value, generating a privacy randomization warning to the term learning server, wherein the privacy randomization warning indicates to the term learning server to perform one or more operations to maintain privacy of the differentially private sketch data stored on the term learning server. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A non-transitory machine readable medium storing instructions to cause one or more processors to perform operations comprising:
-
receiving, by a term learning server, a batch of differentially private sketches of n-grams, each n-gram a sequence of characters forming a subset of one term in a plurality of terms unknown to the term learning server, the batch of differentially private sketches received from one or more client devices; determining a randomization value based at least on the batch of differentially private sketches; determining a randomization tolerance based on a set of differentially private sketch data stored on the term learning server; and in response to determining that the randomization value differs from the randomization tolerance by greater than a threshold value, generating a privacy randomization warning to the term learning server, wherein the privacy randomization warning indicates to the term learning server to perform one or more operations to maintain privacy of the differentially private sketch data stored on the term learning server. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A data processing system comprising:
-
a memory to store instructions; one or more processors to execute instructions stored in memory, wherein the instructions cause the one or more processors to perform operations comprising; receiving, by a term learning server, a batch of differentially private sketches of n-grams, each n-gram a sequence of characters forming a subset of one term in a plurality of terms unknown to the term learning server, the batch of differentially private sketches received from one or more client devices; determining a randomization value based at least on the batch of differentially private sketches; determining a randomization tolerance based on a set of differentially private sketch data stored on the term learning server; and in response to determining that the randomization value differs from the randomization tolerance by greater than a threshold value, generating a privacy randomization warning to the term learning server, wherein the privacy randomization warning indicates to the term learning server to perform one or more operations to maintain privacy of the differentially private sketch data stored on the term learning server. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification