System and method for recognizing speech securely using a secure multi-party computation protocol
First Claim
1. A method for recognizing a speech unit stored at a client using hidden Markov models (HMMs) stored at a server, each HMM corresponds to a unit of recognizable speech, wherein the speech unit is represented as feature vectors, and wherein each feature vector is partitioned into two random additive shares such that the server receives only one random additive share from the client, the method comprising the steps of:
- determining iteratively by the server, in response to receiving a random additive share of each feature vector, an additive share of a likelihood of the speech unit with respect to the each HMM using at least one secure multi-party computation protocol to produce additive shares of likelihoods of units of recognizable speech, wherein the secure multi-party computation protocol uses as inputs the two random additive shares of a corresponding feature vector, each of the two random additive shares is provided by the client and by the server respectively; and
transmitting the additive shares of the likelihoods for recognizing the speech unit.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method recognizes speech securely using a secure multi-party computation protocol. The system includes a client and a server. The client is configured to provide securely speech in a form of an observation sequence of symbols, and the server is configured to provide securely a multiple trained hidden Markov models (HMMs), each trained HMM including a multiple states, a state transition probability distribution and an initial state distribution, and each state including a subset of the observation symbols and an observation symbol probability distribution. The observation symbol probability distributions are modeled by mixtures of Gaussian distributions. Also included are means for determining securely, for each HMM, a likelihood the observation sequence is produced by the states of the HMM, and means for determining a particular symbol with a maximum likelihood of a particular subset of the symbols corresponding to the speech.
-
Citations
14 Claims
-
1. A method for recognizing a speech unit stored at a client using hidden Markov models (HMMs) stored at a server, each HMM corresponds to a unit of recognizable speech, wherein the speech unit is represented as feature vectors, and wherein each feature vector is partitioned into two random additive shares such that the server receives only one random additive share from the client, the method comprising the steps of:
-
determining iteratively by the server, in response to receiving a random additive share of each feature vector, an additive share of a likelihood of the speech unit with respect to the each HMM using at least one secure multi-party computation protocol to produce additive shares of likelihoods of units of recognizable speech, wherein the secure multi-party computation protocol uses as inputs the two random additive shares of a corresponding feature vector, each of the two random additive shares is provided by the client and by the server respectively; and transmitting the additive shares of the likelihoods for recognizing the speech unit. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A server configured for recognizing a speech unit stored at a client using hidden Markov models (HMMs) stored at a server, each HMM corresponds to a unit of recognizable speech, wherein the speech unit is represented as feature vectors, and wherein each feature vector is partitioned into two random additive shares such that the server receives only one random additive share from the client, comprising:
-
means for determining iteratively by the server, in response to receiving a random additive share of each feature vector, an additive share of a likelihood of the speech unit with respect to the each HMM using at least one secure multi-party computation protocol to produce additive shares of likelihoods of units of recognizable speech, wherein the secure multi-party computation protocol uses as inputs the two random additive shares of a corresponding feature vector, each of the two random additive shares is provided by the client and by the server respectively; and means for transmitting the additive shares of the likelihoods for recognizing the speech unit. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
Specification