VARIABLE-COMPONENT DEEP NEURAL NETWORK FOR ROBUST SPEECH RECOGNITION
First Claim
1. A method for recognizing speech, the method comprising:
- capturing speech input;
determining a value for an environment variable;
utilizing a deep neural network (DNN) to recognize the captured speech input, wherein one or more components of the DNN are modeled as a set of functions of the environment variable; and
producing an output of recognized speech.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods for speech recognition incorporating environmental variables are provided. The systems and methods capture speech to be recognized. The speech is then recognized utilizing a variable component deep neural network (DNN). The variable component DNN processes the captured speech by incorporating an environment variable. The environment variable may be any variable that is dependent on environmental conditions or the relation of the user, the client device, and the environment. For example, the environment variable may be based on noise of the environment and represented as a signal-to-noise ratio. The variable component DNN may incorporate the environment variable in different ways. For instance, the environment variable may be incorporated into weighting matrices and biases of the DNN, the outputs of the hidden layers of the DNN, or the activation functions of the nodes of the DNN.
193 Citations
12 Claims
-
1. A method for recognizing speech, the method comprising:
-
capturing speech input; determining a value for an environment variable; utilizing a deep neural network (DNN) to recognize the captured speech input, wherein one or more components of the DNN are modeled as a set of functions of the environment variable; and producing an output of recognized speech. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system for recognizing speech, the system comprising:
-
a speech capture device; a feature extraction module; an environment variable module, wherein the environment variable module determines a value for an environment variable; and a speech recognition decoder, wherein the speech recognition decoder utilizes a deep neural network (DNN) to recognize speech captured by the speech capture device, wherein one or more components of the DNN are modeled as a set of functions of the environment variable. - View Dependent Claims (8, 9, 10, 11, 12)
-
Specification