Conservative training method for adapting a neural network of an automatic speech recognition device
First Claim
Patent Images
1. A method of adapting a neural network of an automatic speech recognition device, comprising the steps of:
- providing a neural network comprising an input stage for storing at least one voice signal sample, an intermediate stage and an output stage, said output stage outputting phoneme probabilities;
providing a linear stage in said neural network; and
training said linear stage by means of an adaptation set,wherein the step of providing said linear stage comprises the step of providing said linear stage after said intermediate stage;
wherein the step of training said linear stage comprises training said linear stage so that the phoneme probability of a phoneme belonging to an absent class is equal to the phoneme probability of said phoneme calculated by said neural network before the step of providing a linear stage; and
wherein the step of training said linear stage further comprises training said linear stage so that the phoneme probability of the phoneme corresponding to a voice signal sample of said adaptation set is calculated by subtracting the phoneme probabilities of all the phonemes belonging to said absent class from 1.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of adapting a neural network of an automatic speech recognition device, includes the steps of: providing a neural network including an input stage, an intermediate stage and an output stage, the output stage outputting phoneme probabilities; providing a linear stage in the neural network; and training the linear stage by means of an adaptation set; wherein the step of providing the linear stage includes the step of providing the linear stage after the intermediate stage.
-
Citations
16 Claims
-
1. A method of adapting a neural network of an automatic speech recognition device, comprising the steps of:
-
providing a neural network comprising an input stage for storing at least one voice signal sample, an intermediate stage and an output stage, said output stage outputting phoneme probabilities; providing a linear stage in said neural network; and training said linear stage by means of an adaptation set, wherein the step of providing said linear stage comprises the step of providing said linear stage after said intermediate stage; wherein the step of training said linear stage comprises training said linear stage so that the phoneme probability of a phoneme belonging to an absent class is equal to the phoneme probability of said phoneme calculated by said neural network before the step of providing a linear stage; and wherein the step of training said linear stage further comprises training said linear stage so that the phoneme probability of the phoneme corresponding to a voice signal sample of said adaptation set is calculated by subtracting the phoneme probabilities of all the phonemes belonging to said absent class from 1. - View Dependent Claims (2, 3, 4, 5, 6, 14)
-
-
7. A neural network comprising:
-
a computer; an input stage for storing at least one voice signal sample; an intermediate stage; an output stage; and a linear stage which is to be trained by means of an adaptation set, wherein said output stage is configured to output phoneme probabilities; wherein said linear stage is provided after said intermediate stage and is configured to be trained so that the phoneme probability of a phoneme belonging to an absent class is equal to the phoneme probability of said phoneme calculated by said neural network before the step of providing a linear stage; and wherein said linear stage is configured to be trained so that the phoneme probability of the phoneme corresponding to a voice signal sample of said adaptation set is calculated by subtracting the phoneme probabilities of all the phonemes belonging to said absent class from 1. - View Dependent Claims (8, 9, 10, 11, 12, 13)
-
-
15. A method of adapting a multi-layer neural network of an automatic speech recognition device, comprising the steps of:
-
providing a neural network comprising an input stage for storing at least one voice signal sample, an intermediate stage having input connections associated to a first weight matrix and an output stage having input connections associated to a second weight matrix, said output stage outputting phoneme probabilities; providing a linear stage in said neural network after said intermediate stage, said linear stage having a same number of nodes as said intermediate stage; and training said linear stage by means of an adaptation set, said first weight matrix and said second weight matrix being kept fixed during said training, wherein the step of training said linear stage comprises training said linear stage so that the phoneme probability of a phoneme belonging to an absent class is equal to the phoneme probability of said phoneme calculated by said neural network before the step of providing a linear stage; and wherein the step of training said linear stage comprises training said linear stage so that the phoneme probability of the phoneme corresponding to a voice signal sample of said adaptation set is calculated by subtracting the phoneme probabilities of all the phonemes belonging to said absent class from 1.
-
-
16. A multi-layer neural network computation module, comprising:
-
a computer; an input stage for storing at least one voice signal sample; an intermediate stage having input connections associated to a first weight matrix; an output stage having input connections associated to a second weight matrix; and a linear stage configured to be trained by means of an adaptation set, wherein said first weight matrix and said second weight matrix are kept fixed while said linear stage is trained; wherein said output stage is configured to output phoneme probabilities; wherein said linear stage is provided after said intermediate stage, said linear stage having a same number of nodes as said intermediate stage; wherein said linear stage is provided after said intermediate stage and is configured to be trained so that the phoneme probability of a phoneme belonging to an absent class is equal to the phoneme probability of said phoneme calculated by said neural network before the step of providing a linear stage; and wherein said linear stage is configured to be trained so that the phoneme probability of the phoneme corresponding to a voice signal sample of said adaptation set is calculated by subtracting the phoneme probabilities of all the phonemes belonging to said absent class from 1.
-
Specification