Method of automatic processing of a speech signal
First Claim
1. Method of automatic processing of a speech signal comprising:
- an automatic step of determination of at least one sequence of probability models coming from a finite directory of models, each sequence describing the probability of acoustic production of a sequence of symbolic units of a phonological nature coming from a finite alphabet, the said sequence of symbolic units corresponding to at least one given text and the said probability models each including an observable random process corresponding to the acoustic production of symbolic units and a non-observable random process having known probability properties, so-called Markov properties;
a step of determination of a sequence of digital data strings, known as acoustic strings, representing acoustic properties of a speech signal;
a step of alignment between the said sequence of acoustic strings and the said at least one sequence of models, each model being associated with a sub-sequence of acoustic strings, forming an acoustic segment, and each value of the non-observable process of each model being associated with a sub-sequence of acoustic strings forming an acoustic sub-segment in order to deliver a sequence of non-observable process values associating a value with each acoustic string, known as an aligned sequence; and
a step of determination of a confidence index of acoustic alignment for each association between a model of the sequence and an acoustic segment, known as a model alignment confidence index, and corresponding to an estimate of the probability a posteriori of the model given the observation of the corresponding acoustic segment, known as the a posteriori model probability, characterised in that each step of determination of an alignment confidence index for a model comprises the calculation of the value of the said index at least from a combination of;
the probability of observation of each acoustic string given the value of the non-observable process, known as the model probability and determined from known characteristic parameters of the probability model;
probabilities of production a priori of all the models of the said directory, independently of one another, known as the a priori model probabilities; and
the analytical estimation of the average duration of occupancy of the values of the non-observable process of the model.
1 Assignment
0 Petitions
Accused Products
Abstract
This method of automatic processing of a speech signal comprises:
a step of determination of a sequence (H1N) of probability models corresponding to a given text (TXT);
a step of determination of a sequence (O1T) of acoustic strings corresponding to the diction of the said given text (TXT);
a step of alignment between the said sequence (O1T) of acoustic strings and the said sequence (H1N) of models (Hn); and
a step of determination of a confidence index (In) of acoustic alignment for each association between a model (Hn) and an acoustic segment.
It is characterised in that each step (80) of determination of an alignment confidence index (In-) is carried out at least from a combination of the model probability (Pm), a priori model probabilities (P(λi)) and the average duration of occupancy of the models ({overscore (d)}(qij)).
4 Citations
20 Claims
-
1. Method of automatic processing of a speech signal comprising:
-
an automatic step of determination of at least one sequence of probability models coming from a finite directory of models, each sequence describing the probability of acoustic production of a sequence of symbolic units of a phonological nature coming from a finite alphabet, the said sequence of symbolic units corresponding to at least one given text and the said probability models each including an observable random process corresponding to the acoustic production of symbolic units and a non-observable random process having known probability properties, so-called Markov properties;
a step of determination of a sequence of digital data strings, known as acoustic strings, representing acoustic properties of a speech signal;
a step of alignment between the said sequence of acoustic strings and the said at least one sequence of models, each model being associated with a sub-sequence of acoustic strings, forming an acoustic segment, and each value of the non-observable process of each model being associated with a sub-sequence of acoustic strings forming an acoustic sub-segment in order to deliver a sequence of non-observable process values associating a value with each acoustic string, known as an aligned sequence; and
a step of determination of a confidence index of acoustic alignment for each association between a model of the sequence and an acoustic segment, known as a model alignment confidence index, and corresponding to an estimate of the probability a posteriori of the model given the observation of the corresponding acoustic segment, known as the a posteriori model probability, characterised in that each step of determination of an alignment confidence index for a model comprises the calculation of the value of the said index at least from a combination of;
the probability of observation of each acoustic string given the value of the non-observable process, known as the model probability and determined from known characteristic parameters of the probability model;
probabilities of production a priori of all the models of the said directory, independently of one another, known as the a priori model probabilities; and
the analytical estimation of the average duration of occupancy of the values of the non-observable process of the model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification