Adaptive linear prediction speech synthesizer
First Claim
1. A synthesizer for producing a speech signal from segmented parametric description signals, and preceding speech samples of said speech signal comprising means for storing an excitation level adjustment signal for the currently occurring speech segment, means operative in spaced time periods of the currently occurring speech segment responsive to the parametric description signals of said current speech segment, said preceding speech samples, and said excitation level adjustment signal for generating the speech samples of said current speech segment at a predetermined rate;
- means operative in intervals between said spaced time periods responsive to the parametric description signals of the next successive speech segment and said preceding speech samples for forming signals representative of prescribed component codes of the excitation level adjustment signal of the next successive speech segment; and
means operative after termination of the final spaced time period of the current speech segment responsive to said component code signals and said next successive speech segment parametric description signals for producing the excitation level adjustment signal of said next successive speech segment.
0 Assignments
0 Petitions
Accused Products
Abstract
A real-time predictive speech synthesizer produces an artificial speech signal from pitch period segmented codes. Responsive to the predictive parameters of the currently occurring pitch period, preceding speech samples, and the adjusted excitation signal of the current pitch period, a prescribed set of current pitch period speech samples are generated in regularly spaced time periods. In the intervals between spaced time periods, prescribed components of the excitation level adjustment signal of the next successive pitch period are formed from the prediction parameters of the next successive pitch period, the preceding speech samples, and the next successive pitch period excitation signal. After the current pitch period final spaced time period, the formed components are combined with the next successive pitch period energy signal to produce the next successive pitch period excitation level adjustment signal.
-
Citations
26 Claims
-
1. A synthesizer for producing a speech signal from segmented parametric description signals, and preceding speech samples of said speech signal comprising means for storing an excitation level adjustment signal for the currently occurring speech segment, means operative in spaced time periods of the currently occurring speech segment responsive to the parametric description signals of said current speech segment, said preceding speech samples, and said excitation level adjustment signal for generating the speech samples of said current speech segment at a predetermined rate;
- means operative in intervals between said spaced time periods responsive to the parametric description signals of the next successive speech segment and said preceding speech samples for forming signals representative of prescribed component codes of the excitation level adjustment signal of the next successive speech segment; and
means operative after termination of the final spaced time period of the current speech segment responsive to said component code signals and said next successive speech segment parametric description signals for producing the excitation level adjustment signal of said next successive speech segment. - View Dependent Claims (2, 3, 4, 5, 6, 7)
- means operative in intervals between said spaced time periods responsive to the parametric description signals of the next successive speech segment and said preceding speech samples for forming signals representative of prescribed component codes of the excitation level adjustment signal of the next successive speech segment; and
- 8. A synthesizer for producing a prescribed speech signal from concatenated pitch period descriptive parameter codes comprising means for storing first signals representative of predictive parameters of a pitch period speech signal segment, means for storing second signals representative of predictive parameters of the next successive pitch period speech signal segment, means for storing a third signal representative of the energy of said next successive pitch period speech signal segment, means for storing a first set of preceding speech samples, means for storing a second set of preceding speech samples, means for storing an excitation adjustment signal, an excitation signal source, means operative at predetermined spaced time periods responsive to said first signals, said first set of preceding speech samples, the excitation signal of said pitch period from said excitation signal source, and said excitation adjustment signal for generating the speech samples of said pitch period segment, means operative in selected intervals between said spaced time periods responsive to said second signals, said second set of preceding speech samples and the excitation signal of said next successive pitch period from said excitation signal source for forming a plurality of coded signals representative of prescribed components of the next successive pitch period excitation adjustment signal, and means operative after the final spaced time period of said pitch period responsive to said coded signals and said third signal for generating the excitation adjustment signal of said next successive pitch period speech signal.
-
12. Apparatus for synthesizing a speech signal from pitch period segmented linear prediction parameter signals, excitation signals, pitch period speech segment energy signals, and preceding samples of said speech signal comprising means for storing a current pitch period excitation level adjustment signal, first means operative in regularly spaced time periods of the current pitch period responsive to the current pitch period prediction parameter signals, a first group of preceding speech samples, the current pitch period excitation signal, and the current pitch period excitation level adjustment signal for generating speech samples of said current pitch period, second means operative in intervals between said spaced time periods responsive to the next successive pitch period prediction parameter signals, a second group of preceding speech samples and the next successive pitch period excitation signal for forming signals representative of a prescribed set of components of the excitation level adjustment signal of the next successive pitch period, and third means operative upon termination of the final spaced time period of said current pitch period responsive to said next successive pitch period speech energy signal and the prescribed set of component signals for producing the excitation level adjustment signal of the next successive pitch period.
-
13. A method for synthesizing a speech signal from pitch period segmented parametric description signals, pitch period segmented excitation signals, and preceding speech samples of said speech signal comprising the steps of generating speech samples of the currently occurring pitch period in regularly spaced time periods responsive to the current pitch period parametric description signals, said preceding speech samples and the current pitch period adjusted excitation signal;
- forming signals representative of a prescribed set of components of the excitation adjustment signal of the next successive pitch period in intervals between said spaced time periods responsive to the parametric description signals of the next successive pitch period, the preceding speech samples, and the excitation signal of the next successive pitch period; and
producing the excitation adjustment signal of the next successive pitch period upon termination of the last spaced time period of said current pitch period responsive to the next successive pitch period parametric description signals and said component signals.
- forming signals representative of a prescribed set of components of the excitation adjustment signal of the next successive pitch period in intervals between said spaced time periods responsive to the parametric description signals of the next successive pitch period, the preceding speech samples, and the excitation signal of the next successive pitch period; and
-
14. A method for synthesizing an artificial speech signal from segmented parametric description signals and preceding speech samples of the artificial speech signal comprising the steps of receiving parametric description signals of the currently occurring speech segment, parametric description signals of the next successive speech segment, a signal representative of the energy of the next successive speech segment, and a signal representative of the excitation of the currently occurring and next successive speech segments;
- storing an excitation level adjustment signal of the currently occurring speech segment;
generating speech samples of the current speech signal segment in regularly spaced time periods responsive to the current speech segment parametric description signals, the preceding speech samples, the current speech segment excitation signal, and the current speech segment excitation level adjustment signal;
forming signals representative of prescribed components of the next successive speech segment excitation level adjustment signal in intervals between said spaced time periods responsive to the next successive speech segment parametric description signals, the preceding speech samples, and the next successive speech segment excitation signal; and
producing the next successive speech segment excitation level adjustment signal upon termination of the final spaced time period responsive to the next successive speech segment energy signal and the formed component signals. - View Dependent Claims (15, 16, 17, 18)
- storing an excitation level adjustment signal of the currently occurring speech segment;
-
19. A linear prediction synthesizer for producing an artificial speech signal at a real-time pitch period rate comprising means for receiving pitch period segmented predictive parameter signals, pitch period segmented speech signal energy signals, pitch period segmented excitation signals, and signals representative of the number of samples in each pitch period;
- means for storing first and second groups of preceding samples of said speech signal;
means for storing the currently occurring pitch period excitation level adjustment signal;
means operative in regularly spaced time periods of the currently occurring pitch period for generating speech samples of said current pitch period comprising first means for arithmetically combining the current pitch period predictive parameter signals with said first group of preceding speech signal samples, means for multiplying the current pitch period excitation signal with the stored excitation level adjustment signal of said current pitch period, means for summing the output of said first means and the output of said second means to form a speech sample of said current pitch period, and means responsive to the current pitch period sample number signal for disabling said speech generating means when the number of speech samples generated equals said current pitch period sample number signal;
means operative in selected intervals occurring between said spaced time periods for cumulatively forming a group of signals representative of a set of components of the next successive pitch period excitation level adjustment signal comprising third means for arithmetically combining said second group of preceding speech samples with said next succeeding pitch period predictive parameter signals for each speech sample of the next successive pitch period, fourth means responsive to said arithmetically combined signals from said third means for forming a prescribed set of said next succeeding pitch period excitation level adjustment signal components for each speech sample of said next successive pitch period, means for cumulatively combining each excitation level, adjustment signal component with the corresponding excitation level adjustment signal component formed for the preceding speech samples of said next successive pitch period, a plurality of sets of speech sample excitation level adjustment component signals being formed in each selected interval, and means responsive to the next successive pitch period sample number signal equaling the number of operations of said fourth means for disabling said component signal forming means; and
means operative upon the disabling of said sample generating means for producing the next successive pitch period excitation level adjustment signal comprising means for arithmetically combining said formed group of component signals with said next successive pitch period speech energy signal; and
means for applying the produced excitation level adjustment signal to said excitation level adjustment signal storing means.
- means for storing first and second groups of preceding samples of said speech signal;
-
20. A synthesizer for producing an artificial speech signal from pitch period segmented descriptive codes comprising means operating at the beginning of the currently occurring speech signal pitch period for receiving predictive parameter signals of the next succeeding pitch period, a signal representative of the next succeeding pitch period speech energy, a signal representative of the next successive pitch period excitation;
- means for storing first and second groups of preceding samples of said speech signal;
means for storing an excitation adjustment signal for the current pitch period;
means operative at regularly spaced times in the currently occurring pitch period for generating the current pitch period speech samples comprising first means for combining the previously received predictive parameter signals of the current pitch period, the first group of preceding samples, the previously received excitation signal of the current pitch period, and the stored excitation adjustment signal of the current pitch period;
means operative in intervals between said regularly spaced times for cumulatively forming signals representative of a prescribed set of component codes of the next successive pitch period excitation adjustment signal comprising second means for combining the next successive pitch period predictive parameter signals, the second group of preceding samples and the next successive pitch period excitation signal; and
means operative after the final spaced time of the current pitch period for producing the next successive pitch period excitation adjustment signal comprising third means for combining said component code signals with the next succeeding pitch period speech energy signal. - View Dependent Claims (21, 22, 23, 24, 25)
- means for storing first and second groups of preceding samples of said speech signal;
-
26. An artificial speech synthesizer for producing a speech signal from pitch period segmented parametric description signals comprising means for storing a pitch period speech corrective signal;
- means operative in regularly spaced time periods of the currently occurring pitch period responsive to the parametric description signals of said current pitch period and said pitch period corrective signal for generating samples of said pitch period speech segment, means operative in intervals between said spaced time periods responsive to the parametric description signals of the next successive pitch period for forming signals representative of a prescribed set of component codes of the speech corrective signal of the next successive pitch period; and
means operative upon termination of the last spaced time period responsive to said component code signals for producing the corrective signal of the next successive pitch period.
- means operative in regularly spaced time periods of the currently occurring pitch period responsive to the parametric description signals of said current pitch period and said pitch period corrective signal for generating samples of said pitch period speech segment, means operative in intervals between said spaced time periods responsive to the parametric description signals of the next successive pitch period for forming signals representative of a prescribed set of component codes of the speech corrective signal of the next successive pitch period; and
Specification