Speech synthesis system utilizing variable frame rate
First Claim
1. A speech synthesis system comprising:
- input means for receiving frames of speech data, said frames of speech data comprising binary representations of pitch data, energy data, reflection coefficient data and coded frame rate data, wherein said coded frame rate data is indicative of a variable time interval between the start of a current frame of speech data and the start of the next successive frame of speech data;
decoding means coupled to said input means for decoding said frame rate data;
interpolator means coupled to said input means and to said decoding means for providing a variable number of interpolation calculations to define interpolated speech values between adjacent frames of speech data from last implemented speech data in which the number of interpolation calculations and the time interval between the respective starts of adjacent frames of speech data in a given instance are determined by said frame rate data;
speech synthesizer means coupled to said interpolator means for selectively converting said frames of speech data and interpolated values thereof into analog speech signals representative of human speech; and
audio means coupled to said speech synthesizer means for converting said analog signals representative of human speech into audible sounds.
0 Assignments
0 Petitions
Accused Products
Abstract
Speech synthesis system implementable in an integrated circuit device capable of converting frames of speech data at a variable frame rate into analog signals representative of human speech. The frames of speech data comprise digital representations of values of pitch, energy, filter coefficients and coded frame rate data. The speech synthesis system includes a linear predictive coding filter as a speech synthesizer which utilizes the speech data at a varying frame rate to produce digital speech signals representative of human speech. Frames of digital speech data including coded frame rate data are received by an input, with the frame rate data being decoded to control both the rate at which the incoming variable-length frames of speech data are accepted by the speech synthesizer and the number of interpolation calculations required to define interpolated speech values between adjacent incoming frames of speech data. A frame control circuit accomplishes the foregoing utilization of speech data at a variable frame rate by the speech synthesizer by providing for a variable number of interpolation calculations between adjacent speech frames from last implemented speech data in which the number of interpolation calculations in a given instance is determined by the frame rate data. A microprocessor controls the access of selected speech data which is stored in a memory. The system also includes a digital-to-analog converter for converting the digital speech signals produced by the filter into analog signals and a speaker for generating audible sounds in the form of synthesized human speech from the analog signals provided by the digital-to-analog converter.
46 Citations
18 Claims
-
1. A speech synthesis system comprising:
-
input means for receiving frames of speech data, said frames of speech data comprising binary representations of pitch data, energy data, reflection coefficient data and coded frame rate data, wherein said coded frame rate data is indicative of a variable time interval between the start of a current frame of speech data and the start of the next successive frame of speech data; decoding means coupled to said input means for decoding said frame rate data; interpolator means coupled to said input means and to said decoding means for providing a variable number of interpolation calculations to define interpolated speech values between adjacent frames of speech data from last implemented speech data in which the number of interpolation calculations and the time interval between the respective starts of adjacent frames of speech data in a given instance are determined by said frame rate data; speech synthesizer means coupled to said interpolator means for selectively converting said frames of speech data and interpolated values thereof into analog speech signals representative of human speech; and audio means coupled to said speech synthesizer means for converting said analog signals representative of human speech into audible sounds. - View Dependent Claims (2)
-
-
3. A speech synthesis system comprising:
-
input means for receiving frames of speech data including digital speech values and frame rate data indicative of a variable time interval between the start of a current frame of speech data and the start of the next successive frame of speech data; frame control means coupled to said input means for controlling the rate at which new frames of speech data are received by said input means in response to frame rate data included in a current frame of speech data, the time interval between the receipt of successive new frames of speech data by said input means being variable and being determined by said frame rate data; interpolator means coupled to said frame control means for providing a variable number of interpolation calculations to define interpolated speech values between adjacent frames of speech data from last implemented speech data in which the number of interpolation calculations in a given instance is determined by said frame rate data; speech synthesizer means coupled to said input means and to said interpolator means for selectively converting said digital speech values and interpolated values thereof into analog speech signals representative of human speech; and audio means coupled to said speech synthesizer means for converting said analog signals representative of human speech into audible sounds. - View Dependent Claims (4, 15)
-
-
5. A speech synthesis system comprising:
-
memory means for storing selectable speech data, said speech data comprising binary representations of pitch data, energy data, and reflection coefficient data as selectable frames of speech data respectively including coded frame rate data, wherein said coded frame rate data is indicative of a variable time interval between the start of a current frame of speech data and the start of the next successive frame of speech data; controller means for controlling the selective accessing of said speech data from said memory means; input means coupled to said memory means for receiving selected frames of speech data as accessed under control of said controller means; decoding means coupled to said input means for decoding said frame rate data; frame control means coupled to said decoding means and said input means for controlling the rate at which new frames of speech data are received by said input means in response to frame rate data included in a current frame of speech data, the time interval between the receipt of successive new frames of speech data by said input means being variable and being determined by said frame rate data; speech synthesizer means coupled to said memory means and responsive to said frames of speech data for generating analog signals representative of human speech; and audio means coupled to said speech synthesizer means for converting said analog signals representative of human speech into audible sounds.
-
-
6. A speech synthesis system comprising:
-
memory means for storing a plurality of digital speech values indicative of pitch data, energy data, and reflection coefficient data as selectable frames of speech data respectively including coded frame rate data, wherein said coded frame rate data is indicative of a variable time interval between the start of a current frame of speech data and the start of the next successive frame of speech data; controller means for controlling selective accessing of said plurality of digital speech values from said memory means; input means coupled to said memory means for receiving selected frames of speech data as accessed under control of said controller means; decoding means coupled to said input means for decoding said frame rate data; frame control means coupled to said decoding means and said input means for controlling the rate at which new frames of speech data are received by said input means in response to frame rate data included in a current frame of speech data, the time interval between the receipt of successive new frames of speech data by said input means being variable and being determined by said frame rate data; interpolator means coupled to said frame control means for providing a variable number of interpolation calculations to define interpolator speech values between adjacent frames of speech data from last implemented speech data in which the number of interpolation calculations in a given instance is determined by said frame rate data; speech synthesizer means coupled to said input means and to said interpolator means for selectively converting said digital speech values included in the selected frames of speech data and interpolated values thereof into analog speech signals representative of human speech at a data rate determined by said frame rate data; and audio means coupled to said speech synthesizer means for converting said analog signals representative of human speech into audible sounds. - View Dependent Claims (7, 8, 9, 10, 11, 12)
-
-
13. A speech synthesis system comprising:
-
input means for receiving frames of speech data including digital speech values and frame rate data indicative of a variable time interval between the start of a current frame of speech data and the start of the next successive frame of speech data; interpolator means coupled to said input means for providing a variable number of interpolation calculations to define interpolated speech values between adjacent frames of speech data from last implemented speech data in which the number of interpolation calculations and the time interval between the respective starts of adjacent frames of speech data in a given instance are determined by said frame rate data; speech synthesizer means coupled to said interpolator means for selectively converting said digital speech values and interpolated values thereof into analog speech signals representative of human speech; and audio means coupled to said speech synthesizer means for converting said analog signals representative of human speech into audible sounds. - View Dependent Claims (14, 16)
-
-
17. A speech synthesis system comprising:
-
means for providing frames of speech data including digital speech values and frame rate data, wherein said frame rate data is indicative of a variable time interval between the start of a current frame of speech data and the start of the next successive frame of speech data; speech synthesizer means having an input for respectively receiving successive frames of speech data and for selectively converting said digital speech values into analog speech signals representative of human speech; frame control means responsive to said frame rate data for enabling the acceptance of a new frame of speech data by the input of said speech synthesizer means succeeding the previous frame of speech data, the time interval between the acceptance of successive new frames of speech data by the input of said speech synthesizer means as enabled by said frame control means being variable and being determined by said frame rate data of each frame of speech data, the input of said speech synthesizer means thereby being responsive to said frame control means for respectively receiving successive frames of speech data in variably timed relation to each other; and audio means coupled to said speech synthesizer means for converting said analog speech signals representative of human speech into audible sounds. - View Dependent Claims (18)
-
Specification