Speech analysis and synthesis system

US 4,058,676 A
Filed: 07/07/1975
Issued: 11/15/1977
Est. Priority Date: 07/07/1975
Status: Expired due to Term

Abstract

Method and apparatus for speech analysis and synthesis adapted for analyzing and multiplexing speech signals from a plurality of voice grade telephone lines for further transmission through a single voice grade telephone line. A plurality of specialized digital signal processing techniques are employed to analyze in real time four speech channels in parallel and multiplex speech frame parameters of the channels into a single data output channel for transmission through a suitable media. The received data channel is demultiplexed and the speech frame parameters for the individual channels are utilized to synthesize, in parallel, the four speech signals. Certain of the digital processing techniques utilize the characteristics of speech signals to truncate conventional signal processing time while other processing techniques are substantially statistical analyses of speech to resolve ambiguities, particularly in making the voiced/unvoiced decision for a frame of analyzed speech data.

92 Claims

1. A speech analysis and synthesis system capable of processing a plurality of speech channels in real time, said system comprising:
- an analyzer including, means for analyzing a plurality of electrical speech signals by Cepstrum technique to generate a pitch period parameter for each speech signal, means for analyzing said plurality of speech signals to generate a voiced/unvoiced decision parameter for each speech signal, means for analyzing said plurality of speech signals by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter for each speech signal; and
  
  a synthesizer including, pitch pulse generator means for receiving said pitch period parameters and generating pitch pulses having a corresponding pitch period, random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said voiced/unvoiced decision parameters, an output of said switch means being derived from said pitch pulse generator means if said decision is voiced and from said random noise generator means if said decision is unvoiced, gain control means connected to said output of said switch means for receiving said power parameters, an output level of said gain control means being dependent upon said power parameters, and linear prediction filter means connected to said output of said gain control means for receiving and applying said coefficient parameters to said filter means to generate a replica of each of said plurality of speech signals at an output of said filter means.
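
The synthesizer chain recited in claim 1 (excitation switch, gain control, linear prediction filter) can be illustrated with a minimal Python sketch. This is not the patented hardware: the function name `synthesize_frame`, the unit-amplitude pulse, the uniform noise source, the use of the power parameter directly as a multiplicative gain, and the filter sign convention y[n] = x[n] + sum(a_k * y[n-k]) are all assumptions made for illustration.

```python
import random

def synthesize_frame(pitch_period, voiced, gain, coeffs, n_samples,
                     history=None, seed=0):
    """Hypothetical one-channel, one-frame synthesizer sketch."""
    rng = random.Random(seed)
    # Past filter outputs, most recent first (assumed zero at start-up).
    history = list(history) if history is not None else [0.0] * len(coeffs)
    out = []
    for n in range(n_samples):
        if voiced:
            # Pitch pulse generator: one unit pulse per pitch period.
            excitation = 1.0 if n % pitch_period == 0 else 0.0
        else:
            # Random noise generator.
            excitation = rng.uniform(-1.0, 1.0)
        # Gain control: output level follows the power parameter (assumed linear).
        x = gain * excitation
        # All-pole linear prediction filter (assumed sign convention).
        y = x + sum(a * h for a, h in zip(coeffs, history))
        history = ([y] + history)[:len(coeffs)]
        out.append(y)
    return out, history
```

Returning `history` lets a caller carry the filter state into the next frame; the pulse phase is restarted at each frame here, which a real-time system would probably avoid.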

2. A speech analysis and synthesis system comprising:
- an analyzer including, means for analyzing an electrical speech signal by Cepstrum technique to generate a pitch period parameter, means for analyzing said speech signal to generate a voiced/unvoiced decision parameter, and means for analyzing said speech signal by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter; and
  
  a synthesizer including, pitch pulse generator means for receiving said pitch period parameter and generating pitch pulses having a corresponding pitch period, random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said voiced/unvoiced decision parameter, an output of said switch means being derived from said pitch pulse generator means if said decision is voiced, and from said random noise generator means if said decision is unvoiced, gain control means connected to said output of said switch means for receiving said power parameter, an output level of said gain control means being dependent upon said power parameter, and linear prediction filter means connected to said output of said gain control means for receiving and applying said coefficient parameters to said filter means to generate a replica of said speech signal at an output of said filter means;
  
  wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter includes means for scanning a Cepstrum signal within an auxiliary signal range on each side of a Cepstrum peak amplitude signal, means for detecting and storing a range peak signal within the scanned auxiliary ranges, means for summing the absolute values of the signal within the auxiliary ranges to form a range signal sum, means for dividing said Cepstrum peak amplitude signal by said range peak signal to form a range peak factor, means for dividing said range signal sum by said Cepstrum peak amplitude signal to form an average peak factor, means for first comparing said range peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said range peak factor is above said upper limit and generating an unvoiced decision parameter if said range peak factor is equal to or below said lower limit, and means for secondarily comparing said average peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said average peak factor is equal to or less than said lower limit and generating an unvoiced decision parameter if said average peak factor is equal to or greater than said upper limit.
- View Dependent Claims (3, 4, 5, 6, 7)
- - 3. A speech analysis and synthesis system as defined in claim 2 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - means for thirdly comparing said Cepstrum peak amplitude signal with predetermined upper and lower limits and generating a voiced decision parameter if said Cepstrum peak amplitude is equal to or less than said lower limit and generating an unvoiced decision parameter if said Cepstrum peak amplitude is equal to or greater than said upper limit.
  - 4. A speech analysis and synthesis system as defined in claim 3 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - means for monitoring said speech signal and counting the number of zero crossings in a predetermined time period; and
      
      means for comparing said number of zero crossings with a predetermined limit and generating an unvoiced decision parameter if said number of zero crossings is above said limit and generating a voiced decision parameter if said number of zero crossings is below said limit.
  - 5. A speech analysis and synthesis system as defined in claim 4 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - estimate register means;
      
      means for comparing the output of said estimate register with a first autocorrelation coefficient produced by said linear prediction technique and setting said register to said first autocorrelation coefficient value if said coefficient is equal to or less than said register value; and
      
      means for integrating the value in said register means if said coefficient is greater than said estimate register value;
      
      means for subtracting said estimate register value from said first autocorrelation coefficient value to generate a difference value; and
      
      means for comparing said difference value with predetermined upper and lower limits and generating a voiced decision parameter if said difference is greater than said upper limit and generating an unvoiced decision parameter if said difference signal is less than said lower limit.
  - 6. A speech analysis and synthesis system as defined in claim 5 wherein said means for analyzing said speech signals to generate a voiced/unvoiced decision parameter further includes:
    - means for comparing an error signal generated by said linear prediction technique with predetermined upper and lower limits and generating a voiced decision parameter if said error signal is equal to or less than said lower limit and generating an unvoiced decision parameter if said error signal is equal to or greater than said upper limit.
  - 7. A speech analysis and synthesis system as defined in claim 6 wherein said means for analyzing electrical speech signal by Cepstrum technique includes a signal conditioning system for conditioning said Cepstrum signal which has, in the direction of an ordinate axis, an amplitude dimension of said Cepstrum signal and, in the direction of the abscissa, a quantity representing time, said signal conditioning system having, means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum, and means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.
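
The first two voiced/unvoiced tests of claim 2 (range peak factor, then average peak factor) can be sketched as follows. The function names, the `aux_width` parameter, and the threshold tuples are hypothetical; the claims give no numeric limits. Taking the range peak as the largest absolute value in the auxiliary ranges is also an assumption, as is returning `None` when both tests fall between their limits so that later tests (claims 3 to 6) can decide.

```python
def cepstrum_peak_factors(cepstrum, peak_index, aux_width):
    """Return (range_peak_factor, average_peak_factor) around a cepstrum peak."""
    peak = cepstrum[peak_index]
    lo = max(0, peak_index - aux_width)
    hi = min(len(cepstrum), peak_index + aux_width + 1)
    # Auxiliary ranges on each side of the peak, excluding the peak itself.
    aux = [abs(cepstrum[i]) for i in range(lo, hi) if i != peak_index]
    if not aux or peak == 0 or max(aux) == 0:
        raise ValueError("factors are undefined for this frame")
    range_peak = max(aux)    # assumed: largest magnitude in the ranges
    range_sum = sum(aux)     # sum of absolute values
    return peak / range_peak, range_sum / peak

def voiced_decision(cepstrum, peak_index, aux_width, rpf_limits, apf_limits):
    """Return 'voiced', 'unvoiced', or None when both tests are inconclusive."""
    rpf, apf = cepstrum_peak_factors(cepstrum, peak_index, aux_width)
    lower, upper = rpf_limits
    if rpf > upper:          # peak stands well clear of its neighbourhood
        return "voiced"
    if rpf <= lower:
        return "unvoiced"
    lower, upper = apf_limits
    if apf <= lower:         # little energy around the peak
        return "voiced"
    if apf >= upper:
        return "unvoiced"
    return None
```

A caller would locate `peak_index` from the pitch search and fall through to the remaining tests whenever `None` comes back.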

8. A speech analysis and synthesis system comprising:
- an analyzer including, means for analyzing an electrical speech signal by Cepstrum technique to generate a pitch period parameter, means for analyzing said speech signal to generate a voiced/unvoiced decision parameter, and means for analyzing said speech signal by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter; and
  
  a synthesizer including, pitch pulse generator means for receiving said pitch period parameter and generating pitch pulses having a corresponding pitch period, random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said voiced/unvoiced decision parameter, an output of said switch means being derived from said pitch pulse generator means if said decision is voiced, and from said random noise generator means if said decision is unvoiced, gain control means connected to said output of said switch means for receiving said power parameter, an output level of said gain control means being dependent upon said power parameter, and linear prediction filter means connected to said output of said gain control means for receiving and applying said coefficient parameters to said filter means to generate a replica of said speech signal at an output of said filter means;
  
  wherein said means for analyzing an electrical speech signal by Cepstrum technique includes means for conditioning a Cepstrum signal generated by said Cepstrum technique, said signal conditioning means generating a weighting signal which linearly increases in value during the time span of said Cepstrum and said means for conditioning said Cepstrum further adding said weighting signal to said Cepstrum during said time span to create a weighted Cepstrum signal.
- View Dependent Claims (9)
- - 9. A speech analysis and synthesis system as defined in claim 8 wherein said signal conditioning means includes:
    - means for generating said weighting signal having a first adder with an output connected as an input to a register, said register having an output connected as a first input to said first adder, said first adder having as a second input an incrementing constant connected to said first adder through a first gate, said gate being enabled as an incremental element of said Cepstrum signal is gated through a second gate to a first input to a second adder, whereby said register is incremented for each element of said Cepstrum signal which is gated into said second adder; and
      
      means for adding said weighting signal to said Cepstrum by connecting the output of said register as a second input to said second adder, whereby the incremented output of said register is added to each element of said Cepstrum signal and an output of said second adder is said weighted Cepstrum signal.
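
The register-and-adder arrangement of claims 8 and 9 amounts to adding a linearly rising ramp to the Cepstrum, one step per element. A minimal Python sketch, in which the function name `weight_cepstrum` and the `increment` argument are hypothetical and the register is assumed to start at zero and to be incremented before each addition:

```python
def weight_cepstrum(cepstrum, increment):
    """Add a linearly increasing weighting signal to each cepstrum element."""
    register = 0.0           # assumed initial register value
    weighted = []
    for element in cepstrum:
        # First adder: the register accumulates the incrementing constant
        # each time an element is gated through.
        register += increment
        # Second adder: the register output is added to the element.
        weighted.append(element + register)
    return weighted
```

For example, `weight_cepstrum([1.0, 1.0, 1.0], 0.5)` yields `[1.5, 2.0, 2.5]`, so later elements of the Cepstrum receive a larger additive weight.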

10. A method of analyzing and synthesizing, in real time, speech signals in a plurality of speech channels, said method comprising the steps of:
- analyzing said speech signals by, analyzing each of said plurality of speech signals by a Cepstrum technique to generate a pitch period parameter, analyzing each of said plurality of speech signals to generate a voiced/unvoiced decision parameter, and analyzing each of said plurality of signals by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter; and
  
  synthesizing said speech signals by, generating pitch pulses having a pitch period corresponding to said pitch period parameter generated in said first analyzing step, generating a random noise signal, switching the output of said generated pitch pulses to an output of a switch means if said voiced/unvoiced decision parameter generated during said second analysis step is voiced and switching said random noise signal to said output of said switch means if said voiced/unvoiced decision is unvoiced, controlling the gain of said output of said switch means with gain control means in accordance with said generated power parameter, and connecting the output of said gain control means to the input of linear prediction filter means and applying said coefficient parameters to said filter means to generate a replica of each of said plurality of speech signals at an output of said filter means.
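
The first analyzing step of claim 10, pitch estimation by a Cepstrum technique, can be sketched in Python as below. The claims do not spell out the computation; this sketch assumes the common real-cepstrum definition (inverse transform of the log magnitude spectrum), uses a slow direct DFT to stay dependency-free, and introduces hypothetical names (`dft`, `real_cepstrum`, `pitch_period`) and a small `eps` to avoid taking the log of zero.

```python
import cmath
import math

def dft(x, inverse=False):
    """Direct O(N^2) discrete Fourier transform (illustrative, not real-time)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = [sum(x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
               for t in range(n))
           for k in range(n)]
    return [v / n for v in out] if inverse else out

def real_cepstrum(frame, eps=1e-9):
    """Inverse DFT of the log magnitude spectrum of one frame."""
    log_mag = [math.log(abs(v) + eps) for v in dft(frame)]
    return [v.real for v in dft(log_mag, inverse=True)]

def pitch_period(frame, min_period, max_period):
    """Index (in samples) of the largest cepstrum value in the search range."""
    c = real_cepstrum(frame)
    return max(range(min_period, max_period + 1), key=lambda q: c[q])

# Usage: a pulse train with a 16-sample period.
frame = [1.0 if n % 16 == 0 else 0.0 for n in range(64)]
period = pitch_period(frame, 8, 24)
```

The search range is restricted to plausible pitch periods; here it also excludes the second peak at twice the period, which a pulse train produces at equal height.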

11. A method of analyzing and synthesizing speech signals comprising the steps of:
- analyzing said speech signals by, analyzing said electrical speech signal by a Cepstrum technique to generate a pitch period parameter, analyzing said speech signal to generate a voiced/unvoiced decision parameter, analyzing said speech signal by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter; and
  
  synthesizing said speech signal by, generating pitch pulses having a pitch period corresponding to said pitch period parameter generated in said first analyzing step, generating a random noise signal, switching the output of said generated pitch pulses to an output of a switch means if said voiced/unvoiced decision parameter generated during said second analysis step is voiced and switching said random noise signal to said output of said switch means if said voiced/unvoiced decision is unvoiced, controlling the gain of said output of said switch means with gain control means in accordance with said generated power parameter, and connecting the output of said gain control means to the input of linear prediction filter means and applying said coefficient parameters to said filter means to generate a replica of said speech signal to an output of said filter means;
  
  wherein said step of analyzing said speech signal to generate a voiced/unvoiced decision parameter includes scanning a Cepstrum signal within an auxiliary signal range on each side of a Cepstrum peak amplitude signal, detecting and storing a range peak signal within the scanned auxiliary ranges, summing the absolute values of the signals within the auxiliary ranges to form a range signal sum, dividing said Cepstrum peak amplitude signal by said range peak signal to form a range peak factor;
  
  dividing said range signal sum by said Cepstrum peak amplitude signal to form an average peak factor, first comparing said range peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said range peak factor is above said upper limit and generating an unvoiced decision parameter if said range peak factor is equal to or below said lower limit, and secondarily comparing said average peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said average peak factor is equal to or less than said lower limit and generating an unvoiced decision parameter if said average peak factor is equal to or greater than said upper limit.
- View Dependent Claims (12, 13, 14, 15)
- - 12. A method of analyzing and synthesizing speech signals as defined in claim 11 wherein said step of analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - thirdly comparing said Cepstrum amplitude factor with predetermined upper and lower limits and generating a voiced decision parameter if said Cepstrum peak amplitude is equal to or less than said lower limit and generating an unvoiced decision parameter if said Cepstrum peak amplitude is equal to or greater than said upper limit.
  - 13. A method of analyzing and synthesizing speech signals as defined in claim 12 wherein said step of analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - monitoring said speech signal and counting the number of zero crossings in a predetermined time period; and
      
      comparing said number of zero crossings with a predetermined limit and generating an unvoiced decision parameter if said number of zero crossings is above said limit and generating a voiced decision parameter if said number of zero crossings is below said limit.
  - 14. A method of analyzing and synthesizing speech signals as defined in claim 13 wherein said step of analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - providing an estimate register;
      
      comparing the output of said estimate register with the first autocorrelation coefficient produced by said linear prediction technique and setting said register to said first autocorrelation coefficient value if said coefficient is equal to or less than said register value; and
      
      integrating the value in said register means if said coefficient is greater than said estimate register value;
      
      subtracting said estimate register value from said first autocorrelation coefficient value to generate a difference value; and
      
      comparing said difference value with predetermined upper and lower limits and generating a voiced decision parameter if said difference is greater than said upper limit and generating an unvoiced decision parameter if said difference signal is less than said lower limit.
  - 15. A method of analyzing and synthesizing speech signals as defined in claim 14 wherein said step of analyzing said speech signals to generate a voiced/unvoiced decision parameter further includes:
    - comparing an error signal generated by said linear prediction technique with predetermined upper and lower limits and generating a voiced decision parameter if said error signal is equal to or less than said lower limit and generating an unvoiced decision parameter if said error signal is equal to or greater than said upper limit.
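
Two of the later tests in this claim family, the zero-crossing count of claim 13 and the estimate register of claim 14, can be sketched in Python. The names `count_zero_crossings` and `EstimateRegister` are hypothetical. The claims do not define what "integrating" the register means; this sketch assumes a small fixed upward step per frame, so the register behaves like a slowly rising floor that snaps down to any lower first autocorrelation coefficient.

```python
def count_zero_crossings(samples):
    """Count sign changes between consecutive samples (zero counts as positive)."""
    return sum(1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0))

class EstimateRegister:
    """Tracks a floor for the first autocorrelation coefficient."""

    def __init__(self, initial, step):
        self.value = initial
        self.step = step     # assumed integration step, not given in the claims

    def update(self, r0):
        """Update the register and return the difference r0 - register."""
        if r0 <= self.value:
            # Coefficient at or below the estimate: adopt it directly.
            self.value = r0
        else:
            # Coefficient above the estimate: integrate (ramp) upward.
            self.value += self.step
        return r0 - self.value

def decide_from_difference(difference, lower, upper):
    """Voiced above the upper limit, unvoiced below the lower, else undecided."""
    if difference > upper:
        return "voiced"
    if difference < lower:
        return "unvoiced"
    return None
```

A high zero-crossing count points toward unvoiced speech, while a first autocorrelation coefficient far above the tracked floor points toward voiced speech; the limits passed to `decide_from_difference` would have to be tuned, since the claims only call them predetermined.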

16. A method of analyzing and synthesizing speech signals comprising the steps of:
- analyzing said speech signals by, analyzing said electrical speech signal by a Cepstrum technique to generate a pitch period parameter, analyzing said speech signal to generate a voiced/unvoiced decision parameter, analyzing said speech signal by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter; and
  
  synthesizing said speech signal by, generating pitch pulses having a pitch period corresponding to said pitch period parameter generated in said first analyzing step, generating a random noise signal, switching the output of said generated pitch pulses to an output of a switch means if said voiced/unvoiced decision parameter generated during said second analysis step is voiced and switching said random noise signal to said output of said switch means if said voiced/unvoiced decision is unvoiced, controlling the gain of said output of said switch means with gain control means in accordance with said generated power parameter, and connecting the output of said gain control means to the input of linear prediction filter means and applying said coefficient parameters to said filter means to generate a replica of said speech signal to an output of said filter means;
  
  wherein said step of analyzing electrical speech signals by Cepstrum technique includes a signal conditioning step for conditioning a Cepstrum signal which has a dimension in the direction of an ordinate axis, which is the amplitude of said Cepstrum signal and has a dimension in the direction of the abscissa which is a quantity representing time, said signal conditioning step having the steps of, generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum, and adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.

17. A method of analyzing and synthesizing speech signals comprising the steps of:
- analyzing said speech signals by, analyzing said electrical speech signal by a Cepstrum technique to generate a pitch period parameter, analyzing said speech signal to generate a voiced/unvoiced decision parameter, analyzing said speech signal by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter; and
  
  synthesizing said speech signal by, generating pitch pulses having a pitch period corresponding to said pitch period parameter generated in said first analyzing step, generating a random noise signal, switching the output of said generated pitch pulses to an output of a switch means if said voiced/unvoiced decision parameter generated during said second analysis step is voiced and switching said random noise signal to said output of said switch means if said voiced/unvoiced decision is unvoiced, controlling the gain of said output of said switch means with gain control means in accordance with said generated power parameter, and connecting the output of said gain control means to the input of linear prediction filter means and applying said coefficient parameters to said filter means to generate a replica of said speech signal to an output of said filter means;
  
  wherein said pitch pulse generating step includes generating a sequence of control signals in response to receipt of said pitch period parameter, and generating a substantially periodic, amplitude increasing pitch pulse excitation signal in response to said sequence of control signals.
- View Dependent Claims (18)
- - 18. A method of analyzing and synthesizing speech signals as defined in claim 17 wherein:
    - said control signal sequence generating step includes counting in a counter producing a parallel output count in response to processing clock signals; and
      
      generating said pitch pulse excitation signal includes addressing a read-only memory means by said parallel count output from said counter, the output of said read-only memory being said excitation signal.
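
The counter-addressed read-only memory of claims 17 and 18 can be mimicked with a lookup table indexed by a sample counter. In this Python sketch the table contents `PULSE_ROM` are invented for illustration (the claims only say the excitation is "amplitude increasing"), and resetting the counter once per pitch period is an assumption about how the control signal sequence is restarted.

```python
# Hypothetical ROM contents: a short pulse whose amplitude increases.
PULSE_ROM = [0.0, 0.25, 0.5, 1.0]

def pitch_pulse_excitation(pitch_period, n_samples, rom=PULSE_ROM):
    """Generate a periodic excitation by addressing a table with a counter."""
    out = []
    counter = 0
    for _ in range(n_samples):   # one iteration per processing clock
        # The counter value addresses the ROM; past the table end the
        # excitation is assumed to be zero until the next period starts.
        out.append(rom[counter] if counter < len(rom) else 0.0)
        counter += 1
        if counter >= pitch_period:
            counter = 0          # restart the pulse for the next period
    return out
```

With a pitch period of 6 samples, `pitch_pulse_excitation(6, 8)` gives `[0.0, 0.25, 0.5, 1.0, 0.0, 0.0, 0.0, 0.25]`.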

19. A system for analyzing a plurality of speech signals in real time and generating representative control parameters therefor, said system comprising:
- means for analyzing a plurality of electrical speech signals by Cepstrum technique to generate a pitch period parameter for each of said signals;
  
  means for analyzing each of said plurality of speech signals to generate a voiced/unvoiced decision parameter;
  
  means for analyzing each of said plurality of speech signals by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter; and
  
  means for combining said pitch period parameter, said voiced/unvoiced decision parameter, said coefficient parameters and said power parameter into a composite signal representative of each of said plurality of speech signals for transmission in compressed form to a point of utilization.
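
The combining step of claim 19 can be pictured as packing each channel's frame parameters into a fixed-size record and concatenating the records. The field widths below (8-bit pitch period, 8-bit voicing flag, 16-bit power, 16-bit signed coefficients) and all names are purely hypothetical; the claims do not state a frame format or bit allocation.

```python
import struct
from dataclasses import dataclass
from typing import List

@dataclass
class FrameParams:
    pitch_period: int      # assumed to fit in 0..255
    voiced: bool
    power: int             # assumed to fit in 0..65535
    coeffs: List[int]      # assumed quantized to signed 16-bit values

def _layout(n_coeffs):
    # Little-endian, no padding: pitch, voicing flag, power, coefficients.
    return f"<BBH{n_coeffs}h"

def pack_frame(p):
    return struct.pack(_layout(len(p.coeffs)), p.pitch_period,
                       int(p.voiced), p.power, *p.coeffs)

def multiplex(frames):
    """Concatenate one packed record per channel into a composite signal."""
    return b"".join(pack_frame(f) for f in frames)

def demultiplex(data, n_channels, n_coeffs):
    """Split the composite signal back into per-channel parameters."""
    size = struct.calcsize(_layout(n_coeffs))
    frames = []
    for ch in range(n_channels):
        pitch, voiced, power, *coeffs = struct.unpack(
            _layout(n_coeffs), data[ch * size:(ch + 1) * size])
        frames.append(FrameParams(pitch, bool(voiced), power, list(coeffs)))
    return frames
```

A round trip through `multiplex` and `demultiplex` returns the original per-channel parameters, which is the property a receiver-side demultiplexer would rely on.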

20. A system for analyzing a speech signal and generating representative control parameters therefor, said system comprising:
- means for analyzing an electrical speech signal by Cepstrum technique to generate a pitch period parameter;
  
  means for analyzing said speech signal to generate a voiced/unvoiced decision parameter;
  
  means for analyzing said speech signal by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter; and
  
  means for combining said pitch period parameter, said voiced/unvoiced decision parameter, said coefficient parameters and said power parameter into a composite signal representative of said speech signal;
  
  wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter includes means for scanning a Cepstrum signal within an auxiliary signal range on each side of a Cepstrum peak amplitude signal, means for detecting and storing a range peak signal within the scanned auxiliary ranges;
  
  means for summing the absolute values of the signals within the auxiliary ranges to form a range signal sum, means for dividing said Cepstrum peak amplitude signal by said range peak signal to form a range peak factor, means for dividing said range signal sum by said Cepstrum peak amplitude signal to form an average peak factor, means for first comparing said range peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said range peak factor is above said upper limit and generating an unvoiced decision parameter if said range peak factor is equal to or below said lower limit, and means for secondarily comparing said average peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said average peak factor is equal to or less than said lower limit and generating an unvoiced decision parameter if said average peak factor is equal to or greater than said upper limit.
- View Dependent Claims (21, 22, 23, 24, 25)
- - 21. A speech analysis system as defined in claim 20 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - means for thirdly comparing said Cepstrum peak amplitude signal with predetermined upper and lower limits and generating a voiced decision parameter if said Cepstrum peak amplitude is equal to or less than said lower limit and generating an unvoiced decision parameter if said Cepstrum peak amplitude is equal to or greater than said upper limit.
  - 22. A speech analysis system as defined in claim 21 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - means for monitoring said speech signal and counting the number of zero crossings in a predetermined time period; and
      
      means for comparing said number of zero crossings with a predetermined limit and generating an unvoiced decision parameter if said number of zero crossings is above said limit and generating a voiced decision parameter if said number of zero crossings is below said limit.
  - 23. A speech analysis system as defined in claim 22 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - estimate register means;
      
      means for comparing the output of said estimate register with the first autocorrelation coefficient produced by said linear prediction technique and setting said register to said first autocorrelation coefficient value if said coefficient is equal to or less than said register value; and
      
      means for integrating the value in said register means if said coefficient is greater than said estimate register value;
      
      means for subtracting said estimate register value from said first autocorrelation coefficient value to generate a difference value; and
      
      means for comparing said difference value with predetermined upper and lower limits and generating a voiced decision parameter if said difference is greater than said upper limit and generating an unvoiced decision parameter if said difference signal is less than said lower limit.
  - 24. A speech analysis system as defined in claim 23 wherein said means for analyzing said speech signals to generate a voiced/unvoiced decision parameter further includes:
    - means for comparing an error signal generated by said linear prediction technique with predetermined upper and lower limits and generating a voiced decision parameter if said error signal is equal to or less than said lower limit and generating an unvoiced decision parameter if said error signal is equal to or greater than said upper limit.
  - 25. A speech analysis system as defined in claim 24 wherein said means for analyzing an electrical speech signal by Cepstrum technique includes a signal conditioning system for conditioning said Cepstrum signal which has, in the direction of the ordinate axis, an amplitude dimension of said Cepstrum signal and, in the direction of the abscissa, a quantity representing time, said signal conditioning system having:
    - means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum signal; and
      
      means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.

26. A system for analyzing a speech signal and generating representative control parameters therefor, said system comprising:
- means for analyzing an electrical speech signal by Cepstrum technique to generate a pitch period parameter;
  
  means for analyzing said speech signal to generate a voiced/unvoiced decision parameter;
  
  means for analyzing said speech signal by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter; and
  
  means for combining said pitch period parameter, said voiced/unvoiced decision parameter, said coefficient parameters and said power parameter into a composite signal representative of said speech signal;
  
  wherein said means for analyzing an electrical speech signal by Cepstrum technique includes a signal conditioning system for conditioning a Cepstrum signal which has, in the direction of an ordinate axis, an amplitude dimension of said Cepstrum signal and, in the direction of the abscissa, a quantity representing time, said signal conditioning system having, means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum, and means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.

27. A method for analyzing a plurality of speech signals in real time and generating representative control parameters therefor, said method comprising the steps of:
- analyzing each of a plurality of speech signals by a Cepstrum technique to generate a pitch period parameter;
  
  analyzing each of said plurality of speech signals to generate a voiced/unvoiced decision parameter; and
  
  analyzing each of said plurality of speech signals by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter.
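As a rough illustration of the first step of claim 27, the following sketch estimates a pitch period from the real cepstrum of one frame. It is a textbook formulation, not necessarily the patented procedure: the helper names, the naive DFT, the small constant added before the logarithm, and the search range are all assumptions made for the example.

```python
import cmath
import math


def dft(values, inverse=False):
    """Naive O(N^2) discrete Fourier transform (adequate for short frames)."""
    n = len(values)
    sign = 1.0 if inverse else -1.0
    result = []
    for k in range(n):
        total = sum(
            v * cmath.exp(sign * 2j * math.pi * k * t / n)
            for t, v in enumerate(values)
        )
        result.append(total / n if inverse else total)
    return result


def real_cepstrum(frame):
    """Inverse transform of the log magnitude spectrum of one frame."""
    # 1e-12 is an arbitrary floor that avoids log(0) on empty spectral bins.
    log_magnitude = [math.log(abs(v) + 1e-12) for v in dft(frame)]
    return [v.real for v in dft(log_magnitude, inverse=True)]


def pitch_period(frame, min_lag, max_lag):
    """Return (lag, peak amplitude) of the largest cepstral value in the range."""
    cepstrum = real_cepstrum(frame)
    lag = max(range(min_lag, max_lag + 1), key=lambda q: cepstrum[q])
    return lag, cepstrum[lag]
```

The returned lag is in samples; dividing it by the sampling rate gives the pitch period in seconds. The peak amplitude is returned as well because the voiced/unvoiced tests in the later claims operate on it.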

28. A method for analyzing a speech signal and generating representative control parameters therefor, said method comprising the steps of:
- analyzing said speech signal by a Cepstrum technique to generate a pitch period parameter;
  
  analyzing said speech signal to generate a voiced/unvoiced decision parameter; and
  
  analyzing said speech signal by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter;
  
  wherein said step of analyzing said speech signal to generate a voiced/unvoiced decision parameter includes scanning a Cepstrum signal within an auxiliary signal range on each side of a Cepstrum peak amplitude signal, detecting and storing a range peak signal within the scanned auxiliary ranges, summing the absolute values of the signals within the auxiliary ranges to form a range signal sum, dividing said Cepstrum peak amplitude signal by said range peak signal to form a range peak factor, dividing said range signal sum by said Cepstrum peak amplitude signal to form an average peak factor, first comparing said range peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said range peak factor is above said upper limit and generating an unvoiced decision parameter if said range peak factor is equal to or below said lower limit, and secondarily comparing said average peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said average peak factor is equal to or less than said lower limit and generating an unvoiced decision parameter if said average peak factor is equal to or greater than said upper limit.
- - 29. A method of analyzing a speech signal as defined in claim 28 wherein said step of analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - thirdly comparing said Cepstrum peak amplitude signal with predetermined upper and lower limits and generating a voiced decision parameter if said Cepstrum peak amplitude is equal to or less than said lower limit and generating an unvoiced decision parameter if said Cepstrum peak amplitude is equal to or greater than said upper limit.
  - 30. A method of analyzing a speech signal as defined in claim 29 wherein said step of analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - monitoring said speech signal and counting the number of zero crossings in a predetermined time period; and
      
      comparing said number of zero crossings with a predetermined limit and generating an unvoiced decision parameter if said number of zero crossings is above said limit and generating a voiced decision parameter if said number of zero crossings is below said limit.
  - 31. A method of analyzing a speech signal as defined in claim 30 wherein said step of analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - providing an estimate register;
      
      comparing the output of said estimate register with the first autocorrelation coefficient produced by said linear prediction technique and setting said register to said first autocorrelation coefficient value if said coefficient is equal to or less than said register value;
      
      integrating the value in said register means if said coefficient is greater than said estimate register value;
      
      subtracting said estimate register value from said first autocorrelation coefficient value to generate a difference value; and
      
      comparing said difference value with predetermined upper and lower limits and generating a voiced decision parameter if said difference is greater than said upper limit and generating an unvoiced decision parameter if said difference signal is less than said lower limit.
  - 32. A method of analyzing a speech signal as defined in claim 30 wherein said step of analyzing said speech signals to generate a voiced/unvoiced decision parameter further includes:
    - comparing an error signal generated by said linear prediction technique with predetermined upper and lower limits and generating a voiced decision parameter if said error signal is equal to or less than said lower limit and generating an unvoiced decision parameter if said error signal is equal to or greater than said upper limit.
  - 33. A method of analyzing a speech signal as defined in claim 32 wherein said step of analyzing by a Cepstrum technique includes conditioning said Cepstrum signal, which has, in the direction of the ordinate axis, an amplitude dimension of said Cepstrum signal and, in the direction of the abscissa, a quantity representing time, said signal conditioning step including:
    - generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum signal; and
      
      adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.
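The first two tests of claim 28 (range peak factor, then average peak factor) can be sketched as a short cascade. All names and numeric limits below are hypothetical, as is the choice to take the range peak as the largest absolute value in the auxiliary ranges. The claim also does not say what happens when a factor falls between its limits, so the sketch returns `None` there and leaves the decision to later tests.

```python
def peak_factors(cepstrum, peak_index, width):
    """Return (range peak factor, average peak factor) around a cepstral peak.

    width is a hypothetical size of the auxiliary range on each side of the
    peak. The sketch assumes a nonzero peak and nonzero auxiliary samples.
    """
    start = max(0, peak_index - width)
    stop = min(len(cepstrum), peak_index + width + 1)
    aux = [abs(cepstrum[i]) for i in range(start, stop) if i != peak_index]
    peak = cepstrum[peak_index]
    range_peak = max(aux)   # largest sample in the auxiliary ranges
    range_sum = sum(aux)    # sum of absolute values in the auxiliary ranges
    return peak / range_peak, range_sum / peak


def voiced_unvoiced(cepstrum, peak_index, width, rpf_limits, apf_limits):
    """Apply the two comparisons in order; None means still undecided."""
    rpf, apf = peak_factors(cepstrum, peak_index, width)
    lower, upper = rpf_limits
    if rpf > upper:
        return "voiced"
    if rpf <= lower:
        return "unvoiced"
    lower, upper = apf_limits
    if apf <= lower:
        return "voiced"
    if apf >= upper:
        return "unvoiced"
    return None
```

A sharp, isolated peak gives a large range peak factor and is classed as voiced by the first test; a flat cepstrum gives a factor near one and is classed as unvoiced.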

34. A method for analyzing a speech signal and generating representative control parameters therefor, said method comprising the steps of:
- analyzing said speech signal by a Cepstrum technique to generate a pitch period parameter;
  
  analyzing said speech signal to generate a voiced/unvoiced decision parameter; and
  
  analyzing said speech signal by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter;
  
  wherein said analyzing electrical speech signal by Cepstrum technique includes the step of conditioning a Cepstrum signal, which has a dimension in the direction of an ordinate axis which is the amplitude of said Cepstrum signal and has dimension in the direction of the abscissa which is a quantity representing time, said signal conditioning step including, generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum, and adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.

35. A system for synthesizing speech in real time from a plurality of sets of control parameters representative of a plurality of speech signals, said parameters including a pitch period parameter, a voiced/unvoiced decision parameter, a predetermined number of coefficient parameters and a power parameter, said system comprising:
- pitch pulse generator means for receiving said pitch period parameter and generating pitch pulses having a corresponding pitch period;
  
  random noise generator means;
  
  switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said voiced/unvoiced decision parameter, an output of said switch means being derived from the output of said pitch pulse generator means if said decision is voiced and from said random noise generator means if said decision is unvoiced;
  
  gain control means connected to said output of said switch means, for receiving said power parameter, an output level of said gain control means being dependent upon said power parameter; and
  
  linear prediction filter means connected to said output of said control means for receiving and applying said coefficient parameters to said filter means to generate a replica of each of said plurality of speech signals at an output of said filter means.
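The signal path of claim 35 (pitch pulses or noise, selected by a switch, scaled by a gain, then passed through a linear prediction filter) can be illustrated for a single channel and a single frame. This is a minimal sketch under stated assumptions: unit-amplitude pulses, uniform noise, and the recursion `y[n] = gain * e[n] + sum(a[k] * y[n-k])` are conventional choices, not details taken from the claim, and every name is hypothetical.

```python
import random


def excitation(voiced, pitch_period, length, rng):
    """Switch: pitch pulses when voiced, random noise when unvoiced."""
    if voiced:
        return [1.0 if i % pitch_period == 0 else 0.0 for i in range(length)]
    return [rng.uniform(-1.0, 1.0) for _ in range(length)]


def synthesize_frame(voiced, pitch_period, gain, coeffs, length,
                     state=None, rng=None):
    """Return (samples, filter state) for one frame of one channel."""
    rng = rng or random.Random(0)
    # history[k] holds y[n-1-k]; it is carried across frames via `state`.
    history = list(state) if state else [0.0] * len(coeffs)
    samples = []
    for e in excitation(voiced, pitch_period, length, rng):
        # Gain control followed by the all-pole linear prediction filter.
        y = gain * e + sum(a * h for a, h in zip(coeffs, history))
        samples.append(y)
        history = ([y] + history)[:len(coeffs)]
    return samples, history
```

Passing the returned state into the next call keeps the filter output continuous across frame boundaries. A multi-channel version would hold one state per synthesis channel.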

36. A system for synthesizing speech from a combination of control parameters representative of a speech signal, said parameters including a pitch period parameter, a voiced/unvoiced decision parameter, a predetermined number of coefficient parameters and a power parameter, said system comprising:
- pitch pulse generator means for receiving said pitch period parameter and generating pitch pulses having a corresponding pitch period;
  
  random noise generator means;
  
  switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said voiced/unvoiced decision parameter, an output of said switch means being derived from the output of said pitch pulse generator means if said decision is voiced and from said random noise generator means if said decision is unvoiced;
  
  gain control means connected to said output of said switch means, for receiving said power parameter, an output level of said gain control means being dependent upon said power parameter; and
  
  linear prediction filter means connected to said output of said control means, for receiving and applying said coefficient parameters to said filter means to generate a replica of said speech signal at an output of said filter means;
  
  wherein said pitch pulse generator means includes means responsive to receipt of said pitch period parameter for generating a sequence of control signals, and means responsive to said sequence of control signals for generating a substantially periodic, amplitude increasing pitch pulse excitation signal.

37. A method of synthesizing speech in real time from a plurality of combinations of control parameters representative of a plurality of speech signals, said parameters including a pitch period parameter, a voiced/unvoiced decision parameter, a predetermined number of coefficient parameters and a power parameter, said method comprising the steps of:
- receiving said pitch period parameter and generating pitch period pulses having a corresponding pitch period;
  
  generating a random noise signal;
  
  switching said pitch period signals to an output of a switch means when said voiced/unvoiced decision parameter is voiced and switching said random noise signal to said output of said switch means if said voiced/unvoiced decision parameter is unvoiced;
  
  controlling the gain of the output of said switch means in response to said power parameter; and
  
  connecting the output of said gain controlled switch means to a linear prediction filter means connected to said coefficient parameters to generate a replica of each of said plurality of speech signals at an output of said filter means.

38. A method of synthesizing speech from a combination of control parameters representative of a speech signal, said parameters including a pitch period parameter, a voiced/unvoiced decision parameter, a predetermined number of coefficient parameters and a power parameter, said method comprising the steps of:
- receiving said pitch period parameter and generating pitch period pulses having a corresponding pitch period;
  
  generating a random noise signal;
  
  switching said pitch period signals to an output of a switch means when said voiced/unvoiced decision parameter is voiced and switching said random noise signal to said output of said switch means if said voiced/unvoiced decision parameter is unvoiced;
  
  controlling the gain of the output of said switch means in response to said power parameter; and
  
  connecting the output of said gain controlled switch means to a linear prediction filter means connected to said coefficient parameters to generate a replica of said speech signal at an output of said filter means;
  
  wherein said pitch pulse generating step includes generating a sequence of control signals in response to receipt of said pitch period parameter; and
  
  generating a substantially periodic, amplitude increasing pitch period pulse excitation signal in response to said sequence of control signals.

39. A system for speech analysis and synthesis, comprising:
- an analyzer including, means for analyzing a plurality of speech signal channels in real time by Cepstrum technique to generate a pitch period parameter for each of said speech signal channels, means for analyzing each of said plurality of speech signal channels, to generate a voiced/unvoiced decision parameter for each channel, means for analyzing each of said plurality of speech signal channels by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter for each of said channels, means for combining said pitch period parameter, coefficient parameters, power parameter, and voiced/unvoiced decision parameter into a set of parameters for each of said speech channels, means for combining said sets of parameters for each speech channel into a single transmission channel signal, means for transmitting said single transmission channel signal to a receiver, and means for separating said received single transmission channel signal into respective sets of parameters for a respective plurality of synthesis channels; and
  
  a synthesizer including, pitch pulse generator means for receiving said pitch period parameter for each of said synthesis channels and generating pitch pulses having a corresponding pitch period, random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said voiced/unvoiced decision parameters for each of said synthesis channels, an output of said switch means being said pitch pulse generator means if said decision for a channel is voiced and said output of said switch means being said random noise generator means if said decision for a channel is unvoiced, gain control means connected to said output of said switch means for receiving said power parameters for each of said synthesis channels, an output level of said gain control means being dependent upon said power parameter for each of said synthesis channels, and linear prediction filter means connected to said output of said gain control means for receiving and applying said coefficient parameters for each of said synthesis channels to said filter means to generate a replica of each of said speech signals at an output of said filter means.
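The combining and separating means of claim 39 amount to multiplexing one parameter set per channel into a single stream and splitting it again at the receiver. The sketch below shows one way this could look. The record layout (one byte of pitch period, a boolean flag, a 16-bit power value, ten 16-bit coefficients) and all names are purely hypothetical; the claims specify neither field widths nor the number of coefficients.

```python
import struct
from typing import NamedTuple, Tuple

NUM_COEFFS = 10  # hypothetical; the claims only say "a predetermined number"
# Little-endian, unpadded: pitch (B), voiced flag (?), power (H), coefficients (h).
_FRAME = struct.Struct("<B?H%dh" % NUM_COEFFS)


class FrameParams(NamedTuple):
    pitch_period: int
    voiced: bool
    power: int
    coeffs: Tuple[int, ...]


def multiplex(channels):
    """Concatenate one packed parameter set per channel into a single stream."""
    return b"".join(
        _FRAME.pack(p.pitch_period, p.voiced, p.power, *p.coeffs)
        for p in channels
    )


def demultiplex(data):
    """Split the received stream back into per-channel parameter sets."""
    return [
        FrameParams(f[0], f[1], f[2], tuple(f[3:]))
        for f in _FRAME.iter_unpack(data)
    ]
```

Channel identity is carried implicitly by position in the stream, so sender and receiver must agree on the channel order and on `NUM_COEFFS`.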

40. A system for speech analysis and synthesis, comprising:
- an analyzer including, means for analyzing a plurality of speech signal channels by Cepstrum technique to generate a pitch period parameter for each of said speech signal channels, means for analyzing each of said plurality of speech signal channels, to generate a voiced/unvoiced decision parameter for each channel, means for analyzing each of said plurality of speech signal channels by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter for each of said channels, means for combining said pitch period parameter, coefficient parameters, power parameter, and voiced/unvoiced decision parameter into a set of parameters for each of said speech channels, means for combining said sets of parameters for each speech channel into a single transmission channel signal, means for transmitting said single transmission channel signal to a receiver, and means for separating said received single transmission channel signal into respective sets of parameters for a respective plurality of synthesis channels; and
  
  a synthesizer including, pitch pulse generator means for receiving said pitch period parameter for each of said synthesis channels and generating pitch pulses having a corresponding pitch period, random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said voiced/unvoiced decision parameters for each of said synthesis channels, an output of said switch means being said pitch pulse generator means if said decision for a channel is voiced and said output of said switch means being said random noise generator means if said decision for a channel is unvoiced, gain control means connected to said output of said switch means for receiving said power parameters for each of said synthesis channels, an output level of said gain control means being dependent upon said power parameter for each of said synthesis channels, and linear prediction filter means connected to said output of said gain control means for receiving and applying said coefficient parameters for each of said synthesis channels to said filter means to generate a replica of each of said speech signals at an output of said filter means;
  
  wherein said means for analyzing each of said plurality of speech signal channels said speech signal to generate a voiced/unvoiced decision parameter includes means for scanning a Cepstrum signal within an auxiliary signal range on each side of a Cepstrum peak amplitude signal, means for detecting and storing a range peak signal within the scanned auxiliary ranges, means for summing the absolute values of the signal within the auxiliary ranges to form a range signal sum, means for dividing said Cepstrum peak amplitude signal by said range peak signal to form a range peak factor, means for dividing said range signal sum by said Cepstrum peak amplitude signal to form an average peak factor, means for first comparing said range peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said range peak factor is above said upper limit and generating an unvoiced decision parameter if said range peak factor is equal to or below said lower limit, and means for secondarily comparing said average peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said average peak factor is equal to or less than said lower limit and generating an unvoiced decision parameter if said average peak factor is equal to or greater than said upper limit.
- - 41. A speech analysis and synthesis system as defined in claim 40 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - means for thirdly comparing said Cepstrum peak amplitude factor signal with predetermined upper and lower limits and generating a voiced decision parameter if said Cepstrum peak amplitude is equal to or less than said lower limit and generating an unvoiced decision parameter if said Cepstrum peak amplitude is equal to or greater than said upper limit.
  - 42. A speech analysis and synthesis system as defined in claim 41 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - means for monitoring said speech signal and counting the number of zero crossings in a predetermined time period; and
      
      means for comparing said number of zero crossings with a predetermined limit and generating an unvoiced decision parameter if said number of zero crossings is above said limit and generating a voiced decision parameter if said number of zero crossings is below said limit.
  - 43. A speech analysis and synthesis system as defined in claim 42 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - estimate register means;
      
      means for comparing the output of said estimate register with the first autocorrelation coefficient produced by said linear prediction technique and setting said register to said first autocorrelation coefficient value if said coefficient is equal to or less than said register value;
      
      means for integrating the value in said register means if said coefficient is greater than said estimate register value;
      
      means for subtracting said estimate register value from said first autocorrelation coefficient value to generate a difference value; and
      
      means for comparing said difference value with predetermined upper and lower limits and generating a voiced decision parameter if said difference is greater than said upper limit and generating an unvoiced decision parameter if said difference signal is less than said lower limit.
  - 44. A speech analysis and synthesis system as defined in claim 43 wherein said means for analyzing said speech signals to generate a voiced/unvoiced decision parameter further includes:
    - means for comparing an error signal generated by said linear prediction technique with predetermined upper and lower limits and generating a voiced decision parameter if said error signal is equal to or less than said lower limit and generating an unvoiced decision parameter if said error signal is equal to or greater than said upper limit.
  - 45. A system for synthesizing speech as defined in claim 44 wherein said pitch pulse generator means includes:
    - means responsive to receipt of said pitch period parameter for generating a sequence of control signals; and
      
      means responsive to said sequence of control signals for generating a substantially periodic, amplitude increasing pitch pulse excitation signal.
  - 46. A speech analysis and synthesis system as defined in claim 45 wherein said means for analyzing the plurality of speech signal channels by Cepstrum technique to generate a pitch period parameter includes a signal conditioning system for conditioning said Cepstrum signal which has, in the direction of an ordinate axis, an amplitude dimension of said Cepstrum signal and, in the direction of the abscissa, a quantity representing time, said signal conditioning system having:
    - means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum signal; and
      
      means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.
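The zero-crossing test of claim 42 is simple enough to sketch directly. The function names and the limit value are hypothetical. The claim does not say what happens when the count equals the limit, so this sketch returns `None` in that case.

```python
def count_zero_crossings(samples):
    """Count sign changes between consecutive samples in the analysis period."""
    return sum(
        1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0)
    )


def zero_crossing_decision(samples, limit):
    """Unvoiced above the limit, voiced below it, undecided when equal."""
    crossings = count_zero_crossings(samples)
    if crossings > limit:
        return "unvoiced"
    if crossings < limit:
        return "voiced"
    return None
```

Noise-like (unvoiced) speech changes sign far more often than voiced speech, which is why a high count points to an unvoiced frame.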

47. A system for speech analysis and synthesis, comprising:
- an analyzer including, means for analyzing a plurality of speech signal channels by Cepstrum technique to generate a pitch period parameter for each of said speech signal channels, means for analyzing each of said plurality of speech signal channels, to generate a voiced/unvoiced decision parameter for each channel, means for analyzing each of said plurality of speech signal channels by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter for each of said channels, means for combining said pitch period parameter, coefficient parameters, power parameter, and voiced/unvoiced decision parameter into a set of parameters for each of said speech channels, means for combining said sets of parameters for each speech channel into a single transmission channel signal, means for transmitting said single transmission channel signal to a receiver, and means for separating said received single transmission channel signal into respective sets of parameters for a respective plurality of synthesis channels; and
  
  a synthesizer including, pitch pulse generator means for receiving said pitch period parameter for each of said synthesis channels and generating pitch pulses having a corresponding pitch period, random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said voiced/unvoiced decision parameters for each of said synthesis channels, an output of said switch means being said pitch pulse generator means if said decision for a channel is voiced and said output of said switch means being said random noise generator means if said decision for a channel is unvoiced, gain control means connected to said output of said switch means for receiving said power parameters for each of said synthesis channels, an output level of said gain control means being dependent upon said power parameter for each of said synthesis channels, and linear prediction filter means connected to said output of said gain control means for receiving and applying said coefficient parameters for each of said synthesis channels to said filter means to generate a replica of each of said speech signals at an output of said filter means;
  
  wherein said means for analyzing the plurality of speech signal channels by Cepstrum technique to generate a pitch period parameter includes a signal conditioning system for conditioning a Cepstrum signal which has, in the direction of an ordinate axis, an amplitude dimension of said Cepstrum signal and, in the direction of the abscissa, a quantity representing time, said signal conditioning system having, means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum, and means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.

48. A system for analyzing a plurality of speech signal channels, in real time, said system comprising an analyzer including:
- means for analyzing a plurality of speech signal channels by Cepstrum technique to generate a pitch period parameter for each of said speech signal channels;
  
  means for analyzing each of said plurality of speech signal channels to generate a voiced/unvoiced decision parameter for each channel; and
  
  means for analyzing each of said plurality of speech signal channels by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter for each of said channels.
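For the linear prediction element of claim 48, a common textbook route is the autocorrelation method solved with the Levinson-Durbin recursion. The sketch below follows that route as an assumption; the claim does not name the algorithm, and the use of the residual energy as a stand-in for the power parameter is likewise only a guess. The predictor convention assumed is `s[n] ≈ sum(a[k] * s[n-k])`.

```python
def autocorrelation(frame, order):
    """Autocorrelation values r[0..order] of one frame."""
    n = len(frame)
    return [
        sum(frame[i] * frame[i + k] for i in range(n - k))
        for k in range(order + 1)
    ]


def levinson_durbin(r, order):
    """Solve the normal equations; return (predictor coefficients, residual energy)."""
    a = [0.0] * (order + 1)  # a[1..order]; a[0] is unused
    error = r[0]
    for i in range(1, order + 1):
        if error == 0:
            break  # silent frame: nothing left to predict
        # Reflection coefficient for stage i.
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / error
        updated = a[:]
        updated[i] = k
        for j in range(1, i):
            updated[j] = a[j] - k * a[i - j]
        a = updated
        error *= 1.0 - k * k
    return a[1:], error
```

In a multi-channel analyzer the same two functions would simply be applied to each channel's frame in turn. Note that `r[0]` is the first autocorrelation value that the dependent claims compare against the estimate register.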

49. A system for analyzing a plurality of speech signal channels, comprising an analyzer including:
- means for analyzing a plurality of speech signal channels by Cepstrum technique to generate a pitch period parameter for each of said speech signal channels;
  
  means for analyzing each of said plurality of speech signal channels to generate a voiced/unvoiced decision parameter for each channel; and
  
  means for analyzing each of said plurality of speech signal channels by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter for each of said channels;
  
  wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter includes means for scanning a Cepstrum signal within an auxiliary signal range on each side of a Cepstrum peak amplitude signal, means for detecting and storing a range peak signal within the scanned auxiliary ranges, means for summing the absolute values of the signal within the auxiliary ranges to form a range signal sum, means for dividing said Cepstrum peak amplitude signal by said range peak signal to form a range peak factor;
  
  means for dividing said range signal sum by said Cepstrum peak amplitude signal to form an average peak factor, means for first comparing said range peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said range peak factor is above said upper limit and generating an unvoiced decision parameter if said range peak factor is equal to or below said lower limit, and means for secondarily comparing said average peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said average peak factor is equal to or less than said lower limit and generating an unvoiced decision parameter if said average peak factor is equal to or greater than said upper limit.
- - 50. A speech analysis system as defined in claim 49 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - means for thirdly comparing said Cepstrum peak amplitude signal with predetermined upper and lower limits and generating a voiced decision parameter if said Cepstrum peak amplitude is equal to or less than said lower limit and generating an unvoiced decision parameter if said Cepstrum peak amplitude is equal to or greater than said upper limit.
  - 51. A speech analysis system as defined in claim 50 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - means for monitoring said speech signal and counting the number of zero crossings in a predetermined time period; and
      
      means for comparing said number of zero crossings with a predetermined limit and generating an unvoiced decision parameter if said number of zero crossings is above said limit and generating a voiced decision parameter if said number of zero crossings is below said limit.
  - 52. A speech analysis system as defined in claim 51 wherein said means for analyzing said speech signal to generate a voiced/unvoiced decision parameter further includes:
    - estimate register means;
      
      means for comparing the output of said estimate register with a first autocorrelation coefficient produced by said linear prediction technique and setting said register to said first autocorrelation coefficient value if said coefficient is equal to or less than said register value; and
      
      means for integrating the value in said register means if said coefficient is greater than said estimate register value;
      
      means for subtracting said estimate register value from said first autocorrelation coefficient value to generate a difference value; and
      
      means for comparing said difference value with predetermined upper and lower limits and generating a voiced decision parameter if said difference is greater than said upper limit and generating an unvoiced decision parameter if said difference signal is less than said lower limit.
  - 53. A speech analysis system as defined in claim 52 wherein said means for analyzing said speech signals to generate a voiced/unvoiced decision parameter further includes:
    - means for comparing an error signal generated by said linear prediction technique with predetermined upper and lower limits and generating a voiced decision parameter if said error signal is equal to or less than said lower limit and generating an unvoiced decision parameter if said error signal is equal to or greater than said upper limit.
  - 54. A speech analysis system as defined in claim 53 wherein said means for analyzing a plurality of speech signal channels by Cepstrum technique includes a signal conditioning system for conditioning said Cepstrum signal which has, in the direction of an ordinate axis, an amplitude dimension of said Cepstrum signal and, in the direction of the abscissa, a quantity representing time, said signal conditioning system having:
    - means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum signal; and
      
      means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.
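The estimate register of claim 52 behaves like a slowly rising floor that snaps down to the first autocorrelation coefficient whenever that coefficient is lower. The sketch below is one possible reading. Interpreting "integrating" as adding a fixed step per frame is an assumption, and the class name, step size and limits are all hypothetical.

```python
class EstimateRegister:
    """Tracks a floor under the first autocorrelation coefficient."""

    def __init__(self, initial, step):
        self.value = initial
        self.step = step  # hypothetical integration increment per frame

    def update(self, r0):
        """Update the register with coefficient r0 and return r0 - register."""
        if r0 <= self.value:
            self.value = r0          # snap down to the new, lower coefficient
        else:
            self.value += self.step  # otherwise integrate slowly upward
        return r0 - self.value


def difference_decision(difference, lower, upper):
    """Voiced above the upper limit, unvoiced below the lower, else undecided."""
    if difference > upper:
        return "voiced"
    if difference < lower:
        return "unvoiced"
    return None
```

Because the register rises slowly and falls immediately, a frame whose coefficient stands well above it is likely to carry voiced speech rather than background noise.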

55. A system for analyzing a plurality of speech signal channels, comprising an analyzer including:
- means for analyzing a plurality of speech signal channels by Cepstrum technique to generate a pitch period parameter for each of said speech signal channels;
  
  means for analyzing each of said plurality of speech signal channels to generate a voiced/unvoiced decision parameter for each channel; and
  
  means for analyzing each of said plurality of speech signal channels by linear prediction technique to generate a predetermined number of coefficient parameters and a power parameter for each of said channels;
  
  wherein said means for analyzing a plurality of speech signal channels by Cepstrum technique includes a signal conditioning system for conditioning a Cepstrum signal which has, in the direction of an ordinate axis, an amplitude dimension of said Cepstrum signal and, in the direction of the abscissa, a quantity representing time, said signal conditioning system having, means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum, and means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.
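
The Cepstrum weighting recited in claim 55 can be sketched in a few lines of Python. This is a minimal illustration, not the patented circuit: the `slope` value, the function names, and the search range `lo`/`hi` are assumptions introduced here, and the claim does not say how the weighted signal is subsequently searched.

```python
def weight_cepstrum(cepstrum, slope=0.005):
    """Add a ramp that grows linearly with the time (abscissa) index.

    `slope` is a hypothetical constant; the claim only requires that the
    weighting signal increase linearly and be added to the Cepstrum.
    """
    return [value + slope * index for index, value in enumerate(cepstrum)]


def pick_peak_index(weighted, lo, hi):
    """Return the index of the largest weighted sample in [lo, hi).

    Picking the maximum is an assumed follow-on step for pitch estimation.
    """
    return max(range(lo, hi), key=lambda index: weighted[index])
```

Because the ramp is added rather than multiplied, a peak at a later time index is favored by a fixed offset that does not depend on the peak's own amplitude.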

56. A system for synthesizing speech in a plurality of speech synthesis channels in real time, said system comprising:
- pitch pulse generator means for receiving pitch period parameters for each of said synthesis channels and generating pitch pulses having a corresponding pitch period;
  
  random noise generator means;
  
  switch means connected to said pitch pulse generator means and to said random noise generator means for receiving voiced/unvoiced decision parameters for each of said synthesis channels, an output of said switch means being derived from said pitch pulse generator means if said decision for a channel is voiced and from said random noise generator means if said decision for a channel is unvoiced;
  
  gain control means connected to said output of said switch means for receiving power parameters for each of said synthesis channels, an output level of said gain control means being dependent upon said power parameter for each of said synthesis channels; and
  
  linear prediction filter means connected to said output of said gain control means for receiving and applying a plurality of coefficient parameters for each of said synthesis channels to said filter means to generate a replica of each of said speech signals at an output of said filter means.
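
The synthesizer chain of claim 56 (excitation source, switch, gain control, linear prediction filter) can be sketched for a single channel and a single frame as follows. Everything beyond the claim wording is an assumption made for illustration: the unit-impulse pitch pulse, the uniform noise source, the square-root mapping from power to gain, and the sign convention of the all-pole filter.

```python
import math
import random


def synthesize_frame(pitch_period, voiced, power, coeffs, n_samples, seed=0):
    """Generate one frame of synthetic speech for one channel.

    Assumed conventions (not taken from the claims):
      * voiced excitation is a unit impulse every `pitch_period` samples
      * unvoiced excitation is uniform noise in [-1, 1]
      * gain = sqrt(power)
      * filter: y[n] = x[n] + sum_k coeffs[k] * y[n - 1 - k]
    """
    rng = random.Random(seed)
    gain = math.sqrt(power)
    history = [0.0] * len(coeffs)  # most recent output first
    output = []
    for n in range(n_samples):
        # Switch: choose the excitation source from the voiced/unvoiced flag.
        if voiced:
            excitation = 1.0 if n % pitch_period == 0 else 0.0
        else:
            excitation = rng.uniform(-1.0, 1.0)
        x = gain * excitation  # gain control
        y = x + sum(a * h for a, h in zip(coeffs, history))  # all-pole filter
        if history:
            history = [y] + history[:-1]
        output.append(y)
    return output
```

A multi-channel version would keep one `history` list per synthesis channel and call the same routine with each channel's parameters in turn.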

57. A system for speech synthesis, comprising a synthesizer including:
- pitch pulse generator means for receiving pitch period parameters for each of said synthesis channels and generating pitch pulses having a corresponding pitch period;
  
  random noise generator means;
  
  switch means connected to said pitch pulse generator means and to said random noise generator means for receiving voiced/unvoiced decision parameters for each of said synthesis channels, an output of said switch means being derived from said pitch pulse generator means if said decision for a channel is voiced and from said random noise generator means if said decision for a channel is unvoiced;
  
  gain control means connected to said output of said switch means for receiving power parameters for each of said synthesis channels, an output level of said gain control means being dependent upon said power parameter for each of said synthesis channels; and
  
  linear prediction filter means connected to said output of said gain control means for receiving and applying a plurality of coefficient parameters for each of said synthesis channels to said filter means to generate a replica of each of said speech signals at an output of said filter means;
  
  wherein said pitch pulse generator means includes means responsive to receipt of said pitch period parameters for generating a sequence of control signals, and means responsive to said sequence of control signals for generating a substantially periodic, amplitude increasing pitch pulse excitation signal.
- - 58. A system for speech synthesis as defined in claim 57 wherein:
    - said control signal sequence generating means includes a counter producing a parallel output count in response to processing clock signals; and
      
      said means for generating said pitch pulse excitation signal includes a read-only memory means addressed by said parallel count output from said counter, the output of said read-only memory being said excitation signal.
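
The counter-addressed read-only memory of claim 58 can be modeled in software as a table lookup. The sketch below is illustrative only: the table contents in `PULSE_ROM`, the reset of the counter at the end of each pitch period, and the zero output once the counter runs past the table are all assumptions, since the claim does not give the stored waveform or the reset logic.

```python
# Hypothetical ROM contents: a short pulse whose amplitude increases with address.
PULSE_ROM = [1, 2, 4, 8]


def pitch_pulse_excitation(pitch_period, n_samples, rom=PULSE_ROM):
    """Produce an excitation signal by addressing `rom` with a sample counter.

    The counter advances once per processing clock and is assumed to reset
    after `pitch_period` counts, which makes the output periodic.
    """
    if pitch_period < 1:
        raise ValueError("pitch_period must be at least 1")
    output = []
    counter = 0
    for _ in range(n_samples):
        # Addresses beyond the stored pulse are assumed to read as zero.
        output.append(rom[counter] if counter < len(rom) else 0)
        counter += 1
        if counter >= pitch_period:
            counter = 0
    return output
```

With a pitch period of 6 and the four-entry table above, each period consists of the rising pulse followed by two zero samples.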

59. A speech analysis system for generating in real time a plurality of sets of parameters each representative of an analog speech signal as produced by a model speech generator, said system comprising:
- means for sampling each of said analog speech signals to produce a plurality of digital speech data signals;
  
  means for separating a fixed number of said speech data signals into a frame of speech data signals;
  
  means for analyzing said frame of said speech data with a Cepstrum generator to produce a digital speech pitch period parameter for said frame of data;
  
  means for analyzing said frame of data with a linear prediction coefficient generator to produce a plurality of digital coefficient parameters and a digital power parameter;
  
  means for analyzing said frame of speech data with a voiced/unvoiced decision generator to produce a digital voiced/unvoiced decision parameter; and
  
  means for combining, for each speech signal, said digital pitch period parameter, said plurality of digital coefficient parameters, said digital power parameter, and said digital voiced/unvoiced decision parameter into a set of parameters representative of each of said analog speech signals.
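
One common way to obtain linear prediction coefficients from a frame is the autocorrelation method solved with the Levinson-Durbin recursion. The claims do not name a particular algorithm, so the sketch below is an assumed stand-in for the "linear prediction coefficient generator", and treating the final prediction error energy as the power parameter is likewise an assumption.

```python
def autocorrelation(frame, max_lag):
    """Return r[0..max_lag], the short-time autocorrelation of `frame`."""
    n = len(frame)
    return [
        sum(frame[i] * frame[i + lag] for i in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def levinson_durbin(r, order):
    """Solve for predictor coefficients a[1..order] in x[n] ~ sum a[k] x[n-k].

    Returns (coefficients, residual_energy).
    """
    a = [0.0] * (order + 1)
    error = r[0]
    for i in range(1, order + 1):
        if error == 0:
            break  # silent frame: leave remaining coefficients at zero
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / error  # reflection coefficient for this stage
        updated = a[:]
        updated[i] = k
        for j in range(1, i):
            updated[j] = a[j] - k * a[i - j]
        a = updated
        error *= 1.0 - k * k
    return a[1:], error
```

A per-frame parameter set could then be assembled by pairing these outputs with the pitch period and the voiced/unvoiced flag produced by the other analyzers.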

60. A speech analysis system for generating a set of parameters representative of an analog speech signal as produced by a model speech generator, said system comprising:
- means for sampling said analog speech signal to produce a plurality of digital speech data signals;
  
  means for separating a fixed number of said speech data signals into a frame of speech data signals;
  
  means for analyzing said frame of said speech data with a Cepstrum generator to produce a digital speech pitch period parameter for said frame of data;
  
  means for analyzing said frame of data with a linear prediction coefficient generator to produce a plurality of digital coefficient parameters and a digital power parameter;
  
  means for analyzing said frame of speech data with a voiced/unvoiced decision generator to produce a digital voiced/unvoiced decision parameter; and
  
  means for combining said digital pitch period parameter, said plurality of digital coefficient parameters, said digital power parameter, and said digital voiced/unvoiced decision parameter into said set of parameters representative of said analog speech signal;
  
  wherein said means for analyzing said frame of speech data to produce a digital voiced/unvoiced decision parameter includes means for scanning a Cepstrum signal within an auxiliary signal range on each side of a Cepstrum peak amplitude signal, means for detecting and storing a range peak signal within the scanned auxiliary ranges, means for summing the absolute values of the signal within the auxiliary ranges to form a range signal sum, means for dividing said Cepstrum peak amplitude signal by said range peak signal to form a range peak factor, means for dividing said range signal sum by said Cepstrum peak amplitude signal to form an average peak factor, means for first comparing said range peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said range peak factor is above said upper limit and generating an unvoiced decision parameter if said range peak factor is equal to or below said lower limit, and means for secondarily comparing said average peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said average peak factor is equal to or less than said lower limit and generating an unvoiced decision parameter if said average peak factor is equal to or greater than said upper limit.
- - 61. A speech analysis system as defined in claim 60 wherein said means for analyzing said frame of speech data to produce a voiced/unvoiced decision parameter further includes:
    - means for thirdly comparing said Cepstrum peak amplitude signal factor with predetermined upper and lower limits and generating a voiced decision parameter if said Cepstrum peak amplitude is equal to or less than said lower limit and generating an unvoiced decision parameter if said Cepstrum peak amplitude is equal to or greater than said upper limit.
  - 62. A speech analysis system as defined in claim 61 wherein said means for analyzing said frame of speech data to produce a voiced/unvoiced decision parameter further includes:
    - means for monitoring said speech signal and counting the number of zero crossings in a predetermined time period; and
      
      means for comparing said number of zero crossings with a predetermined limit and generating an unvoiced decision parameter if said number of zero crossings is above said limit and generating a voiced decision parameter if said number of zero crossings is below said limit.
  - 63. A speech analysis system as defined in claim 62 wherein said means for analyzing said frame of speech data to produce a voiced/unvoiced decision parameter further includes:
    - estimate register means;
      
      means for comparing the output of said estimate register with a first autocorrelation coefficient produced by said linear prediction technique and setting said register to said first autocorrelation coefficient value if said coefficient is equal to or less than said register value; and
      
      means for integrating the value in said register means if said coefficient is greater than said estimate register value;
      
      means for subtracting said estimate register value from said first autocorrelation coefficient value to generate a difference value; and
      
      means for comparing said difference value with predetermined upper and lower limits and generating a voiced decision parameter if said difference is greater than said upper limit and generating an unvoiced decision parameter if said difference signal is less than said lower limit.
  - 64. A speech analysis system as defined in claim 63 wherein said means for analyzing said frame of speech data to produce a voiced/unvoiced decision parameter further includes:
    - means for comparing an error signal generated by said linear prediction technique with predetermined upper and lower limits and generating a voiced decision parameter if said error signal is equal to or less than said lower limit and generating an unvoiced decision parameter if said error signal is equal to or greater than said upper limit.
  - 65. A speech analysis system as defined in claim 64 wherein said means for analyzing said frame of speech data by Cepstrum technique includes a signal conditioning system for conditioning said Cepstrum signal which has, in the direction of an ordinate axis, an amplitude dimension of said Cepstrum signal and, in the direction of the abscissa, a quantity representing time, said signal conditioning system having:
    - means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum signal; and
      
      means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.
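
The first two voiced/unvoiced tests of claim 60 (range peak factor, then average peak factor) can be sketched as below. The auxiliary range is assumed to be `width` samples on each side of the peak, the range peak is assumed to be the largest absolute value in that range, and all limit values are hypothetical; a result of `None` means the value fell between the limits, so a later test (such as those of claims 61 to 64) would have to decide.

```python
def cepstral_peak_factors(cepstrum, peak_index, width):
    """Return (range_peak_factor, average_peak_factor) for one Cepstrum."""
    lo = max(0, peak_index - width)
    hi = min(len(cepstrum), peak_index + width + 1)
    # Auxiliary ranges: neighbours on both sides, excluding the peak itself.
    aux = [abs(cepstrum[n]) for n in range(lo, hi) if n != peak_index]
    peak = cepstrum[peak_index]
    range_peak = max(aux) if aux else 0.0
    range_sum = sum(aux)
    range_peak_factor = peak / range_peak if range_peak else float("inf")
    average_peak_factor = range_sum / peak if peak else float("inf")
    return range_peak_factor, average_peak_factor


def two_limit_decision(value, lower, upper, voiced_when_high):
    """Compare `value` with two limits; return None when it lies between them."""
    if voiced_when_high:
        if value > upper:
            return "voiced"
        if value <= lower:
            return "unvoiced"
    else:
        if value <= lower:
            return "voiced"
        if value >= upper:
            return "unvoiced"
    return None


def voiced_unvoiced(cepstrum, peak_index, width, rpf_limits, apf_limits):
    """Apply the range peak test first, then the average peak test."""
    rpf, apf = cepstral_peak_factors(cepstrum, peak_index, width)
    first = two_limit_decision(rpf, rpf_limits[0], rpf_limits[1], True)
    if first is not None:
        return first
    return two_limit_decision(apf, apf_limits[0], apf_limits[1], False)
```

Each limit pair is passed as `(lower, upper)`. A sharp, isolated peak yields a large range peak factor and a small average peak factor, and both point toward a voiced decision.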

66. A speech analysis system for generating a set of parameters representative of an analog speech signal as produced by a model speech generator, said system comprising:
- means for sampling said analog speech signal to produce a plurality of digital speech data signals;
  
  means for separating a fixed number of said speech data signals into a frame of speech data signals;
  
  means for analyzing said frame of said speech data with a Cepstrum generator to produce a digital speech pitch period parameter for said frame of data;
  
  means for analyzing said frame of data with a linear prediction coefficient generator to produce a plurality of digital coefficient parameters and a digital power parameter;
  
  means for analyzing said frame of speech data with a voiced/unvoiced decision generator to produce a digital voiced/unvoiced decision parameter; and
  
  means for combining said digital pitch period parameter, said plurality of digital coefficient parameters, said digital power parameter, and said digital voiced/unvoiced decision parameter into said set of parameters representative of said analog speech signal;
  
  wherein said means for analyzing said frame of speech data by Cepstrum technique includes a signal conditioning system for conditioning a Cepstrum signal which has, in the direction of an ordinate axis, an amplitude dimension of said Cepstrum signal and, in the direction of the abscissa, a quantity representing time, said signal conditioning system having, means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum, and means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.

67. A pitch pulse generating system for use in a speech synthesizer which receives a pitch period parameter, said pitch pulse generating system comprising:
- means responsive to receipt of said pitch period parameter for generating a sequence of control signals; and
  
  means responsive to said sequence of control signals for generating a substantially periodic, amplitude increasing pitch pulse excitation signal.
- - 68. A pitch pulse generating system as defined in claim 67 wherein:
    - said control signal sequence generating means includes a counter producing a parallel output count in response to processing clock signals; and
      
      said means for generating said pitch pulse excitation signal includes a read-only memory means addressed by said parallel count output from said counter and the output of said read-only memory being said excitation signal.

69. A speech analysis system for generating a set of parameters representative of an analog speech signal as produced by a model speech generator, said system comprising:
- means for sampling said analog speech signal to produce a plurality of digital speech data signals;
  
  means for separating a fixed number of said speech data signals into a frame of speech data signals;
  
  means for analyzing said frame of said speech data with a Cepstrum generator to produce a digital speech pitch period parameter for said frame of data;
  
  means for analyzing said frame of data with a linear prediction coefficient generator to produce a plurality of digital coefficient parameters and a digital power parameter;
  
  means for analyzing said frame of speech data with a voiced/unvoiced decision generator to produce a digital voiced/unvoiced decision parameter; and
  
  means for combining said digital pitch period parameter, said plurality of digital coefficient parameters, said digital power parameter, and said digital voiced/unvoiced decision parameter into said set of parameters representative of said analog speech signal;
  
  wherein said means for analyzing said frame of said speech data to produce a speech pitch period parameter includes means for interpolating between sample values in a sample data Cepstrum signal, and wherein a pitch period value is initially chosen as corresponding to a maximum amplitude sample, said interpolating system having means for storing said maximum value sample, an immediately preceding sample, and an immediately following sample, means for subtracting said immediately preceding sample from said immediately following sample to form a sample difference, means for dividing said sample difference by four to form a sample difference quotient, means for adding said immediately preceding sample to said immediately following sample to form a sample sum, means for dividing said sample sum by two to form a sample sum quotient, means for subtracting said maximum amplitude sample from said sample sum quotient to form a weighted sample sum quotient, means for dividing said sample difference quotient by said weighted sample sum quotient to form a pitch value increment, and means for adding said pitch value increment to said pitch period value to form an interpolated pitch value.
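
The interpolation steps recited in claim 69 translate directly into arithmetic. The sketch below follows the claim wording step by step; the function name and the fallback for a zero denominator are assumptions. Evaluated literally, a larger following sample lowers the estimate, which is the opposite sign to the textbook parabolic-vertex offset, so the sign convention intended for the hardware is not resolved here.

```python
def interpolate_pitch(preceding, maximum, following, pitch_period):
    """Refine an integer pitch period using the peak and its two neighbours."""
    sample_difference = following - preceding
    difference_quotient = sample_difference / 4
    sample_sum = preceding + following
    sum_quotient = sample_sum / 2
    weighted_sum_quotient = sum_quotient - maximum
    if weighted_sum_quotient == 0:
        # Flat top: no curvature to interpolate on (assumed fallback).
        return float(pitch_period)
    pitch_increment = difference_quotient / weighted_sum_quotient
    return pitch_period + pitch_increment
```

For example, with samples taken from a parabola whose true vertex lies a quarter sample after the maximum sample, the literal steps return the initial pitch period minus 0.25.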

70. A speech analysis system for generating a set of parameters representative of an analog speech signal as produced by a model speech generator, said system comprising:
- means for sampling said analog speech signal to produce a plurality of digital speech data signals;
  
  means for separating a fixed number of said speech data signals into a frame of speech data signals;
  
  means for analyzing said frame of said speech data with a Cepstrum generator to produce a digital speech pitch period parameter for said frame of data;
  
  means for analyzing said frame of data with a linear prediction coefficient generator to produce a plurality of digital coefficient parameters and a digital power parameter;
  
  means for analyzing said frame of speech data with a voiced/unvoiced decision generator to produce a digital voiced/unvoiced decision parameter; and
  
  means for combining said digital pitch period parameter, said plurality of digital coefficient parameters, said digital power parameter, and said digital voiced/unvoiced decision parameter into said set of parameters representative of said analog speech signal;
  
  wherein said means for analyzing said frame of data with a linear prediction coefficient generator includes a signal conditioning means for a frame of digital speech data samples, said signal conditioning means having means for generating a window signal having a characteristic waveshape which is completed in a data sample span which is shorter than said frame of data;
  
  means for centering said window signal data span in said frame of speech data samples; and
  
  means for multiplying corresponding sample positions of said frame of speech data and said window signal to generate a limited and windowed frame of speech data.
- - 71. A speech analysis system as defined in claim 70 wherein said means for analyzing said frame of said speech data to produce a speech pitch period parameter includes means for interpolating between sample values in a sample data Cepstrum signal, and wherein a pitch period value is initially chosen as corresponding to a maximum amplitude sample, said interpolating system having:
    - means for storing said maximum value sample, an immediately preceding sample, and an immediately following sample;
      
      means for subtracting said immediately preceding sample from said immediately following sample to form a sample difference;
      
      means for dividing said sample difference by four to form a sample difference quotient;
      
      means for adding said immediately preceding sample to said immediately following sample to form a sample sum;
      
      means for dividing said sample sum by two (2) to form a sample sum quotient;
      
      means for subtracting said maximum amplitude sample from said sample sum quotient to form a weighted sample sum quotient;
      
      means for dividing said sample difference quotient by said weighted sample sum quotient to form a pitch value increment; and
      
      means for adding said pitch value increment to said pitch period value to form an interpolated pitch value.
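
The signal conditioning of claim 70 (a window shorter than the frame, centered in the frame, then multiplied sample by sample) can be sketched as follows. The Hamming shape is an assumption chosen for illustration; the claim only requires "a characteristic waveshape", and the function names are hypothetical.

```python
import math


def centered_window(frame_len, window_len):
    """Return a frame-length window that is zero outside a centered span.

    A Hamming shape is assumed for the non-zero span.
    """
    if not 2 <= window_len < frame_len:
        raise ValueError("window_len must be at least 2 and shorter than the frame")
    start = (frame_len - window_len) // 2  # centers the span in the frame
    window = [0.0] * frame_len
    for i in range(window_len):
        window[start + i] = 0.54 - 0.46 * math.cos(2 * math.pi * i / (window_len - 1))
    return window


def apply_window(frame, window):
    """Multiply corresponding sample positions of the frame and the window."""
    return [sample * weight for sample, weight in zip(frame, window)]
```

Samples outside the centered span are multiplied by zero, so the result is both limited in extent and tapered, matching the "limited and windowed frame" wording.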

72. A speech analysis and synthesis system for generating a plurality of sets of parameters representative of a plurality of analog speech signals as produced by a model speech generator and reproducing in real time from said sets of parameters a replica of each of said analog speech signals, said system comprising:
- an analyzer including, means for sampling each analog speech signal to produce a plurality of digital speech data signals, means for separating said plurality of said speech data signals into a frame of speech data signals, means for analyzing said frame of speech data with a Cepstrum generator to produce a digital pitch period parameter for said frame of data, means for analyzing said frame of speech data with a voiced/unvoiced decision generator to produce a digital voiced/unvoiced decision parameter, and means for analyzing said frame of speech data with a linear prediction coefficient generator to produce a plurality of digital coefficient parameters and a digital power parameter; and
  
  a synthesizer including, digital pitch pulse generator means for receiving said digital pitch period parameter and generating for each original speech signal pitch pulses having a corresponding pitch period, digital random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said digital voiced/unvoiced decision parameter, an output of said switch means being derived from said pitch pulse generator means if said decision is voiced and from said random noise generator means if said decision is unvoiced, gain control means connected to said output of said switch means for receiving said digital power parameter, a digital output level of said gain control means being dependent upon said power parameter, digital linear prediction filter means connected to said output of said gain control means for receiving and applying said digital coefficient parameters to said filter means to generate a digital replica of each of said original speech signals at an output of said filter means, and means for converting said digital output of said filter means to a corresponding analog replica of each of said original analog speech signals.

73. A speech analysis and synthesis system for generating a set of parameters representative of an analog speech signal as produced by a model speech generator and reproducing from said set of parameters a replica of said analog speech signal, said system comprising:
- an analyzer including, means for sampling said analog speech signal to produce a plurality of digital speech data signals, means for separating said plurality of said speech data signals into a frame of speech data signals, means for analyzing said frame of speech data with a Cepstrum generator to produce a digital pitch period parameter for said frame of data, means for analyzing said frame of speech data with a voiced/unvoiced decision generator to produce a digital voiced/unvoiced decision parameter, and means for analyzing said frame of speech data with a linear prediction coefficient generator to produce a plurality of digital coefficient parameters and a digital power parameter; and
  
  a synthesizer including, digital pitch pulse generator means for receiving said digital pitch period parameter and generating pitch pulses having a corresponding pitch period, digital random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said digital voiced/unvoiced decision parameter, an output of said switch means being derived from said pitch pulse generator means if said decision is voiced and from said random noise generator means if said decision is unvoiced, gain control means connected to said output of said switch means for receiving said digital power parameter, a digital output level of said gain control means being dependent upon said power parameter, digital linear prediction filter means connected to said output of said gain control means for receiving and applying said digital coefficient parameters to said filter means to generate a digital replica of said speech signal at an output of said filter means, and means for converting said digital output of said filter means to a corresponding analog replica of said analog speech signal;
  
  wherein said means for analyzing said frame of speech data to produce a digital voiced/unvoiced decision parameter includes means for scanning a Cepstrum signal within an auxiliary signal range on each side of a Cepstrum peak amplitude signal, means for detecting and storing a range peak signal within the scanned auxiliary ranges, means for summing the absolute values of the signal within the auxiliary ranges to form a range signal sum, means for dividing said Cepstrum peak amplitude signal by said range peak signal to form a range peak factor;
  
  means for dividing said range signal sum by said Cepstrum peak amplitude signal to form an average peak factor, means for first comparing said range peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said range peak factor is equal to or above said upper limit and generating an unvoiced decision if said range peak factor is equal to or below said lower limit, and means for secondarily comparing said average peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said average peak factor is equal to or less than said lower limit and generating an unvoiced decision parameter if said average peak factor is equal to or greater than said upper limit.
- - 74. A speech analysis and synthesis system as defined in claim 73 wherein said means for analyzing said frame of speech data signal to produce a digital voiced/unvoiced decision parameter further includes:
    - means for thirdly comparing said Cepstrum peak amplitude signal with predetermined upper and lower limits and generating a voiced decision parameter if said Cepstrum peak amplitude is equal to or less than said lower limit and generating an unvoiced decision parameter if said Cepstrum peak amplitude is equal to or greater than said upper limit.
  - 75. A speech analysis and synthesis system as defined in claim 74 wherein said means for analyzing said frame of speech data to produce a digital voiced/unvoiced decision parameter further includes:
    - means for monitoring said speech signal and counting the number of zero crossings in a predetermined time period; and
      
      means for comparing said number of zero crossings with a predetermined limit and generating an unvoiced decision parameter if said number of zero crossings is above said limit and generating a voiced decision parameter if said number of zero crossings is below said limit.
  - 76. A speech analysis and synthesis system as defined in claim 75 wherein said means for analyzing said frame of speech data to produce a digital voiced/unvoiced decision parameter further includes:
    - estimate register means;
      
      means for comparing the output of said estimate register with a first autocorrelation coefficient produced by said linear prediction technique and setting said register to said first autocorrelation coefficient value if said coefficient is equal to or less than said register value;
      
      means for integrating the value in said register means if said coefficient is greater than said estimate register value;
      
      means for subtracting said estimate register value from said first autocorrelation coefficient value to generate a difference value; and
      
      means for comparing said difference value with predetermined upper and lower limits and generating a voiced decision parameter if said difference is greater than said upper limit and generating an unvoiced decision parameter if said difference signal is less than said lower limit.
  - 77. A speech analysis and synthesis system as defined in claim 75 wherein said means for analyzing said frame of speech data to produce a digital voiced/unvoiced decision parameter further includes:
    - means for comparing an error signal generated by said linear prediction coefficient generator with predetermined upper and lower limits and generating a voiced decision parameter if said error signal is equal to or less than said lower limit and generating an unvoiced decision parameter if said error signal is equal to or greater than said upper limit.
  - 78. A speech analysis and synthesis system as defined in claim 77 wherein said means for analyzing said frame of speech data with a Cepstrum generator includes a signal conditioning system for conditioning said Cepstrum signal which has, in the direction of an ordinate axis, an amplitude dimension of said Cepstrum signal, and, in the direction of the abscissa, a quantity representing time, said signal conditioning system having:
    - means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum signal; and
      
      means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.
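
Two of the auxiliary voiced/unvoiced tests in the dependent claims above, the zero-crossing count of claim 75 and the estimate register of claim 76, can be sketched as below. The interpretation of "integrating the value in said register" as adding a small fixed `step` is an assumption, as are the class and function names and the choice to update the register before forming the difference.

```python
def count_zero_crossings(samples):
    """Count sign changes between consecutive samples."""
    return sum(1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0))


def zero_crossing_decision(samples, limit):
    """Unvoiced above the limit, voiced below it, undecided (None) when equal."""
    crossings = count_zero_crossings(samples)
    if crossings > limit:
        return "unvoiced"
    if crossings < limit:
        return "voiced"
    return None


class EstimateRegister:
    """Tracks a slowly rising floor under the first autocorrelation coefficient."""

    def __init__(self, initial, step):
        self.value = initial
        self.step = step  # hypothetical integration increment per frame

    def update(self, first_autocorrelation):
        """Update the register and return the difference value."""
        if first_autocorrelation <= self.value:
            self.value = first_autocorrelation  # drop to the new minimum
        else:
            self.value += self.step  # assumed form of "integrating"
        return first_autocorrelation - self.value


def difference_decision(difference, lower, upper):
    """Voiced above the upper limit, unvoiced below the lower limit."""
    if difference > upper:
        return "voiced"
    if difference < lower:
        return "unvoiced"
    return None
```

Under this reading the register settles near the quietest recent frame energy, so a large difference marks a frame well above that floor.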

79. A speech analysis and synthesis system for generating a set of parameters representative of an analog speech signal as produced by a model speech generator and reproducing from said set of parameters a replica of said analog speech signal, said system comprising:
- an analyzer including, means for sampling said analog speech signal to produce a plurality of digital speech data signals, means for separating said plurality of said speech data signals into a frame of speech data signals, means for analyzing said frame of speech data with a Cepstrum generator to produce a digital pitch period parameter for said frame of data, means for analyzing said frame of speech data with a voiced/unvoiced decision generator to produce a digital voiced/unvoiced decision parameter, and means for analyzing said frame of speech data with a linear prediction coefficient generator to produce a plurality of digital coefficient parameters and a digital power parameter; and
  
  a synthesizer including, digital pitch pulse generator means for receiving said digital pitch period parameter and generating pitch pulses having a corresponding pitch period, digital random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said digital voiced/unvoiced decision parameter, an output of said switch means being derived from said pitch pulse generator means if said decision is voiced and from said random noise generator means if said decision is unvoiced, gain control means connected to said output of said switch means for receiving said digital power parameter, a digital output level of said gain control means being dependent upon said power parameter, digital linear prediction filter means connected to said output of said gain control means for receiving and applying said digital coefficient parameters to said filter means to generate a digital replica of said speech signal at an output of said filter means, and means for converting said digital output of said filter means to a corresponding analog replica of said analog speech signal;
  
  wherein said means for analyzing said frame of speech data with a Cepstrum generator includes a signal conditioning system for conditioning a Cepstrum signal which has, in the direction of an ordinate axis, an amplitude dimension of said Cepstrum signal and, in the direction of the abscissa, a quantity representing time, said signal conditioning system having, means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum, and means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.

80. A speech analysis and synthesis system for generating a set of parameters representative of an analog speech signal as produced by a model speech generator and reproducing from said set of parameters a replica of said analog speech signal, said system comprising:
- an analyzer including, means for sampling said analog speech signal to produce a plurality of digital speech data signals, means for separating said plurality of said speech data signals into a frame of speech data signals, means for analyzing said frame of speech data with a Cepstrum generator to produce a digital pitch period parameter for said frame of data, means for analyzing said frame of speech data with a voiced/unvoiced decision generator to produce a digital voiced/unvoiced decision parameter, and means for analyzing said frame of speech data with a linear prediction coefficient generator to produce a plurality of digital coefficient parameters and a digital power parameter; and
  
  a synthesizer including, digital pitch pulse generator means for receiving said digital pitch period parameter and generating pitch pulses having a corresponding pitch period, digital random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said digital voiced/unvoiced decision parameter, an output of said switch means being derived from said pitch pulse generator means if said decision is voiced and from said random noise generator means if said decision is unvoiced, gain control means connected to said output of said switch means for receiving said digital power parameter, a digital output level of said gain control means being dependent upon said power parameter, digital linear prediction filter means connected to said output of said gain control means for receiving and applying said digital coefficient parameters to said filter means to generate a digital replica of said speech signal at an output of said filter means, and means for converting said digital output of said filter means to a corresponding analog replica of said analog speech signal;
  
  wherein said digital pitch pulse generator means comprises means responsive to receipt of said digital pitch period parameter, for generating a sequence of control signals, and means responsive to said sequence of control signals for generating a substantially periodic, amplitude increasing pitch pulse excitation signal.
- - 81. A speech analysis and synthesis system as defined in claim 80 wherein:
    - said control signal sequence generating means includes a counter producing a parallel output count in response to processing clock signals; and
      
      said means for generating said pitch pulse excitation signal includes a read-only memory means addressed by said parallel count output from said counter and the output of said read-only memory being said excitation signal.
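As a rough software analogue of the counter-addressed read-only memory recited in claims 80 and 81, the sketch below steps a counter once per processing clock and uses the count as a table address. The table contents (`PULSE_ROM`), its length, and the rule of restarting the counter once per pitch period are illustrative assumptions; the claims only require a substantially periodic, amplitude-increasing excitation signal.

```python
# Hypothetical ROM contents: an amplitude-increasing pulse shape.
PULSE_ROM = [1, 2, 4, 8, 16, 32, 64, 127]

def pitch_pulse_excitation(pitch_period, num_samples, rom=PULSE_ROM):
    """Return num_samples of excitation with one ROM-shaped pulse per period."""
    if pitch_period < 1:
        raise ValueError("pitch_period must be at least 1 sample")
    out = []
    counter = 0  # parallel count that addresses the ROM
    for _ in range(num_samples):
        # Addresses past the stored pulse read as zero (silence between pulses).
        out.append(rom[counter] if counter < len(rom) else 0)
        counter += 1
        if counter >= pitch_period:  # assumed control signal: restart each period
            counter = 0
    return out
```

With a pitch period of 10 samples, each period holds the eight table values followed by two zeros.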

82. A speech analysis and synthesis system for generating a set of parameters representative of an analog speech signal as produced by a model speech generator and reproducing from said set of parameters a replica of said analog speech signal, said system comprising:
- an analyzer including, means for sampling said analog speech signal to produce a plurality of digital speech data signals, means for separating said plurality of said speech data signals into a frame of speech data signals, means for analyzing said frame of speech data with a Cepstrum generator to produce a digital pitch period parameter for said frame of data, means for analyzing said frame of speech data with a voiced/unvoiced decision generator to produce a digital voiced/unvoiced decision parameter, and means for analyzing said frame of speech data with a linear prediction coefficient generator to produce a plurality of digital coefficient parameters and a digital power parameter; and
  
  a synthesizer including, digital pitch pulse generator means for receiving said digital pitch period parameter and generating pitch pulses having a corresponding pitch period, digital random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said digital voiced/unvoiced decision parameter, an output of said switch means being derived from said pitch pulse generator means if said decision is voiced and from said random noise generator means if said decision is unvoiced, gain control means connected to said output of said switch means for receiving said digital power parameter, a digital output level of said gain control means being dependent upon said power parameter, digital linear prediction filter means connected to said output of said gain control means for receiving and applying said digital coefficient parameters to said filter means to generate a digital replica of said speech signal at an output of said filter means, and means for converting said digital output of said filter means to a corresponding analog replica of said analog speech signal;
  
  wherein said means for analyzing said frame of said speech data to produce a digital pitch period parameter includes means for interpolating between sample values in a sample data Cepstrum signal wherein a pitch period value is initially chosen as corresponding to a maximum amplitude sample, said interpolating system having means for storing said maximum value sample, an immediately preceding sample, and an immediately following sample, means for subtracting said immediately preceding sample from said immediately following sample to form a sample difference, means for dividing said sample difference by four to form a sample difference quotient, means for adding said immediately preceding sample to said immediately following sample to form a sample sum, means for dividing said sample sum by two to form a sample sum quotient, means for subtracting said maximum amplitude sample from said sample sum quotient to form a weighted sample sum quotient, means for dividing said sample difference quotient by said weighted sample sum quotient to form a pitch value increment, and means for adding said pitch value increment to said pitch period value to form an interpolated pitch value.
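The interpolation steps recited in claim 82 can be traced one by one in a short sketch. The function name, the use of floating point, and the handling of a zero divisor are assumptions made for illustration. Each subtraction is performed in the order the claim words it, so the increment has the magnitude of the vertex offset of a parabola through the three samples (at most half a sample when the centre sample is the maximum) and comes out negative when the following sample is the larger neighbour. Whether the hardware relies on a different sign or indexing convention is not stated in the claim.

```python
def interpolate_pitch(prev_sample, max_sample, next_sample, pitch_value):
    """Apply the arithmetic steps of the claim in the order they are recited."""
    sample_difference = next_sample - prev_sample
    difference_quotient = sample_difference / 4
    sample_sum = prev_sample + next_sample
    sum_quotient = sample_sum / 2
    weighted_sum_quotient = sum_quotient - max_sample
    if weighted_sum_quotient == 0:
        # The three samples are collinear, so no refinement is possible
        # (an assumed guard; the claim does not address this case).
        return float(pitch_value)
    pitch_increment = difference_quotient / weighted_sum_quotient
    return pitch_value + pitch_increment
```

For samples 1.0, 3.0, 2.0 around a peak at index 50, the steps give 0.25 / (1.5 - 3.0), an increment of one sixth of a sample in magnitude.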

83. A speech analysis and synthesis system for generating a set of parameters representative of an analog speech signal as produced by a model speech generator and reproducing from said set of parameters a replica of said analog speech signal, said system comprising:
- an analyzer including, means for sampling said analog speech signal to produce a plurality of digital speech data signals, means for separating said plurality of said speech data signals into a frame of speech data signals, means for analyzing said frame of speech data with a Cepstrum generator to produce a digital pitch period parameter for said frame of data, means for analyzing said frame of speech data with a voiced/unvoiced decision generator to produce a digital voiced/unvoiced decision parameter, and means for analyzing said frame of speech data with a linear prediction coefficient generator to produce a plurality of digital coefficient parameters and a digital power parameter; and
  
  a synthesizer including, digital pitch pulse generator means for receiving said digital pitch period parameter and generating pitch pulses having a corresponding pitch period, digital random noise generator means, switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said digital voiced/unvoiced decision parameter, an output of said switch means being derived from said pitch pulse generator means if said decision is voiced and from said random noise generator means if said decision is unvoiced, gain control means connected to said output of said switch means for receiving said digital power parameter, a digital output level of said gain control means being dependent upon said power parameter, digital linear prediction filter means connected to said output of said gain control means for receiving and applying said digital coefficient parameters to said filter means to generate a digital replica of said speech signal at an output of said filter means, and means for converting said digital output of said filter means to a corresponding analog replica of said analog speech signal;
  
  wherein said means for analyzing said frame of data with a linear prediction coefficient generator includes a signal conditioning means for a frame of digital speech data samples, said signal conditioning means having means for generating a window signal having a characteristic waveshape which is completed in a data sample span which is shorter than said frame of data, means for centering said window signal data sample span in said frame of speech data samples, and means for multiplying corresponding sample positions of said frame of speech data and said window signal to generate a limited and windowed frame of speech data.
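The windowing recited in claim 83 can be illustrated with a minimal sketch. It assumes a Hamming shape as the "characteristic waveshape" and assumes that samples outside the window span are set to zero; neither choice is stated in the claim, and the function and parameter names are hypothetical.

```python
import math

def window_frame(frame, span):
    """Multiply a frame by a window of `span` samples centred in the frame."""
    n = len(frame)
    if not 2 <= span < n:
        raise ValueError("span must be at least 2 and shorter than the frame")
    start = (n - span) // 2  # centre the window span in the frame
    out = [0.0] * n          # samples outside the span are zeroed (assumed)
    for i in range(span):
        # Hamming shape, an assumed example of a characteristic waveshape.
        w = 0.54 - 0.46 * math.cos(2 * math.pi * i / (span - 1))
        out[start + i] = frame[start + i] * w
    return out
```

Because the window is shorter than the frame, only the central `span` samples contribute to the subsequent linear prediction analysis.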
- - 84. A speech analysis and synthesis system as defined in claim 83 wherein said means for analyzing said frame of said speech data to produce a digital pitch period parameter includes means for interpolating between sample values in a sample data Cepstrum signal wherein a pitch period value is initially chosen as corresponding to a maximum amplitude sample, said interpolating system having:
    - means for storing said maximum value sample, an immediately preceding sample, and an immediately following sample;
      
      means for subtracting said immediately preceding sample from said immediately following sample to form a sample difference;
      
      means for dividing said sample difference by four to form a sample difference quotient;
      
      means for adding said immediately preceding sample to said immediately following sample to form a sample sum;
      
      means for dividing said sample sum by two (2) to form a sample sum quotient;
      
      means for subtracting said maximum amplitude sample from said sample sum quotient to form a weighted sample sum quotient;
      
      means for dividing said sample difference quotient by said weighted sample sum quotient to form a pitch value increment; and
      
      means for adding said pitch value increment to said pitch period value to form an interpolated pitch value.

85. A speech synthesis system for generating in real time an analog replica of each of a plurality of original analog speech signals represented by corresponding sets of digital parameters, each set of said parameters including a digital speech pitch period parameter, a digital voiced/unvoiced decision parameter, a plurality of digital coefficient parameters and a digital power parameter, said synthesis system including:
- digital pitch pulse generator means for receiving said digital pitch period parameter and generating pitch pulses having a corresponding pitch period;
  
  digital random noise generator means;
  
  switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said digital voiced/unvoiced decision parameter, an output of said switch means being derived from said pitch pulse generator means if said decision is voiced and from said random noise generator means if said decision is unvoiced;
  
  gain control means connected to said output of said switch means for receiving said digital power parameter, a digital output level of said gain control means being dependent upon said power parameter;
  
  digital linear prediction filter means connected to said output of said gain control means for receiving and applying said digital coefficient parameters to said filter means to generate a digital replica of each of said plurality of speech signals at an output of said filter means; and
  
  means for converting said digital output of said filter means to a corresponding analog replica of each of said plurality of analog speech signals.
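The signal path recited in claim 85 (excitation source, voiced/unvoiced switch, gain control, linear prediction filter) can be sketched for a single channel and a single frame as follows. The unit-impulse pitch pulse, the uniform noise source, the use of the power parameter directly as a gain, and the sign convention of the predictor coefficients are all assumptions for illustration. The digital-to-analog conversion and the multi-channel multiplexing are omitted.

```python
import random

def synthesize_frame(pitch_period, voiced, coeffs, power, num_samples, seed=0):
    """Excitation -> switch -> gain -> all-pole linear prediction filter."""
    rng = random.Random(seed)
    history = [0.0] * len(coeffs)  # previous outputs, most recent first
    out = []
    for n in range(num_samples):
        if voiced:
            excitation = 1.0 if n % pitch_period == 0 else 0.0  # pitch pulse
        else:
            excitation = rng.uniform(-1.0, 1.0)                 # random noise
        sample = power * excitation  # gain control (power used as gain, assumed)
        # Assumed predictor convention: s[n] = g*e[n] + sum(a[k] * s[n-1-k]).
        sample += sum(a * s for a, s in zip(coeffs, history))
        history = [sample] + history[:-1]
        out.append(sample)
    return out
```

With a single coefficient of 0.5, a gain of 2.0, and a pitch period of 4, each pulse produces a decaying response that is re-excited every fourth sample.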

86. A speech synthesis system for generating an analog replica of an original analog speech signal represented by a set of digital parameters, said parameters including a digital speech pitch period parameter, a digital voiced/unvoiced decision parameter, a plurality of digital coefficient parameters and a digital power parameter, said synthesis system including:
- digital pitch pulse generator means for receiving said digital pitch period parameter and generating pitch pulses having a corresponding pitch period;
  
  digital random noise generator means;
  
  switch means connected to said pitch pulse generator means and to said random noise generator means for receiving said digital voiced/unvoiced decision parameter, an output of said switch means being derived from said pitch pulse generator means if said decision is voiced and from said random noise generator means if said decision is unvoiced;
  
  gain control means connected to said output of said switch means for receiving said digital power parameter, a digital output level of said gain control means being dependent upon said power parameter;
  
  digital linear prediction filter means connected to said output of said gain control means for receiving and applying said digital coefficient parameters to said filter means to generate a digital replica of said speech signal at an output of said filter means; and
  
  means for converting said digital output of said filter means to a corresponding analog replica of said analog speech signal;
  
  wherein said digital pitch pulse generator means includes means responsive to receipt of said digital pitch period parameter, for generating a sequence of control signals, and means responsive to said sequence of control signals for generating a substantially periodic, amplitude increasing pitch pulse excitation signal.
- - 87. A speech synthesis system as defined in claim 86 wherein:
    - said control signal sequence generating means includes a counter producing a parallel output count in response to processing clock signals; and
      
      said means for generating said pitch pulse excitation signal includes a read-only memory means addressed by said parallel count output from said counter and the output of said read-only memory being said excitation signal.

88. A signal conditioning system for conditioning a Cepstrum signal having as a dimension, in the direction of the ordinate axis, the amplitude of said Cepstrum signal and as a dimension, in the direction of the abscissa, a quantity representing time, said signal conditioning system comprising:
- means for generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum signal; and
  
  means for adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.
- - 89. A signal conditioning system for conditioning a Cepstrum signal as defined in claim 88 wherein:
    - said means for generating said weighting signal includes a first adder having an output connected as an input to a register, said register having an output connected as a first input to said first adder, said first adder having as a second input an incrementing constant connected to said first adder through a first gate, said gate being enabled as an incremental element of said Cepstrum signal is gated through a second gate to a first input to a second adder, whereby said register is incremented for each element of said Cepstrum signal which is gated into said second adder; and
      
      said means for adding said weighting signal to said Cepstrum signal includes means for connecting the output of said register as a second input to said second adder, whereby the incremented output of said register is added to each element of said Cepstrum signal and an output of said second adder is said weighted Cepstrum signal.

90. A method for conditioning a Cepstrum signal having as a dimension in the direction of the ordinate axis the amplitude of said Cepstrum signal and as a dimension in the direction of the abscissa a quantity representing time, said method comprising the steps of:
- generating a weighting signal which linearly increases in value with the dimension in the direction of the abscissa corresponding to said quantity representing time of said Cepstrum signal; and
  
  adding said weighting signal to said Cepstrum signal along said dimension in the direction of the abscissa thereof to generate a weighted Cepstrum signal.
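The weighting recited in claims 88 to 90 amounts to adding a linear ramp to the Cepstrum samples. The sketch below models the ramp as a running register that grows by a constant for each element, loosely following the adder-and-register arrangement of claim 89. The increment value, the starting value of zero, and incrementing before the addition are assumptions.

```python
def weight_cepstrum(cepstrum, increment):
    """Add a linearly increasing weighting signal to each cepstrum element."""
    weighted = []
    register = 0.0  # running weight, assumed to start at zero
    for element in cepstrum:
        register += increment  # weight grows by a constant per element
        weighted.append(element + register)
    return weighted
```

Later samples on the time axis receive a larger additive weight than earlier ones.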

91. A system for generating a voiced/unvoiced decision parameter in a speech analysis system wherein a pitch parameter is generated by a Cepstrum technique and coefficient parameters are generated by a linear prediction technique, said system comprising:
- means for scanning said Cepstrum signal within an auxiliary sample range on each side of a Cepstrum peak amplitude sample;
  
  means for detecting and storing a range peak sample within the scanned auxiliary ranges;
  
  means for summing the absolute values of the samples within the auxiliary ranges to form a range sample sum;
  
  means for dividing said Cepstrum peak amplitude sample by said range peak sample to form a range peak factor;
  
  means for dividing said range sample sum by said Cepstrum peak amplitude sample to form an average peak factor;
  
  means for first comparing said range peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said range peak factor is above said upper limit and generating an unvoiced decision parameter if said range peak factor is equal to or below said lower limit; and
  
  means for secondly comparing said average peak factor with predetermined upper and lower limits and generating a voiced decision parameter if said average peak factor is equal to or less than said lower limit and generating an unvoiced decision parameter if said average peak factor is equal to or greater than said upper limit.
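The two-stage comparison of claim 91 can be sketched as follows. The threshold values, the width of the auxiliary ranges, and the `None` result when both tests are inconclusive are hypothetical. The sketch also assumes that a range peak factor above the upper limit indicates voiced speech, that the range peak is the largest sample rather than the largest magnitude, and that both the Cepstrum peak and the range peak are positive.

```python
def voiced_unvoiced(cepstrum, peak_index, half_range,
                    range_limits=(1.5, 3.0), average_limits=(2.0, 6.0)):
    """Return "voiced", "unvoiced", or None when both tests are inconclusive."""
    peak = cepstrum[peak_index]
    lo = max(0, peak_index - half_range)
    hi = min(len(cepstrum), peak_index + half_range + 1)
    # Auxiliary ranges on each side of the peak, excluding the peak itself.
    aux = cepstrum[lo:peak_index] + cepstrum[peak_index + 1:hi]
    range_peak = max(aux)
    range_sum = sum(abs(x) for x in aux)

    range_peak_factor = peak / range_peak
    average_peak_factor = range_sum / peak

    lower, upper = range_limits
    if range_peak_factor > upper:
        return "voiced"  # assumed: a dominant peak indicates voicing
    if range_peak_factor <= lower:
        return "unvoiced"
    lower, upper = average_limits
    if average_peak_factor <= lower:
        return "voiced"
    if average_peak_factor >= upper:
        return "unvoiced"
    return None
```

The second comparison is reached only when the range peak factor falls between its two limits, which is how the sketch resolves frames that the first test leaves ambiguous.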

92. A signal conditioning system for a frame of digital speech data samples, said system comprising:
- means for generating a window signal having a characteristic waveshape which is completed in a data sample span which is shorter than said frame of data;
  
  means for centering said window signal data sample span in said frame of speech data samples; and
  
  means for multiplying corresponding sample positions of said frame of speech data and said window signal to generate a limited and windowed frame of speech data.


Current Assignee: Sofstats International Incorporated
Original Assignee: International Communication Systems Corporation
Inventors: Thompson, Robert L.; Wade, Fred B.; Wilkes, Arthur L.
Primary Examiner(s): Claffy, Kathleen H.
Assistant Examiner(s): Kemeny, E. S.
Application Number: US05/593,861
Time in Patent Office: 862 Days
Field of Search: 179/1 SA, 179/1 SM
US Class Current: 704/220
CPC Class Codes: G10L 19/06 Determination or coding of ...; H04B 1/66 for reducing bandwidth of s...
