Speech coding system and method including voicing cut off frequency analyzer

US 6,078,880 A
Filed: 07/13/1998
Issued: 06/20/2000
Est. Priority Date: 07/13/1998
Status: Expired due to Term

First Claim

Patent Images

1. A method of encoding a speech signal, said method comprising the steps of:

obtaining at least one frame of said speech signal;

estimating a voicing cutoff frequency for said at least one frame of said speech signal, including the step of providing said at least one frame to a bandpass filter having outputs corresponding to eight voicing cutoff frequency bands represented by three voicing index signal bits, and filtering said at least one frame of said speech signal to determine a voicing cutoff frequency for said at least one frame of said speech signal, wherein each of said eight voicing frequency cutoff frequency bands corresponds to a voicing cutoff frequency value selected from the group comprising 0 Hz, 571 Hz, 1143 Hz, 1714 Hz, 2286 Hz, 2857 Hz, 3249 Hz, and 4000 Hz.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A speech coding system and associated method relies on a speech encoder (15) and a speech decoder (20). The speech encoder (15) includes a voicing cut off frequency analyzer (60). Voicing cut off frequency analyzer (60) includes voicing cut off frequency estimator (61) and voicing cut off frequency quantizer (62). Voicing cut off frequency estimator (61) estimates a voicing cut off frequency value for respective samples of an input speech waveform (1). To accomplish this, voicing cut off frequency estimator (61) utilizes a bandpass filter to estimate a frequency above which a sample of speech is voiced and below which the sample of speech is unvoiced. Voicing cut off frequency quantizer (62) quantizes the estimated voicing cut off frequency value and provides, for respective samples, a voicing cut off frequency index signal (6) which may be stored or transmitted. Voicing cut off frequency index signal (6) may comprise as few as 1 bit, and in a preferred embodiment, as few as 3 bits.

70 Citations

View as Search Results

13 Claims

1. A method of encoding a speech signal, said method comprising the steps of:
- obtaining at least one frame of said speech signal;
  
  estimating a voicing cutoff frequency for said at least one frame of said speech signal, including the step of providing said at least one frame to a bandpass filter having outputs corresponding to eight voicing cutoff frequency bands represented by three voicing index signal bits, and filtering said at least one frame of said speech signal to determine a voicing cutoff frequency for said at least one frame of said speech signal, wherein each of said eight voicing frequency cutoff frequency bands corresponds to a voicing cutoff frequency value selected from the group comprising 0 Hz, 571 Hz, 1143 Hz, 1714 Hz, 2286 Hz, 2857 Hz, 3249 Hz, and 4000 Hz.

2. A method of encoding a speech signal, said method comprising the steps of:
- obtaining at least one frame of said speech signal;
  
  estimating a voicing cutoff frequency for said at least one frame of said speech signal;
  
  quantizing said voicing cutoff frequency to provide a voicing cutoff frequency index signal corresponding to said at least one frame, said voicing cutoff frequency index signal comprising one or more binary digits;
  
  storing said voicing cutoff frequency index signal in memory; and
  
  utilizing said voicing cutoff frequency value, including the step of generating harmonics to produce voiced excitation, the number of harmonics generated being determined according to the formula;
  
  ##EQU7## wherein;
  
  nh is the number of harmonics generated,fsel is the integer representation of said voicing cutoff frequency index signal, andf₀ is the fundamental frequency of said voiced portion of synthesized speech.
- View Dependent Claims (3)
- - 3. The method of claim 2 wherein said voiced excitation comprises epoch (i), and wherein:
    - ##EQU8##

4. A device including a voicing cutoff frequency analyzer, for encoding a speech signal, said voicing cutoff frequency analyzer comprising:
- a voicing cutoff frequency estimator adapted to receive respective frames of said speech signal and to determine a corresponding voicing cutoff frequency value for each of said respective frames, and to provide said voicing cutoff frequency value to an output port, said voicing cutoff frequency estimator comprising a bandpass filter means; and
  
  a voicing cutoff frequency quantizer in communication with said output port of said voicing cutoff frequency estimator, said voicing cutoff frequency quantizer being adapted to receive said voicing cutoff frequency value at an input port and to provide a voicing cutoff frequency index signal at an output port;
  
  wherein said bandpass filter means comprises;
  
  a first band adapted to pass frequencies from about 0 Hz to about 570 Hz;
  
  a second band adapted to pass frequencies from about 571 Hz to about 1142 Hz;
  
  a third band adapted to pass frequencies from about 1143 Hz to about 1713 Hz;
  
  a fourth band adapted to pass frequencies from about 1714 Hz to about 2285 Hz;
  
  a fifth band adapted to pass frequencies from about 2286 Hz to about 2856 Hz;
  
  a sixth band adapted to pass frequencies from about 2857 Hz to about 3248 Hz; and
  
  a seventh band adapted to pass frequencies from about 3249 Hz to about 4000 Hz.

5. A method for generating a voicing cutoff frequency identifying a frequency below which periodic voicing predominates and above which noise components predominate, said method comprising the steps of:
- applying speech to be analyzed to a bank of filters including bandpass filters, to thereby generate signals in disparate bands;
  
  rendering unipolar the signals in each of the disparate bands, to produce unipolar signals;
  
  filtering said unipolar signals associated with each of said disparate bands, to thereby eliminate direct signal components in the resulting filtered unipolar signals associated with each of said disparate bands;
  
  autocorrelating each of said filtered unipolar signals associated with said disparate bands, to thereby determine the maximum value of the autocorrelated signal associated with each band;
  
  determining a voicing cutoff frequency by comparing each of said maximum values with a fixed threshold value, to thereby associate with each of said bands one of a true and false state; and
  
  selecting as a putative voicing cutoff frequency the upper frequency of that band which both (a) has a true state and (b) represents a frequency below which no two adjacent bands have a zero state.
- View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13)
- - 6. A method according to claim 5, wherein said step of filtering said unipolar signals comprises the step of high-pass filtering.
  - 7. A method according to claim 6, wherein said step of filtering said unipolar signals comprises the step of low-pass filtering said unipolar signals in each of said disparate bands, to thereby eliminate high-frequency components.
  - 8. A method according to claim 5, wherein said putative voicing cutoff frequency is the highest frequency in the associated one of said bands, said method further comprising the steps of:
    - if the pitch period is less than a particular number of sample lags, the first half-frame rms value differs from the second half-frame rms value by a factor of four, and the putative voicing frequency is one of (a) zero voicing and (b) the lowest possible non-zero voicing frequency, the voicing cutoff frequency is deemed to be that band representing zero frequency.
  - 9. A method according to claim 8, further comprising the steps of:
    - if (a) the putative voicing frequency is one of (i) zero voicing and (ii) the lowest possible non-zero voicing frequency,(b) the previous frame'"'"'s RMS power is less than the current frame'"'"'s RMS power, and (c) next future frame'"'"'s RMS power is greater than the current frame'"'"'s RMS power, and (d) and the current frame RMS power is less than a first particular value, then said putative voicing cutoff frequency is deemed to be the voicing cutoff frequency.
  - 10. A method according to claim 9, comprising the further steps of:
    - if (a) the previous frame'"'"'s voicing cutoff frequency is within a frequency range represented any of the three lowest possible voicing frequencies, including zero, and (b) the current putative voicing cutoff frequency is zero, and (c) the previous frame'"'"'s RMS value is less than about four-fifths of the current frame'"'"'s RMS value, and (d) the next future frame'"'"'s RMS is less than about four-fifths of the current frame'"'"'s RMS value, then said putative voicing cutoff frequency is deemed to be the voicing cutoff frequency.
  - 11. A method according to claim 9, comprising the further steps of:
    - if (a) the putative voicing cutoff frequency represents one of the two lowest possible voicing frequencies, including zero frequency, (b) the current frame'"'"'s RMS value is less than about four-fifths the previous frame'"'"'s RMS value, and (c) the current frame'"'"'s RMS value is less than about four-fifths of the future frame'"'"'s RMS value, and (d) the current frame'"'"'s RMS is less than a second particular value, then said putative voicing cutoff frequency is deemed to be the voicing cutoff frequency.
  - 12. A method according to claim 11, comprising the further steps of:
    - if (a) said putative voicing frequency is one of the four lowest possible, including zero, (b) the current frame'"'"'s RMS is greater than a third particular value, (c) the zero crossing count is less than a particular value and (d) the peakiness ratio is greater than a fifth particular value, then the voicing cutoff frequency is set to a value greater than four.
  - 13. A method according to claim 11, comprising the further steps of:
    - if the putative voicing cutoff frequency is one of the four lowest possible values, and the current frame'"'"'s RMS value is greater than a selected value significantly greater than said fifth particular value, then the voicing cutoff frequency is set to a value greater than four.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Lockheed Martin Corporation (Martin Marietta Corporation)
Original Assignee
Lockheed Martin Corporation (Martin Marietta Corporation)
Inventors
Grabb, Mark Lewis, Zinser, Richard Louis Jr., Koch, Steven Robert, Brooksby, Glen William
Primary Examiner(s)
Hudspeth, David R.
Assistant Examiner(s)
Smits, Talivaldis Ivars

Application Number

US09/114,660
Time in Patent Office

708 Days
Field of Search

704/208, 704/214
US Class Current

704/208
CPC Class Codes

G10L 2025/937 Signal energy in various fr...

G10L 25/93 Discriminating between voic...

Speech coding system and method including voicing cut off frequency analyzer

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

70 Citations

13 Claims

Specification

Solutions

Use Cases

Quick Links

Speech coding system and method including voicing cut off frequency analyzer

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

70 Citations

13 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links