Spectral magnitude representation for multi-band excitation speech coders

US 5,754,974 A
Filed: 02/22/1995
Issued: 05/19/1998
Est. Priority Date: 02/22/1995
Status: Expired due to Term

First Claim

Patent Images

1. A method for encoding a speech signal into a plurality of digital bits from which the speech signal can later be synthesized, the method comprising the steps of:

processing the speech signal to divide the signal into a plurality of speech frames, each of the speech frames representing a time interval of the speech signal;

processing the speech frames to determine voicing information for a plurality of frequency bands of the speech frames;

processing the speech frames to determine spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands, andquantizing and encoding the spectral magnitudes and the voicing information for subsequent use in decoding and synthesizing the speech signal,wherein the processing of the speech frames to determine spectral magnitudes and the quantizing and encoding of the spectral magnitudes is done in such a manner that spectral magnitudes independent of the voicing information are available for later synthesizing.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method for encoding a speech signal into digital bits including the steps of dividing the speech signal into speech frames representing time intervals of the speech signal, determining voicing information for frequency bands of the speech frames, and determining spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands. The method further includes quantizing and encoding the spectral magnitudes and the voicing information. The steps of determining, quantizing and encoding the spectral magnitudes is done is such a manner that the spectral magnitudes independent of voicing information are available for later synthesizing.

120 Citations

View as Search Results

19 Claims

1. A method for encoding a speech signal into a plurality of digital bits from which the speech signal can later be synthesized, the method comprising the steps of:
- processing the speech signal to divide the signal into a plurality of speech frames, each of the speech frames representing a time interval of the speech signal;
  
  processing the speech frames to determine voicing information for a plurality of frequency bands of the speech frames;
  
  processing the speech frames to determine spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands, andquantizing and encoding the spectral magnitudes and the voicing information for subsequent use in decoding and synthesizing the speech signal,wherein the processing of the speech frames to determine spectral magnitudes and the quantizing and encoding of the spectral magnitudes is done in such a manner that spectral magnitudes independent of the voicing information are available for later synthesizing.
- View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
- - 3. The subject matter of claim 1 or 2, wherein the speech signal is processed to estimate a parameter representative of the fundamental frequency, and the determined frequencies are harmonic multiples of the fundamental frequency.
  - 4. The subject matter of claim 3, wherein the parameter representative of the fundamental frequency is quantized and encoded for each of the speech frames, so that the digital bits include information representing the spectral magnitudes, voicing information, and fundamental frequency.
  - 5. The subject matter of claim 4, wherein the digital bits include redundant bits providing forward error correction coding.
  - 6. The subject matter of claim 5 wherein the forward error correction coding includes Golay codes and Hamming codes.
  - 7. The subject matter of claim 3, wherein the processing of the speech frames to determine the spectral magnitudes is done independently of the voicing information for the frame.
  - 8. The subject matter of claim 7, wherein the voicing information represents whether particular frequency bands within a speech frame are processed as voiced or unvoiced bands, and the processing to determine spectral magnitudes determines the spectral magnitude for a particular determined frequency independently of whether the determined frequency is in a frequency band that is voiced or unvoiced.
  - 9. The subject matter of claim 3, wherein the processing to determine spectral magnitudes includes a spectral transformation of the speech frames from time domain samples to frequency samples, and wherein the spectral magnitudes are formed as weighted sums of the frequency samples.
  - 10. The subject matter of claim 9, wherein weights used in producing the weighted sums have the effect of compensating for the sampling grid used in the spectral transformation.
  - 11. The subject matter of claim 1 or 2, wherein the processing of the speech frames to determine the spectral magnitudes is done independently of the voicing information for the frame.
  - 12. The subject matter of claim 11, wherein the voicing information represents whether particular frequency bands within a speech frame are processed as voiced or unvoiced bands, and the processing to determine spectral magnitudes determines the spectral magnitude for a particular determined frequency independently of whether the determined frequency is in a frequency band that is voiced or unvoiced.
  - 13. The subject matter of claim 1 or 2, wherein the processing to determine spectral magnitudes includes a spectral transformation of the speech frames from time domain samples to frequency samples, and wherein the spectral magnitudes are formed as weighted sums of the frequency samples.
  - 14. The subject matter of claim 13 wherein weights used in producing the weighted sums have the effect of compensating for the sampling grid used in the spectral transformation.

2. Apparatus for encoding a speech signal into a plurality of digital bits from which the speech signal can later be synthesized, the apparatus comprising:
- means for processing the speech signal to divide the signal into a plurality of speech frames, each of the speech frames representing a time interval of the speech signal;
  
  means for processing the speech frames to determine voicing information for a plurality of frequency bands of the speech frames;
  
  means for processing the speech frames to determine spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands, andmeans for quantizing and encoding the spectral magnitudes and the voicing information for subsequent use in decoding and synthesizing the speech signal,wherein the processing of the speech frames to determine spectral magnitudes and the quantizing and encoding of the spectral magnitudes is done in such a manner that spectral magnitudes independent of the voicing information are available for later synthesizing.

15. A method for encoding a speech signal into a plurality of digital bits from which the speech signal can later be synthesized, the method comprising the steps of:
- processing the speech signal to divide the signal into a plurality of speech frames, each of the speech frames representing a time interval of the speech signal;
  
  processing the speech frames to determine voicing information for a plurality of frequency bands of the speech frames;
  
  processing the speech frames to determine spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands, andquantizing and encoding the spectral magnitudes and the voicing information for subsequent use in decoding and synthesizing the speech signal,wherein the processing to determine spectral magnitudes includes a spectral transformation of the speech frames from time domain samples to frequency samples, and wherein the spectral magnitudes are formed as weighted sums of the frequency samples.
- View Dependent Claims (17, 18, 19)
- - 17. The subject matter of claim 15 or 16, wherein weights used in producing the weighted sums have the effect of compensating for the sampling grid used in the spectral transformation.
  - 18. The subject matter of claim 15 or 16, wherein the speech signal is processed to estimate a parameter representative of the fundamental frequency, and the determined frequencies are harmonic multiples of the fundamental frequency.
  - 19. The subject matter of claim 18, wherein the parameter representative of the fundamental frequency is quantized and encoded for each of the speech frames, so that the digital bits include information representing the spectral magnitudes, voicing information, and fundamental frequency.

16. Apparatus for encoding a speech signal into a plurality of digital bits from which the speech signal can later be synthesized, the apparatus comprising:
- means for processing the speech signal to divide the signal into a plurality of speech frames, each of the speech frames representing a time interval of the speech signal,means for processing the speech frames to determine voicing information for a plurality of frequency bands of the speech frames;
  
  means for processing the speech frames to determine spectral magnitudes representative of the magnitudes of the spectrum at determined frequencies across the frequency bands, andmeans for quantizing and encoding the spectral magnitudes and the voicing information for subsequent use in decoding and synthesizing the speech signal,wherein the processing to determine spectral magnitudes includes a spectral transformation of the speech frames from time domain samples to frequency samples, and wherein the spectral magnitudes are formed as weighted sums of the frequency samples.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Digital Voice Systems, Inc.
Original Assignee
Digital Voice Systems, Inc.
Inventors
Griffin, Daniel W., Hardwick, John C.
Primary Examiner(s)
MacDonald, Allen R.
Assistant Examiner(s)
Smits, Talivaldis Ivars

Application Number

US08/392,188
Time in Patent Office

1,182 Days
Field of Search

395/2.14, 395/2.15, 395/2.16, 395/2.17, 395/2.28, 395/2.31, 395/2.32, 395/2.73, 395/2.75, 395/2.77, 381/41, 381/51
US Class Current

704/206
CPC Class Codes

G10L 19/10 the excitation function bei...

G10L 25/93 Discriminating between voic...

Spectral magnitude representation for multi-band excitation speech coders

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

120 Citations

19 Claims

Specification

Solutions

Use Cases

Quick Links

Spectral magnitude representation for multi-band excitation speech coders

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

120 Citations

19 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links