Voiced/unvoiced estimation of an acoustic signal

US 5,216,747 A
Filed: 11/21/1991
Issued: 06/01/1993
Est. Priority Date: 09/20/1990
Status: Expired due to Term

First Claim

Patent Images

1. A method for encoding an acoustic signal, the method comprising the steps of:

A. breaking the signal into segments, each of the segments representing one of a succession of time intervals;

B. breaking each of said segments into a plurality of frequency bands; and

C. considering in turn each of the segments as the current segment, and for each of a plurality of said frequency bands of the current segment making a voiced/unvoiced decision by a method comprising the steps of;

evaluating a voicing measure for said frequency band;

making the voiced/unvoiced decision for said frequency band based upon a comparison between the voicing measure and a threshold;

determining an energy measure of the current segment;

determining a measure of the signal energy of one or more recent prior segments;

comparing the energy measure of the current segment to the measure of the signal energy of the one or more recent prior segments; and

adjusting the threshold to make a voiced decision more likely when the energy measure of the current segment is greater than the measure of the signal energy of the one or more recent prior segments.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The pitch estimation method is improved. Sub-integer resolution pitch values are estimated in making the initial pitch estimate; the sub-integer pitch values are preferably estimated by interpolating intermediate variables between integer values. Pitch regions are used to reduce the amount of computation required in making the initial pitch estimate. Pitch-dependent resolution is used in making the initial pitch estimate, with higher resolution being used for smaller values of pitch. The accuracy of the voiced/unvoiced decision is improved by making the decision dependent on the energy of the current segment relative to the energy of recent prior segments; if the relative energy is low, the current segment favors an unvoiced decision; if high, it favors a voiced decision. Voiced harmonics are generated using a hybrid approach; some voiced harmonics are generated in the time domain, whereas the remaining harmonics are generated in the frequency domain; this preserves much of the computational savings of the frequency domain approach, while at the same time improving speech quality. Voiced harmonics generated in the frequency domain are generated with higher frequency accuracy; the harmonics are frequency scaled, transformed into the time domain with a Discrete Fourier Transform, interpolated and then time scaled.

Citations

10 Claims

1. A method for encoding an acoustic signal, the method comprising the steps of:
- A. breaking the signal into segments, each of the segments representing one of a succession of time intervals;
  
  B. breaking each of said segments into a plurality of frequency bands; and
  
  C. considering in turn each of the segments as the current segment, and for each of a plurality of said frequency bands of the current segment making a voiced/unvoiced decision by a method comprising the steps of;
  
  evaluating a voicing measure for said frequency band;
  
  making the voiced/unvoiced decision for said frequency band based upon a comparison between the voicing measure and a threshold;
  
  determining an energy measure of the current segment;
  
  determining a measure of the signal energy of one or more recent prior segments;
  
  comparing the energy measure of the current segment to the measure of the signal energy of the one or more recent prior segments; and
  
  adjusting the threshold to make a voiced decision more likely when the energy measure of the current segment is greater than the measure of the signal energy of the one or more recent prior segments.
- View Dependent Claims (4, 5, 6)
- - 4. The method of claim, 1, 2 or 3 wherein the energy measure of the current segment ξ
    - ₀ is ##EQU18## wherein ω
      
      is frequency, H(ω
      
      ) is a frequency dependent weighting function, and S_w (ω
      
      ) is the Fourier transform of the acoustic signal.
  - 5. The method of claim 1, 2 or 3 wherein the voicing measure,D₁, is ##EQU19## wherein w is a windowing function, S_w (ω
    - ) is the Fourier transform of the acoustic signal, S_w (ω
      
      ) is the voiced spectrum used to model the acoustic signal, ω
      
      is frequency, and Ω
      
      _i are the boundaries of the frequency bands.
  - 6. The method of claim 1, 2 or 3 wherein said threshold, T.sub.ξ
    - (P,ω
      
      ), is updated according to the equation
      space="preserve" listing-type="equation">T.sub.ξ
      
      (P,ω
      
      )=T(P,ω
      
      )·
      
      M(ξ
      
      .sub.0,ξ
      
      .sub.avg,ξ
      
      .sub.min,.xi..sub.max)
      wherein ξ
      
      ₀ is the energy measure of the current segment, ξ
      
      _avg is an average local energy calculated according to the recurrence equation
      space="preserve" listing-type="equation">ξ
      
      .sub.avg =(1-γ
      
      .sub.0)ξ
      
      .sub.avg +γ
      
      .sub.0 ·
      
      ξ
      
      .sub.0
      ξ
      
      _max is a maximum local energy calculated according to the recurrence equation ##EQU20## ξ
      
      _min is a minimum local energy calculated according to the recurrence equation ##EQU21## M(ξ
      
      ₀, ξ
      
      _avg, ξ
      
      _min, ξ
      
      _max) is calculated by the equation ##EQU22## P is pitch, and λ
      
      ₀, λ
      
      ₁, λ
      
      ₂, μ
      
      , ξ
      
      _silence γ
      
      ₀, γ
      
      ₁, γ
      
      ₂, γ
      
      ₃, γ
      
      ₄, are constants.

2. A method for encoding an acoustic signal, the method comprising the steps of:
- A. breaking the signal into segments, each of the segments representing one of a succession of time intervals;
  
  B. breaking each of said segments into a plurality of frequency bands; and
  
  C. considering in turn each of the segments as the current segment, and for each of a plurality of said frequency bands of the current segment making a voiced/unvoiced decision by a method comprising the steps of;
  
  evaluating a voicing measure for said frequency band;
  
  making the voiced/unvoiced decision for said frequency band based upon a comparison between the voicing measure and a threshold;
  
  determining an energy measure of the current segment;
  
  determining a measure of the signal energy of one or more recent prior segments;
  
  comparing the energy measure of the current segment to the measure of the signal energy of the one or more recent prior segments; and
  
  adjusting the threshold to make an unvoiced decision more likely when the energy measure of the current segment is less than the measure of the signal energy of the one or more recent prior segments.
- View Dependent Claims (3)
- - 3. The method of claim 2 comprising the further step ofadjusting the threshold to make a voiced decision more likely when the energy measure of the current segment is greater than the measure of the signal energy of the one or more recent prior segments.

7. A method for encoding an acoustic signal, the method comprising the steps of:
- A. breaking the signal into segments, each of the segments representing one of a succession of time intervals;
  
  B. considering in turn each of the segments as the current segment, and making a voiced/unvoiced decision for at least a frequency band of the current segment by a method comprising the steps of;
  
  evaluating a voicing measure for said frequency band;
  
  making the voiced/unvoiced decision for said frequency band based upon a comparison between the voicing measure and a threshold;
  
  determining an energy measure of the current segment;
  
  determining a measure of the signal energy of one or more consecutive preceding segments;
  
  comparing the energy measure of the current segment to the measure of the signal energy of the consecutive preceding segments;
  
  adjusting the threshold to make a voiced decision more likely when the energy measure of the current segment is greater than the measure of the signal energy of the consecutive preceding segments.
- View Dependent Claims (10)
- - 10. The method of any of claims 7, 8, or 9 wherein said consecutive preceding segments are those segments immediately preceding the current segment.

8. A method for encoding an acoustic signal, the method comprising the steps of:
- A. breaking the signal into segments, each of the segments representing one of a succession of time intervals;
  
  B. considering in turn each of the segments as the current segment, and making a voiced/unvoiced decision for at least a frequency band of the current segment by a method comprising the steps of;
  
  evaluating a voicing measure for said frequency band;
  
  making the voiced/unvoiced decision for said frequency band based upon a comparison between the voicing measure and a threshold;
  
  determining an energy measure of the current segment;
  
  determining a measure of the signal energy of one or more consecutive preceding segments;
  
  comparing the energy measure of the current segment to the measure of the signal energy of the consecutive preceding segments;
  
  adjusting the threshold to make a voiced decision less likely when the energy measure of the current segment is less than the measure of the signal energy of the consecutive preceding segments.
- View Dependent Claims (9)
- - 9. The method of claim 8 comprising the futher step of:
    - adjusting the threshold to make a voiced decision more likely when the energy measure of the current segment is greater than the measure of the signal energy of the consecutive preceding segments.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Digital Voice Systems, Inc.
Original Assignee
Digital Voice Systems, Inc.
Inventors
Hardwick, John C., Lim, Jae S.
Primary Examiner(s)
Fleming, Michael R.
Assistant Examiner(s)
Doerrler, Michelle

Application Number

US07/795,803
Time in Patent Office

558 Days
Field of Search

381/29-51, 395/2
US Class Current

704/208
CPC Class Codes

G10L 19/087   using mixed excitation mode...

G10L 25/90   Pitch determination of spee...

G10L 25/93   Discriminating between voic...

Voiced/unvoiced estimation of an acoustic signal

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

Citations

10 Claims

Specification

Solutions

Use Cases

Quick Links

Voiced/unvoiced estimation of an acoustic signal

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

10 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links