Multi-stage pitch and mixed voicing estimation for harmonic speech coders

US 6,438,517 B1
Filed: 04/27/2000
Issued: 08/20/2002
Est. Priority Date: 05/19/1998
Status: Expired due to Term

First Claim

Patent Images

1. A method of modeling the voiced or unvoiced characteristics of a segment of an input signal, comprising the steps of:

receiving a pitch value associated with said input speech signal;

comparing a synthesized speech signal to said input speech signal on a harmonic by harmonic basis;

for each harmonic, determining whether said harmonic is voiced or unvoiced;

counting the number of said harmonics that are voiced;

calculating a cut-off frequency of said input speech signal, using the ratio of the results of said counting step and the total number of said harmonics, such that said cut-off frequency represents a frequency below which said speech signal is assumed to be voiced and above which said speech signal is comprised of both voiced and unvoiced speech; and

generating a synthesized representation of said speech signal using said pitch value such that for each harmonic that falls below the cut-off frequency the harmonics are assumed to be voiced and for each harmonic above the cut-off frequency the harmonics are assumed to be mixed using both voiced and unvoiced energies for each harmonic.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A “multi-stage” method of estimating pitch in a speech encoder (FIG. 2). In a first stage of the method, a set of candidate pitch values is selected, such as by using a cost function that operates on said speech signal (steps 21-23). In a second stage of the method, a best candidate is selected. Specifically, in the second stage, pitch values calculated from previous speech segments are used to calculate an average pitch value (step 25). Then, depending on whether the average pitch value is short or long, one of two different analysis-by-synthesis (ABS) processes is then repeated for each candidate, such that for each iteration, a synthesized signal is derived from that pitch candidate and compared to a reference signal to provide an error value. A time domain ABS process is used if the average pitch is short (step 27), whereas a frequency domain ABS process is used if the average pitch is long (step 28). After the ABS process provides an error for each pitch candidate, the pitch candidate having the smallest error is deemed to be the best candidate.

15 Citations

View as Search Results

4 Claims

1. A method of modeling the voiced or unvoiced characteristics of a segment of an input signal, comprising the steps of:
- receiving a pitch value associated with said input speech signal;
  
  comparing a synthesized speech signal to said input speech signal on a harmonic by harmonic basis;
  
  for each harmonic, determining whether said harmonic is voiced or unvoiced;
  
  counting the number of said harmonics that are voiced;
  
  calculating a cut-off frequency of said input speech signal, using the ratio of the results of said counting step and the total number of said harmonics, such that said cut-off frequency represents a frequency below which said speech signal is assumed to be voiced and above which said speech signal is comprised of both voiced and unvoiced speech; and
  
  generating a synthesized representation of said speech signal using said pitch value such that for each harmonic that falls below the cut-off frequency the harmonics are assumed to be voiced and for each harmonic above the cut-off frequency the harmonics are assumed to be mixed using both voiced and unvoiced energies for each harmonic.
- View Dependent Claims (2, 3, 4)
- - 2. The method of claim 1, wherein said step of generating a synthesized representation is performed by sampling said input speech at harmonics of said pitch.
  - 3. The method of claim 1, wherein said step of determining whether said harmonic is voiced or unvoiced is performed by comparing an error value provided by said comparing step to a threshold associated with said harmonic.
  - 4. The method of claim 1, wherein said step of calculating a cut-off frequency is performed by multiplying said ratio times an encoding frequency range.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Texas Instruments, Inc.
Original Assignee
Texas Instruments, Inc.
Inventors
Yeldener, Suat
Primary Examiner(s)
{haeck over (S)}mits, Ta̅livaldis Ivars
Assistant Examiner(s)
ARMSTRONG, ANGELA A

Application Number

US09/559,040
Time in Patent Office

845 Days
Field of Search

704/206, 704/207, 704/208, 704/220
US Class Current

704/208
CPC Class Codes

G10L 19/02 using spectral analysis, e....

G10L 19/10 the excitation function bei...

Multi-stage pitch and mixed voicing estimation for harmonic speech coders

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

15 Citations

4 Claims

Specification

Solutions

Use Cases

Quick Links

Multi-stage pitch and mixed voicing estimation for harmonic speech coders

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

15 Citations

4 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links