Multi-stage pitch and mixed voicing estimation for harmonic speech coders

US 6,456,965 B1
Filed: 05/19/1998
Issued: 09/24/2002
Est. Priority Date: 05/20/1997
Status: Expired due to Term

Abstract

A “multi-stage” method of estimating pitch in a speech encoder (FIG. 2). In a first stage of the method, a set of candidate pitch values is selected, such as by using a cost function that operates on the speech signal (steps 21-23). In a second stage of the method, a best candidate is selected. Specifically, in the second stage, pitch values calculated from previous speech segments are used to calculate an average pitch value (step 25). Then, depending on whether the average pitch value is short or long, one of two different analysis-by-synthesis (ABS) processes is repeated for each candidate, such that for each iteration, a synthesized signal is derived from that pitch candidate and compared to a reference signal to provide an error value. A time domain ABS process is used if the average pitch is short (step 27), whereas a frequency domain ABS process is used if the average pitch is long (step 28). After the ABS process provides an error for each pitch candidate, the pitch candidate having the smallest error is deemed to be the best candidate.
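The second stage described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: the function names, the `short_threshold` value of 40 samples, and both error measures are assumptions introduced here. In particular, `time_domain_error` and `freq_domain_error` are simple stand-ins for the time domain and frequency domain ABS processes, whose details are not given in this record.

```python
import math

def time_domain_error(segment, period):
    """Stand-in time-domain error (assumption): how poorly the segment
    matches a copy of itself delayed by one candidate period."""
    idx = range(period, len(segment))
    num = sum((segment[n] - segment[n - period]) ** 2 for n in idx)
    den = sum(segment[n] ** 2 for n in idx)
    return num / den if den > 0 else 0.0

def freq_domain_error(segment, period):
    """Stand-in frequency-domain error (assumption): share of signal energy
    that does not sit at harmonics of the candidate fundamental 1/period."""
    n_samples = len(segment)
    total = sum(x * x for x in segment)
    if total == 0:
        return 0.0
    captured = 0.0
    for k in range(1, (period - 1) // 2 + 1):  # harmonics below Nyquist
        w = 2.0 * math.pi * k / period
        c = sum(x * math.cos(w * n) for n, x in enumerate(segment))
        s = sum(x * math.sin(w * n) for n, x in enumerate(segment))
        captured += 2.0 * (c * c + s * s) / n_samples
    return max(0.0, 1.0 - captured / total)

def select_best_candidate(segment, candidates, previous_pitches,
                          short_threshold=40,  # hypothetical cutoff, in samples
                          time_error=time_domain_error,
                          freq_error=freq_domain_error):
    """Pick the candidate pitch period with the smallest error.

    The average of the previous pitch periods decides which evaluator
    is applied to every candidate: time domain if short, else frequency domain.
    """
    average = sum(previous_pitches) / len(previous_pitches)
    evaluate = time_error if average < short_threshold else freq_error
    errors = {p: evaluate(segment, p) for p in candidates}
    best = min(errors, key=errors.get)
    return best, errors

# Usage: a sine with a 20-sample period and three candidate periods.
segment = [math.sin(2.0 * math.pi * n / 20) for n in range(200)]
best, errors = select_best_candidate(segment, [15, 20, 27], [19, 20, 21])
print(best, errors)
```

Passing the two evaluators as parameters keeps the short/long dispatch separate from whatever error measure a real coder would use.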

8 Claims

1. A method of estimating the pitch of a segment of a speech signal, comprising the steps of:
- selecting a set of initial pitch candidates by dividing the pitch range into sub-ranges, applying a pitch cost function to input samples, and selecting a pitch candidate for each said sub-range for which the pitch cost function is maximized, determining an input pitch period using at least one previously calculated pitch value from prior segments of said speech signal;
  
  determining whether said determined pitch period from prior segments is short or long; and
  
  for each pitch candidate, if said average pitch period is short having just a few harmonics such that it is easier to match time domain waveforms, using a time domain pitch estimation process to evaluate each said pitch candidate, or if said average pitch period is long being more than a few harmonics and not easier to match time domain waveforms, using a frequency domain pitch estimation process to evaluate each said pitch candidate.
  - 2. The method of claim 1, wherein said selecting step is performed using a frequency domain cost function.
  - 3. The method of claim 1, wherein said selecting step is performed using a time domain cost function.
  - 4. The method of claim 1, wherein said sub-ranges are determined logarithmically with smaller sub-ranges for shorter pitch periods and longer sub-ranges for longer pitch periods.
  - 5. The method of claim 1, wherein said time domain pitch estimation process is an analysis by synthesis process.
  - 6. The method of claim 1, wherein said frequency domain pitch estimation process is an analysis by synthesis process.
  - 7. The method of claim 1, wherein said time domain pitch estimation process and said frequency domain pitch estimation process provide an error value for each said pitch candidate and further comprising the step of determining which one of said pitch candidates has a minimum error value.
  - 8. The method of claim 1, wherein said step of determining an input pitch period is performed by calculating an average pitch period from a number of said prior segments.
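
The candidate-selection step of claim 1, combined with the logarithmic sub-ranges of claim 4, might look something like the sketch below. Everything named here is an assumption made for illustration: the helper names, the 16 to 128 sample pitch range, the choice of three sub-ranges, and the normalized autocorrelation used as the cost function (claim 3 only says a time domain cost function may be used; it does not say which one).

```python
import math

def log_subranges(min_period, max_period, count):
    """Split [min_period, max_period] into `count` half-open lag ranges
    whose widths grow geometrically (narrow for short periods)."""
    ratio = (max_period / min_period) ** (1.0 / count)
    bounds = [round(min_period * ratio ** i) for i in range(count + 1)]
    bounds[0] = min_period
    bounds[-1] = max_period + 1  # make the last range include max_period
    ranges = [(bounds[i], bounds[i + 1]) for i in range(count)]
    return [(lo, hi) for lo, hi in ranges if lo < hi]  # drop empty ranges

def autocorr_cost(samples, lag):
    """Assumed cost function: normalized autocorrelation at the given lag."""
    idx = range(lag, len(samples))
    num = sum(samples[n] * samples[n - lag] for n in idx)
    e_now = sum(samples[n] ** 2 for n in idx)
    e_past = sum(samples[n - lag] ** 2 for n in idx)
    den = math.sqrt(e_now * e_past)
    return num / den if den > 0 else 0.0

def initial_candidates(samples, subranges, cost=autocorr_cost):
    """Return one candidate per sub-range: the lag that maximizes the cost."""
    return [max(range(lo, hi), key=lambda lag: cost(samples, lag))
            for lo, hi in subranges]

# Usage: a sine with a 40-sample period, searched over three sub-ranges.
samples = [math.sin(2.0 * math.pi * n / 40) for n in range(400)]
subranges = log_subranges(16, 128, 3)
print(subranges)
print(initial_candidates(samples, subranges))
```

Each sub-range contributes exactly one candidate, so multiples and sub-multiples of the true period survive the first stage and are left for the second stage to resolve.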

Current Assignee: Texas Instruments, Inc.
Original Assignee: Texas Instruments, Inc.
Inventors: Yeldener, Suat
Primary Examiner(s): Dorvil, Richemond
Assistant Examiner(s): ARMSTRONG, ANGELA A
Application Number: US09/081,410
Time in Patent Office: 1,589 Days
Field of Search: 704/207, 704/208, 704/214, 704/223, 704/220
US Class Current: 704/207
CPC Class Codes: G10L 25/90 Pitch determination of spee...
