Voiced speech preprocessing employing waveform interpolation or a harmonic model
Abstract
Voiced speech preprocessing employs waveform interpolation or a harmonic model circuit to smooth a transition region and simplify speech coding. At low bit rates, the speech is coded by a system that maintains a high perceptual quality in the transition region from a voiced (quasi-periodic) portion of the speech signal to an unvoiced (non-periodic) portion of the speech signal. Similarly, the transition region from an unvoiced portion to a voiced portion is conditioned to maintain a high perceptual quality at a low bandwidth. The transition region from one type of voiced region to another type of voiced region is also smoothed. The transition region is smoothed to create a quasi-periodic speech signal.
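The smoothing described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented implementation: the function names, the linear cross-fade, and the use of a single constant pitch lag (in samples) are hypothetical choices that do not appear in the source. A forward pitch extension continues the voiced portion into the transition region by repeating its last pitch cycle; a backward extension does the same in the opposite direction.

```python
def forward_pitch_extension(voiced, lag, n_out):
    """Continue a voiced segment forward by n_out samples, assuming a constant pitch lag."""
    if lag <= 0 or lag > len(voiced):
        raise ValueError("lag must be between 1 and len(voiced)")
    out = list(voiced)
    for _ in range(n_out):
        # Each new sample copies the sample exactly one pitch lag earlier.
        out.append(out[-lag])
    return out[len(voiced):]


def backward_pitch_extension(voiced, lag, n_out):
    """Continue a voiced segment backward by n_out samples (time-reversed forward extension)."""
    reversed_ext = forward_pitch_extension(list(reversed(voiced)), lag, n_out)
    return list(reversed(reversed_ext))


def smooth_transition(transition, extension):
    """Cross-fade from the periodic extension into the original transition samples.

    The linear cross-fade is an assumption made for this sketch.
    """
    if len(transition) != len(extension):
        raise ValueError("transition and extension must have the same length")
    n = len(transition)
    out = []
    for i in range(n):
        w = (i + 1) / (n + 1)  # weight on the original signal rises linearly
        out.append((1 - w) * extension[i] + w * transition[i])
    return out


if __name__ == "__main__":
    voiced = [1, 2, 3, 1, 2, 3]          # toy quasi-periodic signal, pitch lag of 3 samples
    ext = forward_pitch_extension(voiced, lag=3, n_out=4)
    print(ext)                            # periodic continuation of the voiced portion
    print(smooth_transition([0, 0, 0, 0], ext))
```

In a real codec the lag would come from the pitch track of the voiced portion and could be interpolated across the frame rather than held constant; the sketch keeps it constant for brevity.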
29 Claims
1. A speech codec comprising:
a failure detection circuit configured to initiate a frequency transformation of a speech signal using a harmonic model circuit when said failure detection circuit detects at least one of a long term pre-processing circuit failure, a long term processing circuit failure, and an irregular voice speech portion of the speech signal;
a classifier configured to process parameters that identify a transition region between at least two portions of the speech signal, one of the at least two portions of the speech signal being a voiced portion; and
a periodic smoothing circuit configured to smooth the transition region represented by at least one of a weighted representation of the speech signal, a residual signal, and the speech signal using at least one of an interpolated pitch lag and a constant pitch lag, the interpolated pitch lag being derived from a pitch track corresponding to the voiced portion of the speech signal, wherein the periodic smoothing circuit is configured to use at least one of a forward pitch extension and a backward pitch extension.
Dependent claims: 2, 3, 4, 5
6. A speech coding system comprising:
a failure detection circuit configured to initiate a frequency transformation of a speech signal using a harmonic model circuit when said failure detection circuit detects at least one of a long term pre-processing circuit failure, a long term processing circuit failure, and an irregular voice speech portion of the speech signal;
a classifier that is configured to detect a transition region between at least two portions of the speech signal, at least one portion of the speech signal being a periodic portion; and
a periodic smoothing circuit that is configured to smooth the transition region using at least one of a forward pitch extension and a backward pitch extension, with either being derived from a pitch track corresponding to the periodic portion of the speech signal.
Dependent claims: 7, 8, 9
10. A method of smoothing a transition region comprising:
initiating a frequency transformation of a speech signal using a harmonic model circuit when at least one of a long term pre-processing circuit failure, a long term processing circuit failure, and an irregular voice speech portion of the speech signal is detected;
detecting a transition region between a periodic portion and a second portion of the speech signal; and
smoothing the transition region using at least one of a forward pitch extension and a backward pitch extension, with either being derived from a pitch track corresponding to the periodic portion of the speech signal.
Dependent claims: 11, 12, 13, 14
15. A speech codec comprising:
a failure detection circuit configured to initiate a waveform interpolation of a speech signal in the time domain when said failure detection circuit detects at least one of a long term pre-processing circuit failure, a long term processing circuit failure, and an irregular voice speech portion of the speech signal;
a classifier configured to process parameters that identify a transition region between at least two portions of the speech signal, one of the at least two portions of the speech signal being a voiced portion; and
a periodic smoothing circuit configured to smooth the transition region represented by at least one of a weighted representation of the speech signal, a residual signal, and the speech signal using at least one of an interpolated pitch lag and a constant pitch lag, the interpolated pitch lag being derived from a pitch track corresponding to the voiced portion of the speech signal, wherein the periodic smoothing circuit is configured to use at least one of a forward pitch extension and a backward pitch extension.
Dependent claims: 16, 17, 18, 19
20. A speech coding system comprising:
a failure detection circuit configured to initiate a waveform interpolation of a speech signal in the time domain when said failure detection circuit detects at least one of a long term pre-processing circuit failure, a long term processing circuit failure, and an irregular voice speech portion of the speech signal;
a classifier that is configured to detect a transition region between at least two portions of the speech signal, at least one portion of the speech signal being a periodic portion; and
a periodic smoothing circuit that is configured to smooth the transition region using at least one of a forward pitch extension and a backward pitch extension, with either being derived from a pitch track corresponding to the periodic portion of the speech signal.
Dependent claims: 21, 22, 23, 24
25. A method of smoothing a transition region comprising:
initiating a waveform interpolation of a speech signal in the time domain when at least one of a long term pre-processing circuit failure, a long term processing circuit failure, and an irregular voice speech portion of the speech signal is detected;
detecting a transition region between a periodic portion and a second portion of the speech signal; and
smoothing the transition region using at least one of a forward pitch extension and a backward pitch extension, with either being derived from a pitch track corresponding to the periodic portion of the speech signal.
Dependent claims: 26, 27, 28, 29
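Claims 15, 20, and 25 recite a waveform interpolation of the speech signal in the time domain. As a rough illustration of what such an interpolation can look like, the Python sketch below generates intermediate pitch cycles by linearly blending two prototype cycles. It is a sketch under assumptions, not the claimed method: the function name, the equal-length prototypes, and the linear blending rule are hypothetical and are not taken from the source.

```python
def interpolate_cycles(proto_a, proto_b, n_cycles):
    """Generate n_cycles intermediate pitch cycles between two prototype cycles.

    Assumes both prototypes have the same length (one pitch period each);
    a practical coder would first align and length-normalize them.
    """
    if len(proto_a) != len(proto_b):
        raise ValueError("prototype cycles must have the same length")
    if n_cycles < 1:
        raise ValueError("n_cycles must be at least 1")
    out = []
    for k in range(1, n_cycles + 1):
        alpha = k / (n_cycles + 1)  # blend weight moves from proto_a toward proto_b
        out.extend((1 - alpha) * a + alpha * b for a, b in zip(proto_a, proto_b))
    return out


if __name__ == "__main__":
    # Toy prototypes: a silent cycle evolving into a cycle with samples 4 and 8.
    print(interpolate_cycles([0, 0], [4, 8], n_cycles=3))
```

The result is a quasi-periodic signal whose cycle shape evolves gradually, which is the property the abstract attributes to the smoothed transition region.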
Specification