Look-ahead pitch determination
First Claim
1. A method of pitch determination for a speech signal, said speech signal having a plurality of frames, each of said plurality of frames having a first subframe and a second subframe, said plurality of frames including a present frame, a previous frame, and a subsequent frame, wherein said present frame is between said previous frame and said subsequent frame, wherein a first subframe of said present frame is a look-ahead subframe of said previous frame, and wherein a first subframe of said subsequent frame is a look-ahead subframe of said present frame, said method comprising the steps of:
- calculating a look-ahead pitch of said look-ahead subframe of said present frame;
storing said look-ahead pitch of said look-ahead subframe of said present frame to be retrieved for calculating a pitch of a second subframe of said subsequent frame;
retrieving a look-ahead pitch of said look-ahead subframe of said previous frame; and
using said look-ahead pitch of said look-ahead subframe of said previous frame and said look-ahead pitch of said look-ahead subframe of said present frame to determine a pitch of said second subframe of said present frame;
wherein said steps of calculating, storing, retrieving and using are repeated for each of said plurality of frames.
12 Assignments
0 Petitions
Accused Products
Abstract
An encoding system is presented for coding and processing an input signal on a frame-by-frame basis. The encoding system processes each frame in two subframes of a first half and a second half. In determining the pitch of a given frame, the encoding system determines the pitch of the first half of the subsequent in a look-ahead fashion, and uses the look-ahead pitch information to estimate and correct the pitch of the second half subframe of the given frame. The encoding system also determines the pitch of the first half subframe of the given frame to further estimate and correct the pitch of the second half subframe of the given frame. The look-ahead pitch may also be used as the pitch of the first half subframe of the subsequent frame. The encoding system further calculates a normalized correlation using the pitch of the look-ahead subframe and may use the normalized correlation to correct and estimate the pitch of the second half subframe of the first frame.
39 Citations
10 Claims
-
1. A method of pitch determination for a speech signal, said speech signal having a plurality of frames, each of said plurality of frames having a first subframe and a second subframe, said plurality of frames including a present frame, a previous frame, and a subsequent frame, wherein said present frame is between said previous frame and said subsequent frame, wherein a first subframe of said present frame is a look-ahead subframe of said previous frame, and wherein a first subframe of said subsequent frame is a look-ahead subframe of said present frame, said method comprising the steps of:
-
calculating a look-ahead pitch of said look-ahead subframe of said present frame;
storing said look-ahead pitch of said look-ahead subframe of said present frame to be retrieved for calculating a pitch of a second subframe of said subsequent frame;
retrieving a look-ahead pitch of said look-ahead subframe of said previous frame; and
using said look-ahead pitch of said look-ahead subframe of said previous frame and said look-ahead pitch of said look-ahead subframe of said present frame to determine a pitch of said second subframe of said present frame;
wherein said steps of calculating, storing, retrieving and using are repeated for each of said plurality of frames. - View Dependent Claims (2, 3, 4, 5)
calculating a normalized pitch correlation of said look-ahead subframe of said present frame; and
storing said normalized pitch correlation to be retrieved for calculating said pitch of said second subframe of said subsequent frame.
-
-
3. The method of claim 2 further comprising the steps of:
-
retrieving a normalized pitch correlation of said look-ahead subframe of said previous frame; and
using said normalized pitch correlation of said look-ahead subframe of said previous frame and said normalized pitch correlation of said look-ahead subframe of said present frame to determine said pitch of said second subframe of said present frame.
-
-
4. The method of claim 1, wherein each of said plurality of subframes is about 10 milliseconds.
-
5. The method of claim 1, wherein said using determines said pitch of said second subframe of said present frame based on an overall pitch contour.
-
6. A speech coding system for encoding a speech signal, said speech signal having a plurality of frames, each of said plurality of frames having a first subframe and a second subframe, said plurality of frames including a present frame, a previous frame, and a subsequent frame, wherein said present frame is between said previous frame and said subsequent frame, wherein a first subframe of said present frame is a look-ahead subframe of said previous frame, and wherein a first subframe of said subsequent frame is a look-ahead subframe of said present frame, said system comprising:
-
a pitch estimator configured to calculate a look-ahead pitch of said look-ahead subframe of said present frame; and
a memory configured to store said look-ahead pitch of said look-ahead subframe of said present frame to be retrieved for calculating a pitch of a second subframe of said subsequent frame, said memory retaining a look-ahead pitch of said look-ahead subframe of said previous frame;
wherein said pitch estimator uses said look-ahead pitch of said look-ahead subframe of said previous frame and said look-ahead pitch of said look-ahead subframe of said present frame to determine a pitch of said second subframe of said present frame;
wherein said pitch estimator determines a pitch of said second subframe of each of said plurality of frames in the same manner as determining said pitch of said second subframe of said present frame. - View Dependent Claims (7, 8, 9, 10)
-
Specification