Pitch detection of speech signals
First Claim
1. A system for determining a pitch of speech from a speech signal, the system including:
- (1) an input device to receive the speech and generate the speech signal; and
(2) a processor structured to;
(a) distinguish the speech signal into voiced, unvoiced or silenced sections using speech signal energy levels;
(b) apply a Fourier Transform to the voiced speech signal section and obtain speech signal parameters;
(c) determine peaks of the Fourier transformed voiced speech signal section;
(d) track the speech signal parameters of the determined peaks to select partials; and
(e) determine the pitch from the selected partials using a two-way mismatch error calculation.
2 Assignments
0 Petitions
Accused Products
Abstract
Pitch detection of speech signals finds numerous applications in karaoke, voice recognition and scoring applications. While most of the existing techniques rely on time domain methods, the invention utilizes frequency domain methods. There is provided a method and system for determining the pitch of speech from a speech signal. The method includes the steps of: producing or obtaining the speech signal; distinguishing the speech signal into voiced, unvoiced or silence sections using speech signal energy levels; applying a Fourier Transform to the speech signal and obtaining speech signal parameters; determining peaks of the Fourier transformed speech signal; tracking the speech signal parameters of the determined peaks to select partials; and determining the pitch from the selected partials using a two-way mismatch error calculation.
-
Citations
41 Claims
-
1. A system for determining a pitch of speech from a speech signal, the system including:
-
(1) an input device to receive the speech and generate the speech signal; and
(2) a processor structured to;
(a) distinguish the speech signal into voiced, unvoiced or silenced sections using speech signal energy levels;
(b) apply a Fourier Transform to the voiced speech signal section and obtain speech signal parameters;
(c) determine peaks of the Fourier transformed voiced speech signal section;
(d) track the speech signal parameters of the determined peaks to select partials; and
(e) determine the pitch from the selected partials using a two-way mismatch error calculation. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of determining a pitch of speech from a speech signal, the method including the steps of:
-
producing or obtaining the speech signal;
distinguishing the speech signal into voiced, unvoiced or silenced sections using speech signal energy levels;
applying a Fourier Transform to the voiced speech signal section and obtaining speech signal parameters;
determining peaks of the Fourier transformed voiced speech signal section;
tracking the speech signal parameters of the determined peaks to select partials; and
determining the pitch from the selected partials using a two-way mismatch error calculation. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A system for determining a pitch of speech from a speech signal, the system comprising:
-
(1) an input device to receive the speech and generate the speech signal; and
(2) a processor structured to;
(a) distinguish the speech signal into voiced, unvoiced or silenced speech signal sections using speech signal energy levels;
(b) apply a windowing procedure to the voiced speech signal section to generate a frame;
(c) apply a Fourier Transform to the frame and obtain speech signal parameters;
(d) determine peaks of the Fourier transformed frame;
(e) track the speech signal parameters of the determined peaks to select partials; and
(f) determine the pitch from the selected partials using a two-way mismatch error calculation. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
-
39. A system for estimating a pitch of speech from a speech signal, the system including:
-
(1) an input device to receive the speech and produce the speech signal;
(2) a memory unit or storage unit adapted to communicate required data to a processing unit; and
(3) the processing unit operating on the speech signal and structured to;
(a) section the speech signal into voiced, unvoiced or silenced sections using speech signal energy levels;
(b) apply a Fast Fourier Transform to the voiced speech signal section and generate speech signal parameters;
(c) determine peaks of the Fourier transformed voiced speech signal section;
(d) track the speech signal parameters of the determined peaks to select partials; and
(e) calculate the pitch from the selected partials using a two-way mismatch error calculation. - View Dependent Claims (40)
-
-
41. A system for determining a pitch of speech from a speech signal, comprising:
-
means for producing or obtaining the speech signal;
means for distinguishing the speech signal into voiced, unvoiced or silenced speech signal sections using speech signal energy levels;
means for applying a Fourier Transform to the voiced speech signal section and obtaining speech signal parameters;
means for determining peaks of the Fourier transformed voiced speech signal section;
means for tracking the speech signal parameters of the determined peaks to select partials; and
means for determining the pitch from the selected partials using a two-way mismatch error calculation.
-
Specification