System for analyzing human speech

US 4,791,671 A
Filed: 01/15/1985
Issued: 12/13/1988
Est. Priority Date: 02/22/1984
Status: Expired due to Fees

First Claim

Patent Images

1. A method of analyzing human speech for determining the pitch of speech segments while using more than one pitch detection algorithm, characterized by comprising the steps of:

(a) determining an amplitude spectrum of a speech segment in a first elementary pitch meter, and determining significant peak positions in said spectrum,(b) determining an autocorrelation function and significant peak positions therein in a second elementary pitch meter,(c) utilizing said significant peak positions of the amplitude spectrum and the autocorrelation function, respectively, as input data for selecting a value for the pitch and period, respectively, and determining a sequence of consecutive integral multiples of said value, and the determination of intervals around said value and the multiples thereof, these intervals defining apertures of a mask, said apertures corresponding to harmonic multiplication factors,(d) computing a quality figure for each pitch and period, respectively, in accordance with a criterion indicating the degree to which the significant peak positions and mark apertures match,(e) repeating steps (c) and (d) for consecutive higher values of the pitch and period, respectively, up to a predetermined highest value, to provide a sequence of quality figures associated with these pitch and period values, respectively,(f) selecting a predetermined number of values of said pitch and period, respectively, having the highest quality figures,(g) converting the values for the respective periods into values for pitch, and(h) combining the predetermined numbers of selected values for pitch, and for pitch converted from period, with their associated quality figures to form an estimation of the most likely pitch.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The pitch of human speech segments is analyzed using at least two different pitch detection algorithms, a respective plurality of most likely values of pitch is selected by each of those algorithms, and these values and their respective quality figures are analyzed statistically to determine the most likely pitch. One algorithm operates in the frequency domain, by analyzing an amplitude spectrum, and the other algorithm operates in the time domain using an autocorrelation function. Significant peak positions of the amplitude spectrum or autocorrelation function are evaluated in respective harmonic sieves, to provide respective quality figures indicating the degree to which peak frequency or period periods of the spectrum or autocorrelation function output match the apertures of the harmonic sieve. A predetermined number of values of pitch, and of period, are selected having the highest quality figures. After conversion of the values for period into values of pitch, these values with their associated quality figures are analyzed statistically to form an estimation of the most likely pitch.

38 Citations

View as Search Results

8 Claims

1. A method of analyzing human speech for determining the pitch of speech segments while using more than one pitch detection algorithm, characterized by comprising the steps of:
- (a) determining an amplitude spectrum of a speech segment in a first elementary pitch meter, and determining significant peak positions in said spectrum,(b) determining an autocorrelation function and significant peak positions therein in a second elementary pitch meter,(c) utilizing said significant peak positions of the amplitude spectrum and the autocorrelation function, respectively, as input data for selecting a value for the pitch and period, respectively, and determining a sequence of consecutive integral multiples of said value, and the determination of intervals around said value and the multiples thereof, these intervals defining apertures of a mask, said apertures corresponding to harmonic multiplication factors,(d) computing a quality figure for each pitch and period, respectively, in accordance with a criterion indicating the degree to which the significant peak positions and mark apertures match,(e) repeating steps (c) and (d) for consecutive higher values of the pitch and period, respectively, up to a predetermined highest value, to provide a sequence of quality figures associated with these pitch and period values, respectively,(f) selecting a predetermined number of values of said pitch and period, respectively, having the highest quality figures,(g) converting the values for the respective periods into values for pitch, and(h) combining the predetermined numbers of selected values for pitch, and for pitch converted from period, with their associated quality figures to form an estimation of the most likely pitch.

2. An apparatus for analyzing human speech to determine a pitch of speech segments, using more than one pitch detection algorithm, comprisinga first elementary pitch meter, operating in the frequency domain, for determining a first plurality of significant peak frequencies in a speech segment,means for computing a quality figure for each of said significant peak frequencies,a second elementary pitch meter, operating in the time domain, for determining significant peak periods of said segment,means for computing a quality figure for each of said significant peak periods,means for determining period-derived frequencies corresponding to said significant peak periods,means for selecting a predetermined number of values of said significant peak frequencies and said significant peak period-derived frequencies, respectively having the highest quality figures, andcombining said selected values of frequency and period-derived frequency with the associated quality figures to form an estimate of the most likely pitch.
- View Dependent Claims (3, 4, 5, 6, 7, 8)
- - 3. An apparatus as claimed in claim 2, characterized in that said first and second means for computing a quality figure each comprise a respective harmonic sieve for selecting a value for pitch, based respectively on frequency or period, and determining a sequence of consecutive integral multiples of this value and intervals around this value and the multiples thereof, these intervals defining apertures of a mask, said apertures corresponding to harmonic multiplication factors;
    - and computing a quality figure in accordance with a criterion indicating the degree to which significant peak positions and the mask apertures match; and
      
      repeating said selecting and computing steps for consecutively higher values of pitch up to a predetermined higher value.
  - 4. An apparatus as claimed in claim 3, characterized in that said first elementary pitch meter comprises windowing means for determining an amplitude spectrum of said speech segment, means for computing a Fourier transform of said amplitude spectrum, and means for determining the significant peak positions in said amplitude spectrum.
  - 5. An apparatus as claimed in claim 4, characterized in that said second elementary time meter comprises means for determining significant peak positions in an autocorrelation function of said segment, said significant peak positions corresponding to said significant peak periods.
  - 6. An apparatus as claimed in claim 5, characterized in that said means for combining selects one of said frequencies as the most likely pitch.
  - 7. An apparatus as claimed in claim 3, characterized in that said means for combining selects one of said frequencies as the most likely pitch.
  - 8. An apparatus as claimed in claim 2, characterized in that said means for combining selects one of said frequencies as the most likely pitch.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
US Philips Corporation (Koninklijke Philips N.V.)
Original Assignee
US Philips Corporation (Koninklijke Philips N.V.)
Inventors
Willems, Leonardus F.
Primary Examiner(s)
Kemeny, Emanuel S.

Application Number

US06/691,594
Time in Patent Office

1,428 Days
Field of Search

381/29-31, 381/38, 381/41, 381/47, 381/49, 364/513.5
US Class Current

704/217
CPC Class Codes

G10L 25/90 Pitch determination of spee...

System for analyzing human speech

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

38 Citations

8 Claims

Specification

Solutions

Use Cases

Quick Links

System for analyzing human speech

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

38 Citations

8 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links