Apparatus and method for creating pitch wave signals and apparatus and method for compressing, expanding and synthesizing speech signals using these pitch wave signals
Abstract
A pitch wave signal creation method is provided as a preliminary process for efficiently coding a speech wave signal whose pitch period fluctuates. A speech signal compressing/expanding apparatus and a speech signal synthesizing apparatus using the method, and the associated signal processing, are also provided. The pitch wave signal creation method of the invention essentially comprises a process of detecting the instantaneous pitch period of each pitch wave element of the speech wave signal, and a process of converting each pitch wave element into a normalized pitch wave element having a predetermined fixed time length by expanding or compressing the pitch wave element on the time axis, based on its detected instantaneous pitch period, while retaining its wave pattern. A speech signal with pitch fluctuation can be compressed with high quality and high efficiency by coding or synthesizing the speech wave signal using the pitch wave signal creation method of the invention.
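The normalization step described in the abstract can be illustrated with a minimal sketch. It assumes the pitch section boundaries have already been detected and are passed in as sample indices, and it uses plain linear interpolation for the time-axis expansion and compression. The function names, the `boundaries` argument, and the choice of interpolation are illustrative assumptions and are not taken from the patent.

```python
def resample_linear(segment, target_len):
    """Stretch or squeeze one pitch section to target_len samples.

    Linear interpolation is an assumption made for this sketch; the abstract
    only requires that the wave pattern be retained.
    """
    n = len(segment)
    if n == 0 or target_len < 1:
        raise ValueError("segment must be non-empty and target_len >= 1")
    if n == 1 or target_len == 1:
        return [float(segment[0])] * target_len
    scale = (n - 1) / (target_len - 1)
    out = []
    for i in range(target_len):
        pos = i * scale               # fractional position in the source section
        lo = int(pos)
        hi = min(lo + 1, n - 1)       # clamp at the last source sample
        frac = pos - lo
        out.append(segment[lo] * (1.0 - frac) + segment[hi] * frac)
    return out


def normalize_pitch_sections(signal, boundaries, target_len):
    """Return (normalized_sections, original_periods).

    boundaries: hypothetical list of sample indices marking the start of each
    pitch section, followed by the end index of the last section.
    """
    normalized = []
    periods = []
    for start, end in zip(boundaries, boundaries[1:]):
        section = signal[start:end]
        periods.append(len(section))  # keep the original length as pitch information
        normalized.append(resample_linear(section, target_len))
    return normalized, periods
```

Keeping the original section lengths alongside the normalized sections matters because the claims later restore each section to the time length given by the pitch information.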
3 Claims
1. A speech synthesizing apparatus, the apparatus comprising:
storage means for storing rhythm information representing the rhythm of a sample of unit speech sound, pitch information representing the pitch of the sample, and spectrum information showing variation with time in the fundamental frequency component and harmonic wave component of a pitch wave signal created by making substantially identical the time lengths of sections each equivalent to the unit pitch of a speech signal representing the wave of the sample with such information brought into correspondence with the sample;
prediction means for inputting text information representing a text, and creating prediction information representing the result of predicting the pitch and spectrum of a unit speech sound constituting the text based on the text information;
retrieval means for identifying a sample having a pitch and spectrum having the highest correlation with the pitch and spectrum of the unit speech sound constituting the text based on the pitch information, spectrum information and prediction information; and
signal synthesizing means for creating a synthesized speech signal representing a speech sound in which the speech sound has a rhythm represented by the rhythm information brought into correspondence with the sample identified by the retrieval means, the variation with time in the fundamental frequency component and harmonic wave component is represented by the spectrum information brought into correspondence with the sample identified by the retrieval means, and the time length of the section equivalent to the unit pitch is a time length represented by the pitch information brought into correspondence with the sample identified by the retrieval means.
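The retrieval element of claim 1 selects the stored sample whose pitch and spectrum correlate most highly with the predicted values. The following is a minimal sketch of one way this could be done. It assumes each stored sample and the prediction are represented as equal-length numeric vectors, uses the Pearson correlation coefficient, and combines the pitch and spectrum correlations by simple addition. The claim does not specify the correlation measure or how the two are combined, so these choices, along with the `database` layout and function names, are illustrative assumptions.

```python
import math


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("sequences must have equal length >= 2")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a constant sequence carries no correlation information
    return cov / math.sqrt(var_a * var_b)


def retrieve_best_sample(database, predicted_pitch, predicted_spectrum):
    """Return (key, score) of the stored sample with the highest combined score.

    database: hypothetical mapping of sample key -> {"pitch": [...], "spectrum": [...]}.
    Summing the two correlations is an assumption made for this sketch.
    """
    best_key, best_score = None, -math.inf
    for key, entry in database.items():
        score = (pearson(entry["pitch"], predicted_pitch)
                 + pearson(entry["spectrum"], predicted_spectrum))
        if score > best_score:
            best_key, best_score = key, score
    return best_key, best_score
```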
3. A speech synthesis method, wherein rhythm information representing the rhythm of a sample of unit speech sound, pitch information representing the pitch of the sample, and spectrum information showing variation with time in the fundamental frequency component and harmonic wave component of a pitch wave signal created by making substantially identical the time lengths of sections each equivalent to the unit pitch of a speech signal representing the wave of the sample are stored with such information brought into correspondence with the sample;
text information representing a text is inputted, and prediction information representing the result of predicting the pitch and spectrum of a unit speech sound constituting the text is created based on the text information;
a sample having a pitch and spectrum having the highest correlation with the pitch and spectrum of the unit speech sound constituting the text is identified based on the pitch information, spectrum information and prediction information; and
a synthesized speech signal representing a speech sound in which the speech sound has a rhythm represented by the rhythm information brought into correspondence with the identified sample, the variation with time in the fundamental frequency component and harmonic wave component is represented by the spectrum information brought into correspondence with the sample identified by the retrieval means, and the time length of the section equivalent to the unit pitch is a time length represented by the pitch information brought into correspondence with the sample identified by the retrieval means is created.
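The final synthesis step of claim 3 can be sketched as follows, under simplifying assumptions. The spectrum information is taken to be, for each pitch section, a list of amplitudes for the fundamental and its harmonics. The pitch information is taken to be the length of each section in samples. Phase is ignored (zero-phase cosines), and rhythm information is not modeled. All of these representations and the function names are illustrative assumptions and are not drawn from the patent.

```python
import math


def synthesize_section(harmonic_amps, period):
    """Generate one pitch section of `period` samples.

    harmonic_amps[k - 1] is the assumed amplitude of the k-th harmonic
    (k = 1 is the fundamental). Zero phase is an assumption of this sketch.
    """
    if period < 1:
        raise ValueError("period must be >= 1")
    return [
        sum(amp * math.cos(2.0 * math.pi * k * n / period)
            for k, amp in enumerate(harmonic_amps, start=1))
        for n in range(period)
    ]


def synthesize(spectrum_info, pitch_info):
    """Concatenate sections so each has the time length given by pitch_info.

    spectrum_info: per-section harmonic amplitudes (variation over time).
    pitch_info: per-section length in samples.
    """
    if len(spectrum_info) != len(pitch_info):
        raise ValueError("need one period per spectrum frame")
    out = []
    for amps, period in zip(spectrum_info, pitch_info):
        out.extend(synthesize_section(amps, period))
    return out
```

Because each section is generated directly at its target length, the time length of every pitch section in the output equals the value carried by the pitch information, while the harmonic amplitudes change from section to section as the spectrum information dictates.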
Specification