Speech signal modification and concatenation method by gradually changing speech parameters
First Claim
1. A speech signal modification and concatenation method for concatenating two spoken speech signals having different speaker individuality, each spoken speech signal consisting of a plurality of phonemes and communicating a predetermined message including a plurality of words, said method comprising the step of:
- concatenating the speech signals by modifying a parameter indicating a characteristic of the speech signals in a manner such that the parameter is gradually changed from a value indicating a feature of one of the speech signals to a value indicating a feature of the other speech signal over a predetermined period, the concatenated signal having a first section corresponding to the one of the speech signals, a second section corresponding to said predetermined period, and a third section corresponding to the other speech signal, wherein a listener listening to a spoken message including a plurality of words hears said first, second, and third sections in turn.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech signal modification and concatenation method is provided, in which spoken messages having different voice characteristics can be concatenated without causing a sense of incompatibility, and it is possible to efficiently perform addition or modification of spoken messages. In the speech signal modification and concatenation method, when two speech signals having different voice characteristics are concatenated, the speech signals are concatenated by modifying a parameter indicating a character of speech signals in a manner such that the parameter is gradually changed from a value indicating a feature of one of the speech signals to a value indicating a feature of the other speech signal over a predetermined period. Accordingly, a time-scaled change of a feature amount of spoken sounds can be performed; thus, even if two speech signals of different speakers are concatenated, it is possible to avoid an abrupt change of voice characteristics in the concatenation section, and thus possible to concatenate speech signals without causing a sense of incompatibility to listeners.
46 Citations
12 Claims
-
1. A speech signal modification and concatenation method for concatenating two spoken speech signals having different speaker individuality, each spoken speech signal consisting of a plurality of phonemes and communicating a predetermined message including a plurality of words, said method comprising the step of:
-
concatenating the speech signals by modifying a parameter indicating a characteristic of the speech signals in a manner such that the parameter is gradually changed from a value indicating a feature of one of the speech signals to a value indicating a feature of the other speech signal over a predetermined period, the concatenated signal having a first section corresponding to the one of the speech signals, a second section corresponding to said predetermined period, and a third section corresponding to the other speech signal, wherein a listener listening to a spoken message including a plurality of words hears said first, second, and third sections in turn. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
in a phoneme which corresponds to the two speech signals, determining each pitch correspondence between the two signals;
generating a spectrum, for every corresponding pitch, by combining, with respect to a boundary frequency, a portion above the boundary frequency among the spectrum of one speech signal and a portion below the boundary frequency among the spectrum of the other speech signal, and determining the generated spectrum as a spectrum at the relevant pitch; and
with respect to the generation of spectra, changing the boundary frequency for each unit time.
-
-
7. A speech signal modification and concatenation method as claimed in claim 6, wherein the change of the boundary frequency is performed such that the boundary frequency increases by a fixed amount for each unit time.
-
8. A speech signal modification and concatenation method as claimed in claim 6, wherein the change of the boundary frequency is performed such that:
-
the boundary frequency gradually increases from a value at the start of change to a value at the end of change; and
the rate of change is lower in a stage of relatively low boundary frequencies near the start of change, while the rate of change is higher in a stage of relatively high boundary frequencies near the end of change.
-
-
9. A speech signal modification and concatenation method as claimed in claim 1, wherein the parameter is a fundamental frequency of spoken sounds, and the fundamental frequency is gradually changed in the predetermined period.
-
10. A speech signal modification and concatenation method as claimed in claim 9, wherein the change of the fundamental frequency comprises the steps of:
-
calculating an average fundamental frequency of each speech signal;
determining a frequency value to be changed per unit time for the fundamental frequency, based on the difference between the two average fundamental frequencies and the predetermined period for the change of the parameter; and
with the determined value as a unit of the amount of change, changing the fundamental frequency for each unit time such that the fundamental frequency is modified from the average fundamental frequency of one speech signal to that of the other speech signal.
-
-
11. A speech signal modification and concatenation method as claimed in claim 1, wherein each of a spectrum of spoken sounds and a fundamental frequency of spoken sounds is used as the parameter, and:
-
the change of the spectrum comprises the steps of;
in a phoneme which corresponds to the two speech signals, determining each pitch correspondence between the two signals;
generating a spectrum, for every corresponding pitch, by combining, with respect to a boundary frequency, a portion above the boundary frequency among the spectrum of one speech signal and a portion below the boundary frequency among the spectrum of the other speech signal, and determining the generated spectrum as a spectrum at the relevant pitch; and
with respect to the generation of spectra, changing the boundary frequency for each unit time, and the change of the fundamental frequency comprises the steps of;
calculating an average fundamental frequency of each speech signal;
determining a frequency value to be changed per unit time for the fundamental frequency, based on the difference between the two average fundamental frequencies and the predetermined period for the change of the parameter; and
with the determined value as a unit of the amount of change, changing the fundamental frequency for each unit time such that the fundamental frequency is modified from the average fundamental frequency of one speech signal to that of the other speech signal.
-
-
12. A speech signal modification and concatenation method as claimed in claim 11, wherein the spectrum of spoken sounds and the fundamental frequency of spoken sounds are changed in parallel.
Specification