Voice signal conversation method and system
First Claim
1. A method of converting a voice signal as spoken by a source speaker into a converted voice signal the acoustic characteristics thereof resemble those of a target speaker, the method comprising:
- a determination step of determining a function for transforming acoustic characteristics of the source speaker into acoustic characteristics close to those of the target speaker on the basis of samples of the voices of the source and target speakers, anda transformation step of transforming acoustic characteristics of the source speaker voice signal to be converted by applying said transformation function,wherein said determination step comprises a step of determining a function for conjoint transformation of characteristics of the source speaker relating to the spectral envelope and of characteristics of the source speaker relating to the pitch and wherein said transformation step comprises applying said conjoint transformation function,wherein said step of determining a conjoint transformation function comprises,a step of analyzing source and target speaker voice samples grouped into frames to obtain for each frame information relating to the spectral envelope and to the pitch,a step of concatenating information relating to the spectral envelope and information relating to the pitch for each of the source and target speakers,a step of determining a model representing common acoustic characteristics of source speaker and target speaker voice samples, anda step of determining said conjoint transformation function from said model and the voice samples, andwherein said steps of analyzing the source and target speaker voice samples are adapted to produce said information relating to the spectral envelope in the form of cepstral coefficients.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of converting a voice signal spoken by a source speaker into a converted voice signal having acoustic characteristics that resemble those of a target speaker. The method includes the following steps: determining (1) at least one function for the transformation of the acoustic characteristics of the source speaker into acoustic characteristics similar to those of the target speaker; and transforming the acoustic characteristics of the voice signal to be converted using the at least one transformation function. The method is characterized in that: (i) the aforementioned transformation function-determining step (1) consists in determining (1) a function for the joint transformation of characteristics relating to the spectral envelope and characteristics relating to the fundamental frequency of the source speaker; and (ii) the transformation includes the application of the joint transformation function.
38 Citations
16 Claims
-
1. A method of converting a voice signal as spoken by a source speaker into a converted voice signal the acoustic characteristics thereof resemble those of a target speaker, the method comprising:
-
a determination step of determining a function for transforming acoustic characteristics of the source speaker into acoustic characteristics close to those of the target speaker on the basis of samples of the voices of the source and target speakers, and a transformation step of transforming acoustic characteristics of the source speaker voice signal to be converted by applying said transformation function, wherein said determination step comprises a step of determining a function for conjoint transformation of characteristics of the source speaker relating to the spectral envelope and of characteristics of the source speaker relating to the pitch and wherein said transformation step comprises applying said conjoint transformation function, wherein said step of determining a conjoint transformation function comprises, a step of analyzing source and target speaker voice samples grouped into frames to obtain for each frame information relating to the spectral envelope and to the pitch, a step of concatenating information relating to the spectral envelope and information relating to the pitch for each of the source and target speakers, a step of determining a model representing common acoustic characteristics of source speaker and target speaker voice samples, and a step of determining said conjoint transformation function from said model and the voice samples, and wherein said steps of analyzing the source and target speaker voice samples are adapted to produce said information relating to the spectral envelope in the form of cepstral coefficients. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system for converting a voice signal as spoken by a source speaker into a converted voice signal the acoustic characteristics thereof resemble ones of a target speaker, the system comprising:
-
means for determining at least one function for transforming acoustic characteristics of the source speaker into acoustic characteristics similar to ones of the target speaker on the basis of voice samples as spoken by the source and target speakers; means for transforming acoustic characteristics of the source speaker voice signal to be converted by applying said transformation function, wherein said means for determining at least one transformation function comprise a unit for determining a function for conjoint transformation of characteristics of the source speaker relating to the spectral envelope and of characteristics of the source speaker relating to the pitch and wherein said transformation means include for applying said conjoint transformation function; means for analyzing the voice signal to be converted, adapted to produce information relating to the spectral envelope in the form of cepstral coefficients and relating to the pitch of the voice signal to be converted; and synthesizer means for forming a converted voice signal from at least said spectral envelope and pitch information transformed simultaneously. - View Dependent Claims (16)
-
Specification