Voice conversion method and system
First Claim
1. A voice conversion method comprising:
performing speech analysis on speech of a source speaker to attain speech information comprising a first spectrum;
converting the first spectrum to a second spectrum, wherein converting the first spectrum to the second spectrum comprises compensating for at least one spectral difference between the speech of the source speaker and speech of a target speaker;
in response to converting the first spectrum to the second spectrum, generating a third spectrum, wherein generating the third spectrum comprises selecting, based on at least the second spectrum, at least one speech unit from a corpus comprising a plurality of speech units of the target speaker;
generating a replaced spectrum by replacing at least part of the second spectrum with at least part of the third spectrum; and
performing speech reconstruction based at least on the replaced spectrum.
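The middle three steps of claim 1 (conversion, unit selection, replacement) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: spectra are short lists of per-frame values, the conversion is a fixed per-bin offset standing in for a trained conversion function, unit selection is a nearest-neighbour search by Euclidean distance, and the replaced part is the upper bins of each frame. The function names are hypothetical. Speech analysis and reconstruction are omitted.

```python
from math import dist

# Hypothetical helpers; each spectrum is a list of frames, each frame a list of floats.

def convert_spectrum(first_spectrum, offset):
    """Compensate a source/target spectral difference with a fixed per-bin offset
    (an assumed stand-in for a trained conversion function)."""
    return [[v + o for v, o in zip(frame, offset)] for frame in first_spectrum]

def select_units(second_spectrum, corpus):
    """For each converted frame, pick the closest target-speaker unit from the corpus,
    using the converted frame as the selection target (Euclidean distance, assumed)."""
    return [min(corpus, key=lambda unit: dist(unit, frame)) for frame in second_spectrum]

def replace_spectrum(second_spectrum, third_spectrum, start_bin):
    """Keep the converted bins below start_bin and take the rest from the selected unit.
    Which part is replaced is an assumption; the claim only says 'at least part'."""
    return [conv[:start_bin] + sel[start_bin:]
            for conv, sel in zip(second_spectrum, third_spectrum)]

# Toy data: two source frames and a three-unit target-speaker corpus.
first = [[1.0, 2.0, 3.0], [2.0, 1.0, 0.5]]
corpus = [[1.5, 2.4, 3.9], [2.6, 1.4, 0.2], [0.0, 0.0, 0.0]]
offset = [0.5, 0.5, 0.5]

second = convert_spectrum(first, offset)       # second spectrum
third = select_units(second, corpus)           # third spectrum
replaced = replace_spectrum(second, third, 1)  # replaced spectrum
print(replaced)  # [[1.5, 2.4, 3.9], [2.5, 1.4, 0.2]]
```

In this sketch the lowest bin of each frame comes from the converted (second) spectrum and the remaining bins come from real target-speaker units, which is one way to read "replacing at least part of the second spectrum with at least part of the third spectrum".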
Abstract
A method, system and computer program product for voice conversion. The method includes performing speech analysis on the speech of a source speaker to achieve speech information; performing spectral conversion based on said speech information, to at least achieve a first spectrum similar to the speech of a target speaker; performing unit selection on the speech of said target speaker at least using said first spectrum as a target; replacing at least part of said first spectrum with the spectrum of the selected target speaker's speech unit; and performing speech reconstruction at least based on the replaced spectrum.
31 Claims
1. A voice conversion method comprising:
performing speech analysis on speech of a source speaker to attain speech information comprising a first spectrum;
converting the first spectrum to a second spectrum, wherein converting the first spectrum to the second spectrum comprises compensating for at least one spectral difference between the speech of the source speaker and speech of a target speaker;
in response to converting the first spectrum to the second spectrum, generating a third spectrum, wherein generating the third spectrum comprises selecting, based on at least the second spectrum, at least one speech unit from a corpus comprising a plurality of speech units of the target speaker;
generating a replaced spectrum by replacing at least part of the second spectrum with at least part of the third spectrum; and
performing speech reconstruction based at least on the replaced spectrum.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A voice conversion system comprising:
speech analysis means for performing speech analysis on speech of a source speaker to attain speech information comprising a first spectrum;
spectral conversion means for converting the first spectrum to a second spectrum, wherein converting the first spectrum to the second spectrum comprises compensating for at least one spectral difference between the speech of the source speaker and speech of a target speaker;
unit selection means for, in response to the converting of the first spectrum to the second spectrum, generating a third spectrum, wherein generating the third spectrum comprises selecting, based on at least the second spectrum, at least one speech unit from a corpus comprising a plurality of speech units of the target speaker;
spectrum replacement means for generating a replaced spectrum by replacing at least part of said second spectrum with at least part of the third spectrum; and
speech reconstruction means for performing speech reconstruction based at least on the replaced spectrum.
Dependent claims: 10, 11, 12, 13, 14, 15, 16
17. A computer readable storage device comprising computer readable instructions which, when executed by at least one processor, cause performance of a voice conversion method comprising:
performing speech analysis on speech of a source speaker to attain speech information comprising a first spectrum;
converting the first spectrum to a second spectrum, wherein converting the first spectrum to the second spectrum comprises compensating for at least one spectral difference between the speech of the source speaker and speech of a target speaker;
in response to converting the first spectrum to the second spectrum, generating a third spectrum, wherein generating the third spectrum comprises selecting, based on at least the second spectrum, at least one speech unit from a corpus comprising a plurality of speech units of the target speaker;
generating a replaced spectrum by replacing at least part of the second spectrum with at least part of the third spectrum; and
performing speech reconstruction based at least on the replaced spectrum.
Dependent claims: 18, 19, 20, 21, 22, 23
24. A voice conversion system comprising:
a speech analyzer configured to perform speech analysis on speech of a source speaker to attain speech information comprising a first spectrum;
a spectral converter configured to convert the first spectrum to a second spectrum, wherein converting the first spectrum to the second spectrum comprises compensating for at least one spectral difference between the speech of the source speaker and speech of a target speaker;
a unit selector configured to, in response to conversion of the first spectrum to the second spectrum, generate a third spectrum, wherein generating the third spectrum comprises selecting, based on at least the second spectrum, at least one speech unit from a corpus comprising a plurality of speech units of the target speaker;
a spectrum generator configured to generate a replaced spectrum by replacing at least part of said second spectrum with at least part of the third spectrum; and
a speech reconstructor configured to perform speech reconstruction based at least on the replaced spectrum.
Dependent claims: 25, 26, 27, 28, 29, 30, 31
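The five components recited in claim 24 and the order in which data flows between them can be sketched as a small container of pluggable callables. This is an illustrative assumption, not the patent's implementation: the class name, field names, and the toy lambda stand-ins are all hypothetical, and real components would operate on actual spectral features rather than single numbers.

```python
from dataclasses import dataclass
from typing import Callable, List

Spectrum = List[List[float]]  # assumed representation: a list of frames


@dataclass
class VoiceConversionSystem:
    """Hypothetical wiring of the five claimed components."""
    analyze: Callable[[List[float]], Spectrum]          # speech analyzer
    convert: Callable[[Spectrum], Spectrum]             # spectral converter
    select: Callable[[Spectrum], Spectrum]              # unit selector
    replace: Callable[[Spectrum, Spectrum], Spectrum]   # spectrum generator
    reconstruct: Callable[[Spectrum], List[float]]      # speech reconstructor

    def run(self, source_speech: List[float]) -> List[float]:
        first = self.analyze(source_speech)     # first spectrum
        second = self.convert(first)            # second spectrum
        third = self.select(second)             # third spectrum, selected using the second
        replaced = self.replace(second, third)  # replaced spectrum
        return self.reconstruct(replaced)


# Toy stand-ins, chosen only to make the data flow visible.
system = VoiceConversionSystem(
    analyze=lambda samples: [[s] for s in samples],
    convert=lambda spec: [[v * 2 for v in frame] for frame in spec],
    select=lambda spec: [[float(round(v)) for v in frame] for frame in spec],
    replace=lambda second, third: third,  # replaces the whole frame here
    reconstruct=lambda spec: [frame[0] for frame in spec],
)
output = system.run([0.2, 0.7])
print(output)  # [0.0, 1.0]
```

Note that the unit selector receives the second spectrum, not the first, which reflects the claim's requirement that selection happens in response to, and based on, the converted spectrum.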
Specification