Voice synthesizing apparatus using database having different pitches for each phoneme represented by same phoneme symbol
First Claim
Patent Images
1. A voice synthesizing apparatus comprising:
- a timbre storing device that stores voice feature parameters of a plurality of phoneme, each parameter having a plurality of different pitches for each phoneme represented by a same phoneme symbol and being indexed by a phoneme name and a pitch;
a phoneme template storing device that stores a plurality of templates each having a sequence of feature parameters disposed at a predetermined time interval and being indexed by a phoneme name and a pitch, the templates including a stationary template derived from voices having stable phonemes and an articulation template derived from voices in a concatenated part of the phonemes;
a note template storing device that stores a plurality of templates each having a sequence of feature parameters disposed at a predetermined time interval and being indexed by a phoneme name and a pitch, the templates including at least a note attack template having feature parameters in a voice rising part and a note-to-note template having feature parameters in a pitch changing part;
a reading device that reads the feature parameter from the timbre storing device and the templates from the phoneme template storing device and the note template storing device by using information regarding the phoneme and a pitch of a voice to be synthesized changing over time as indices; and
a voice synthesizer that synthesizes a voice in accordance with the read feature parameter added with the templates read from the phoneme template storing device and the note template storing device.
1 Assignment
0 Petitions
Accused Products
Abstract
A voice synthesizing apparatus comprises: a memory that stores phoneme pieces having a plurality of different pitches for each phoneme represented by a same phoneme symbol; a reading device that reads a phoneme piece by using a pitch as an index; and a voice synthesizer that synthesizes a voice in accordance with the read phoneme piece.
-
Citations
10 Claims
-
1. A voice synthesizing apparatus comprising:
-
a timbre storing device that stores voice feature parameters of a plurality of phoneme, each parameter having a plurality of different pitches for each phoneme represented by a same phoneme symbol and being indexed by a phoneme name and a pitch; a phoneme template storing device that stores a plurality of templates each having a sequence of feature parameters disposed at a predetermined time interval and being indexed by a phoneme name and a pitch, the templates including a stationary template derived from voices having stable phonemes and an articulation template derived from voices in a concatenated part of the phonemes; a note template storing device that stores a plurality of templates each having a sequence of feature parameters disposed at a predetermined time interval and being indexed by a phoneme name and a pitch, the templates including at least a note attack template having feature parameters in a voice rising part and a note-to-note template having feature parameters in a pitch changing part; a reading device that reads the feature parameter from the timbre storing device and the templates from the phoneme template storing device and the note template storing device by using information regarding the phoneme and a pitch of a voice to be synthesized changing over time as indices; and a voice synthesizer that synthesizes a voice in accordance with the read feature parameter added with the templates read from the phoneme template storing device and the note template storing device. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A voice synthesizing method comprising:
-
reading a feature parameter, by using as indices information regarding a phoneme and a pitch of a voice to be synthesized, from a timbre storing means which stores voice feature parameters of a plurality of phoneme, each parameter having a plurality of different pitches for each phoneme represented by a same phoneme symbol, and the feature parameter being indexed by a phoneme name and a pitch; reading a template, by using as indices information regarding a phoneme and a pitch of a voice to be synthesized changing over time, from a phoneme template storing means which stores a plurality of templates, each template having a sequence of feature parameters disposed at a predetermined time interval and being indexed by a phoneme name and a pitch, the templates including a stationary template derived from voices having stable phonemes and an articulation template derived from voices in a concatenated part of the phonemes; reading a template, by using as indices information regarding a phoneme and a pitch of a voice to be synthesized, from a note template storing means which stores a plurality of templates, each template having a sequence of feature parameters disposed at a predetermined time interval and being indexed by a phoneme name and a pitch, the templates including at least a note attack template having feature parameters in a voice rising part and a note-to-note template having feature parameters in a pitch changing part; and synthesizing a voice in accordance with the read feature parameter added with the templates read from the phoneme template storing means and the note template storing means.
-
-
10. A computer-readable storage medium having encoded thereon, program code including instructions which when executed cause:
-
reading a feature parameter, by using as indices information regarding a phoneme and a pitch of a voice to be synthesized, from a timbre storing means which stores voice feature parameters of a plurality of phoneme, each parameter having a plurality of different pitches for each phoneme represented by a same phoneme symbol, and the feature parameter being indexed by a phoneme name and a pitch; reading a template, by using as indices information regarding a phoneme and a pitch of a voice to be synthesized changing over time, from a phoneme template storing means which stores a plurality of templates, each template having a sequence of feature parameters disposed at a predetermined time interval and being indexed by a phoneme name and a pitch, the templates including a stationary template derived from voices having stable phonemes and an articulation template derived from voices in a concatenated part of the phonemes; reading a template, by using as indices information regarding a phoneme and a pitch of a voice to be synthesized, from a note template storing means which stores a plurality of templates, each template having a sequence of feature parameters disposed at a predetermined time interval and being indexed by a phoneme name and a pitch, the templates including at least a note attack template having feature parameters in a voice rising part and a note-to-note template having feature parameters in a pitch changing part; and synthesizing a voice in accordance with the read feature parameter added with the templates read from the phoneme template storing means and the note template storing means.
-
Specification