Voice synthesizing system, segment generation apparatus for generating segments for voice synthesis, voice synthesizing method and storage medium storing program therefor
First Claim
1. A voice synthesizing system synthesizing a predetermined voice waveform by overlaying a plurality of voice waveform segments in a waveform concatenation method, comprising:
- a compressed pitch segment database storing respective voice waveform segments compressed per pitch unit;
a pitch developing portion reading out compressed data of the voice waveform segment from said compressed pitch segment database and decompressing the read out compressed data for reproducing an original voice waveform segment when the voice waveform segment necessary for voice waveform synthesis is demanded;
a cache processing portion temporarily storing the voice waveform segment already used in voice waveform synthesis, and when voice waveform segment necessary for voice waveform synthesis is demanded, returning demanded voice waveform segment to a demander when demanded voice waveform segment is already stored, and obtaining the voice waveform segment from said compressed pitch segment database via said pitch developing portion to hold the obtained voice waveform segment and conjunction therewith to return to the demander when demanded voice waveform segment is not stored.
1 Assignment
0 Petitions
Accused Products
Abstract
A voice synthesizing system can make necessary calculation amount satisfactorily small and can make necessary file size small. The system includes a compressed pitch segment database storing compressed voice waveform segments, a pitch developing portion reading out the voice waveform segment from the database and decompressing the compressed data for reproducing an original voice waveform segment when the voice waveform segment necessary for voice waveform synthesis is demanded, and a cache processing portion temporarily storing the voice waveform segment already used in voice waveform synthesis, and when voice waveform segment necessary for voice waveform synthesis is demanded, returning demanded voice waveform segment to a demander when demanded voice waveform segment is already stored, and obtaining the voice waveform segment from the database via the pitch developing portion to hold the obtained voice waveform segment and return to the demander when demanded voice waveform segment is not stored.
-
Citations
37 Claims
-
1. A voice synthesizing system synthesizing a predetermined voice waveform by overlaying a plurality of voice waveform segments in a waveform concatenation method, comprising:
-
a compressed pitch segment database storing respective voice waveform segments compressed per pitch unit; a pitch developing portion reading out compressed data of the voice waveform segment from said compressed pitch segment database and decompressing the read out compressed data for reproducing an original voice waveform segment when the voice waveform segment necessary for voice waveform synthesis is demanded; a cache processing portion temporarily storing the voice waveform segment already used in voice waveform synthesis, and when voice waveform segment necessary for voice waveform synthesis is demanded, returning demanded voice waveform segment to a demander when demanded voice waveform segment is already stored, and obtaining the voice waveform segment from said compressed pitch segment database via said pitch developing portion to hold the obtained voice waveform segment and conjunction therewith to return to the demander when demanded voice waveform segment is not stored. - View Dependent Claims (2, 3, 4)
-
-
5. A voice waveform segment generating apparatus for voice synthesis extracting a plurality of voice waveform segments from a voice waveform of an original human speech and generating information for selecting voice waveform segment necessary for voice synthesis among extracted voice waveform segments, comprising:
-
a sequential representative pitch segment determining portion selecting a range where voice waveform segments are regarded as the same voice waveform segment in a sequential zone and selecting representative voice waveform segment among voice waveform segments in said range; a pitch segment registering portion storing said representative waveform segment and the voice waveform segments out of said range in a database in compressed form; and a continuity table generating portion calculating number of sequential voice waveform segments in said range and amplitude multiplying factor per voice waveform segment with respect to said voice waveform segment and storing in a storage device in a form of table. - View Dependent Claims (6, 7, 8, 9)
-
-
10. A voice waveform segment generating apparatus for voice synthesis extracting a plurality of voice waveform segments from a voice waveform of an original human speech and generating information for selecting voice waveform segment necessary for voice synthesis among extracted voice waveform segments, comprising:
-
a representative pitch segment determining portion selecting a set of voice waveform segments which can be regarded as the same voice waveform and selecting representative voice waveform segment among voice waveform segments in said set; a pitch segment registering portion storing said representative waveform segment and the voice waveform segments out of said set in a database in compressed form; and a pitch index table generating portion calculating amplitude multiplying factor per each voice waveform segment in said set with respect to said representative voice waveform segments and number of samples for shifting the voice waveform segment in time direction, and storing in a storage device in a form of table. - View Dependent Claims (11)
-
-
12. A voice waveform segment generating apparatus for voice synthesis extracting a plurality of voice waveform segments from a voice waveform of an original human speech and generating information for selecting voice waveform segment necessary for voice synthesis among extracted voice waveform segments, comprising:
-
a sequential representative pitch segment determining portion selecting a range where voice waveform segments are regarded as the same voice waveform segment in a sequential zone and selecting representative voice waveform segment among voice waveform segments in said range; a representative pitch segment determining portion selecting a set of voice waveform segments which can be regarded as the same voice waveform with respect to the result of selection by said sequential representative pitch segment determining portion and selecting representative voice waveform segment among voice waveform segments in said set; a pitch segment registering portion storing said representative waveform segment and the voice waveform segments out of said set in a database in compressed form; a continuity table generating portion calculating number of voice waveform segments in said range and amplitude multiplying factor per voice waveform segment with respect to said voice waveform segment and storing in a storage device in a form of table; and a pitch index table generating portion calculating amplitude multiplying factor per each voice waveform segment in said set with respect to said representative voice waveform segments and number of samples for shifting the voice waveform segment in time direction, and storing in a storage device in a form of table. - View Dependent Claims (13)
-
-
14. A voice synthesizing method for synthesizing a desired voice waveform by overlaying a plurality of voice waveform segments in waveform concatenation method, comprising the steps of:
-
preliminarily storing compressed voice waveform segments in a database; returning the voice waveform segment to a demander when the voice waveform segment necessary for voice waveform synthesis is demanded and if the demanded voice waveform segment is already stored in a cache memory; reading out the compressed data of the voice waveform segment from said database storing the compressed data of the voice waveform segments and reproducing an original voice waveform segment by decompressing the read out compressed data if the demanded voice waveform segment is not stored in a cache memory; and storing the reproduced voice waveform segment in said cache memory and returning to said demander. - View Dependent Claims (15, 16)
-
-
17. A voice synthesizing segment generating method extracting a plurality of voice waveform segments from an originally spoken human speech and generating information for selecting the voice waveform segment necessary for voice synthesis from the extracted voice waveform segment, comprising the steps of:
-
selecting range, in which the voice waveform segments re regarded as the same within a sequential zone among all of voice waveform segments consisting the original speech, and selecting a representative voice waveform segment from the voice waveform segment within said range; storing said representative voice waveform segments and said voice waveform segment other than said range in a database in compressed form; and calculating number of sequential voice waveform segments within said range and amplitude multiplying factor per each waveform segment with respect to said representative voice waveform segment and storing in a storage device in a form of table. - View Dependent Claims (18)
-
-
19. A voice synthesizing segment generating method extracting a plurality of voice waveform segments from an originally spoken human speech and generating information for selecting the voice waveform segment necessary for voice synthesis from the extracted voice waveform segment, comprising the steps of:
-
selecting set of the voice waveform segments regarded as the same among all of voice waveform segments consisting the original speech, and selecting a representative voice waveform segment from the voice waveform segment within said set; storing said representative voice waveform segments and said voice waveform segment other than said set in a database in compressed form; and calculating amplitude multiplying factor per each waveform segment with respect to said representative voice waveform segment and number of samples for shifting the voice wave form in a time direction, in said set and storing in a storage device in a form of table. - View Dependent Claims (20, 21, 22, 23)
-
-
24. A voice synthesizing segment generating method extracting a plurality of voice waveform segments from an originally spoken human speech and generating information for selecting the voice waveform segment necessary for voice synthesis from the extracted voice waveform segment, comprising the steps of:
-
selecting range, in which the voice waveform segments are regarded as the same within a sequential zone among all of voice waveform segments consisting the original speech, and selecting a representative voice waveform segment from the voice waveform segment within said range; with respect to the result of selection, selecting set of the voice waveform segments regarded as the same voice waveform segment, and selecting a representative voice waveform segment from the voice waveform segment within said set; storing said representative voice waveform segments in said set and said voice waveform segment other than said set in a database in compressed form; calculating number of sequential voice waveform segments within said range and amplitude multiplying factor per each waveform segment with respect to said representative voice waveform segment and storing in a storage device in a form of table; and calculating amplitude multiplying factor per each waveform segment in said set with respect to said representative voice waveform segment and number of samples for shifting the voice wave form in a time direction, in said set and storing in a storage device in a form of table. - View Dependent Claims (25)
-
-
26. A storage medium recording a program for synthesizing a desired voice waveform by overlaying a plurality of voice waveform segments in waveform concatenation method, said program comprising the steps of:
-
preliminarily storing compressed voice waveform segments in a database; returning the voice waveform segment to a demander when the voice waveform segment necessary for voice waveform synthesis is demanded and if the demanded voice waveform segment is already stored in a cache memory; reading out the compressed data of the voice waveform segment from said database storing the compressed data of the voice waveform segments and reproducing an original voice waveform segment by decompressing the read out compressed data if the demanded voice waveform segment is not stored in a cache memory; and storing the reproduced voice waveform segment in said cache memory and returning to said demander. - View Dependent Claims (27, 28)
-
-
29. A storage medium recording a program extracting a plurality of voice waveform segments from an originally spoken human speech and generating information for selecting the voice waveform segment necessary for voice synthesis from the extracted voice waveform segment, said program comprising the steps of:
-
selecting range, in which the voice waveform segments are regarded as the same within a sequential zone among all of voice waveform segments consisting the original speech, and selecting a representative voice waveform segment from the voice waveform segment within said range; storing said representative voice waveform segments and said voice waveform segment other than said range in a database in compressed form; and calculating number of sequential voice waveform segments within said range and amplitude multiplying factor per each waveform segment with respect to said representative voice waveform segment and storing in a storage device in a form of table. - View Dependent Claims (30, 31)
-
-
32. A storage medium recording a program extracting a plurality of voice waveform segments from an originally spoken human speech and generating information for selecting the voice waveform segment necessary for voice synthesis from the extracted voice waveform segment, said program comprising the steps of:
-
selecting set of the voice waveform segments regarded as the same among all of voice waveform segments consisting the original speech, and selecting a representative voice waveform segment from the voice waveform segment within said set; storing said representative voice waveform segments and said voice waveform segment other than said set in a database in compressed form; and calculating amplitude multiplying factor per each waveform segment with respect to said representative voice waveform segment and number of samples for shifting the voice wave form in a time direction, in said set and storing in a storage device in a form of table. - View Dependent Claims (33, 34, 35)
-
-
36. A storage medium recording a program extracting a plurality of voice waveform segments from an originally spoken human speech and generating information for selecting the voice waveform segment necessary for voice synthesis from the extracted voice waveform segment, said program comprising the steps of:
-
selecting range, in which the voice waveform segments are regarded as the same within a sequential zone among all of voice waveform segments consisting the original speech, and selecting a representative voice waveform segment from the voice waveform segment within said range; with respect to the result of selection, selecting set of the voice waveform segments regarded as the same voice waveform segment, and selecting a representative voice waveform segment from the voice waveform segment within said set; storing said representative voice waveform segments and said voice waveform segment other than said set in a database in compressed form; calculating number of the voice waveform segments within said range and amplitude multiplying factor per each waveform segment with respect to said representative voice waveform segment and storing in a storage device in a form of table; and calculating amplitude multiplying factor per each waveform segment within said set with respect to said representative voice waveform segment and number of samples for shifting the voice wave form in a time direction and storing in a storage device in a form of table. - View Dependent Claims (37)
-
Specification