SPEECH SYNTHESIZER, AUDIO WATERMARKING INFORMATION DETECTION APPARATUS, SPEECH SYNTHESIZING METHOD, AUDIO WATERMARKING INFORMATION DETECTION METHOD, AND COMPUTER PROGRAM PRODUCT
Abstract
According to an embodiment, a speech synthesizer includes a source generator, a phase modulator, and a vocal tract filter unit. The source generator generates a source signal by using a fundamental frequency sequence and a pulse signal. The phase modulator modulates, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information. The vocal tract filter unit generates a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.
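The three stages named in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the bit-to-phase mapping (bit 1 gives a phase of pi/2, bit 0 gives 0), the all-pole filter standing in for the "spectrum parameter sequence", the silent handling of unvoiced frames, and every function and parameter name below are assumptions made for illustration only.

```python
import cmath
import math

def pitch_marks(f0_per_frame, frame_len, sample_rate):
    """Place one pitch mark each time the accumulated F0 phase wraps past 1."""
    marks, phase = [], 0.0
    for i, f0 in enumerate(f0_per_frame):
        for n in range(frame_len):
            if f0 <= 0:            # unvoiced frame: no pulses in this sketch
                phase = 0.0
                continue
            phase += f0 / sample_rate
            if phase >= 1.0:
                phase -= 1.0
                marks.append(i * frame_len + n)
    return marks

def phase_shifted_pulse(theta, n_fft=32):
    """Real pulse whose positive-frequency bins all carry phase theta.
    The pulse center is placed at index n_fft // 2."""
    spec = [0j] * n_fft
    for k in range(1, n_fft // 2):
        spec[k] = cmath.exp(1j * theta)
        spec[n_fft - k] = cmath.exp(-1j * theta)  # conjugate symmetry keeps it real
    pulse = []
    for n in range(n_fft):         # naive inverse DFT, fine for a short window
        acc = sum(spec[k] * cmath.exp(2j * math.pi * k * n / n_fft)
                  for k in range(n_fft))
        pulse.append(acc.real / n_fft)
    half = n_fft // 2
    return pulse[half:] + pulse[:half]

def vocal_tract_filter(source, lpc_per_frame, frame_len):
    """All-pole filter y[n] = x[n] - sum_k a[k] * y[n-1-k], coefficients per frame.
    Assumed stand-in for filtering with a spectrum parameter sequence."""
    out = []
    for n, x in enumerate(source):
        a = lpc_per_frame[min(n // frame_len, len(lpc_per_frame) - 1)]
        fb = sum(a_k * out[n - 1 - k] for k, a_k in enumerate(a) if n - 1 - k >= 0)
        out.append(x - fb)
    return out

def synthesize(f0_per_frame, lpc_per_frame, watermark_bits,
               frame_len=80, sample_rate=8000, n_fft=32):
    """Source generation, per-pitch-mark phase modulation, then filtering."""
    total = len(f0_per_frame) * frame_len
    source = [0.0] * total
    marks = pitch_marks(f0_per_frame, frame_len, sample_rate)
    for i, mark in enumerate(marks):
        bit = watermark_bits[i % len(watermark_bits)]
        theta = math.pi / 2 if bit else 0.0       # hypothetical mapping
        for j, v in enumerate(phase_shifted_pulse(theta, n_fft)):
            idx = mark - n_fft // 2 + j           # overlap-add around the mark
            if 0 <= idx < total:
                source[idx] += v
    return vocal_tract_filter(source, lpc_per_frame, frame_len), marks

speech, marks = synthesize([125.0] * 10, [[-0.9]] * 10, [1, 0])
```

Building the pulse in the frequency domain makes the phase modulation explicit: only the phase of each bin changes with the watermark bit, while the magnitude spectrum of the pulse stays the same.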
15 Claims
1. A speech synthesizer comprising:
a source generator configured to generate a source signal by using a fundamental frequency sequence and a pulse signal;
a phase modulator configured to modulate, with respect to the source signal generated by the source generator, a phase of the pulse signal at each pitch mark based on audio watermarking information; and
a vocal tract filter unit configured to generate a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated by the phase modulator.
(Dependent claims: 2, 3, 4, 5, 6, 7, 8)
9. An audio watermarking information detection apparatus comprising:
a pitch mark estimator configured to estimate a pitch mark of a synthesized speech in which audio watermarking information is embedded and to extract a speech at each estimated pitch mark;
a phase extractor configured to extract a phase of the speech extracted by the pitch mark estimator;
a representative phase calculator configured to calculate a representative phase to be a representative of a plurality of frequency bins from the phase extracted by the phase extractor; and
a determination unit configured to determine, based on the representative phase, whether there is the audio watermarking information.
(Dependent claims: 10, 11)
12. A speech synthesizing method comprising:
generating a source signal by using a fundamental frequency sequence and a pulse signal;
modulating, with respect to the generated source signal, a phase of the pulse signal at each pitch mark based on audio watermarking information; and
generating a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated.
13. An audio watermarking information detection method comprising:
estimating a pitch mark of a synthesized speech in which audio watermarking information is embedded and extracting a speech at each estimated pitch mark;
extracting a phase of the extracted speech;
calculating, from the extracted phase, a representative phase to be a representative of a plurality of frequency bins; and
determining, based on the representative phase, whether there is the audio watermarking information.
14. A speech synthesizing program to cause a computer to execute:
generating a source signal by using a fundamental frequency sequence and a pulse signal;
modulating, with respect to the generated source signal, a phase of the pulse signal at each pitch mark based on audio watermarking information; and
generating a speech signal by using a spectrum parameter sequence with respect to the source signal in which the phase of the pulse signal is modulated.
15. An audio watermarking information detection program to cause a computer to execute:
estimating a pitch mark of a synthesized speech in which audio watermarking information is embedded and extracting a speech at each estimated pitch mark;
extracting a phase of the extracted speech;
calculating, from the extracted phase, a representative phase to be a representative of a plurality of frequency bins; and
determining, based on the representative phase, whether there is the audio watermarking information.
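The detection steps recited in claims 9, 13, and 15 can be sketched in the same spirit. This is an illustrative reading, not the patented method: pitch mark estimation is replaced here by marks supplied by the caller, the "representative phase" is taken to be a circular mean over the chosen bins, and the decision rule (compare against expected phases with a tolerance `tol` and a required match ratio `min_match`) is a hypothetical choice. All names are assumptions.

```python
import cmath
import math

def extract_frame(signal, mark, n_fft):
    """Cut n_fft samples centered on the pitch mark, then rotate the frame so the
    mark sits at index 0 (this removes the linear phase caused by the offset)."""
    half = n_fft // 2
    frame = [signal[i] if 0 <= i < len(signal) else 0.0
             for i in range(mark - half, mark + half)]
    return frame[half:] + frame[:half]

def bin_phases(frame, bins):
    """Phase of selected DFT bins, computed with a naive DFT."""
    n = len(frame)
    phases = []
    for k in bins:
        x_k = sum(v * cmath.exp(-2j * math.pi * k * i / n)
                  for i, v in enumerate(frame))
        phases.append(cmath.phase(x_k))
    return phases

def representative_phase(phases):
    """Circular mean of the per-bin phases (assumed definition)."""
    return cmath.phase(sum(cmath.exp(1j * p) for p in phases))

def detect_watermark(signal, marks, expected_thetas, n_fft=32,
                     bins=range(1, 16), tol=0.3, min_match=0.8):
    """Return True if enough pitch marks show the expected representative phase."""
    pairs = list(zip(marks, expected_thetas))
    if not pairs:
        return False
    hits = 0
    for mark, theta in pairs:
        rep = representative_phase(
            bin_phases(extract_frame(signal, mark, n_fft), bins))
        # Wrapped angular distance between observed and expected phase.
        diff = abs(cmath.phase(cmath.exp(1j * (rep - theta))))
        if diff < tol:
            hits += 1
    return hits / len(pairs) >= min_match

# Toy input: two zero-phase unit pulses at samples 50 and 120.
demo = [0.0] * 200
demo[50] = 1.0
demo[120] = 1.0
found_zero_phase = detect_watermark(demo, [50, 120], [0.0, 0.0])
found_quarter_turn = detect_watermark(demo, [50, 120], [math.pi / 2] * 2)
```

In real synthesized speech the vocal tract filter also contributes to the phase of each bin, so a practical detector would have to account for that before comparing phases. The sketch ignores this and only shows the data flow from pitch mark to frame, per-bin phase, representative phase, and decision.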
Specification