Speech synthesis device and method
First Claim
1. A speech synthesis method comprising:
- receiving a voice signal of an utterance;
detecting a voiced section of the voice signal;
detecting a pitch of a trailing end portion of the voiced section;
acquiring voice data of a response to the utterance;
acquiring a representative pitch based on the voice data of the response;
determining one shift amount for shifting the representative pitch to a target pitch having a particular relationship to the detected pitch of the trailing end portion; and
synthesizing voice of the response based on the voice data of the response, while shifting pitch of the voice data in accordance with the one shift amount.
0 Assignments
0 Petitions
Accused Products
Abstract
This invention is an improvement of technology for automatically generating response voice to voice uttered by a speaker (user), and is characterized by controlling a pitch of the response voice in accordance with a pitch of the speaker'"'"'s utterance. A voice signal of the speaker'"'"'s utterance (e.g., question) is received, and a pitch (e.g., highest pitch) of a representative portion of the utterance is detected. Voice data of a responsive to the utterance is acquired, and a pitch (e.g., average pitch) based on the acquired response voice data is acquired. A pitch shift amount for shifting the acquired pitch to a target pitch having a particular relationship to the pitch of the representative portion is determined. When response voice is to be synthesized on the basis of the response voice data, the pitch of the response voice to be synthesized is shifted in accordance with the pitch shift amount.
11 Citations
6 Claims
-
1. A speech synthesis method comprising:
-
receiving a voice signal of an utterance; detecting a voiced section of the voice signal; detecting a pitch of a trailing end portion of the voiced section; acquiring voice data of a response to the utterance; acquiring a representative pitch based on the voice data of the response; determining one shift amount for shifting the representative pitch to a target pitch having a particular relationship to the detected pitch of the trailing end portion; and synthesizing voice of the response based on the voice data of the response, while shifting pitch of the voice data in accordance with the one shift amount. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A speech synthesis device comprising:
-
a receiver circuit that receives a voice signal of an utterance; and a processor configured to; detect a voiced section of the voice signal; detect a pitch of a trailing end portion of the voiced section; acquire voice data of a response to the utterance; acquire a representative pitch based on the voice data of the response; determine one shift amount for shifting the representative pitch to a target pitch having a particular relationship to the detected pitch of the trailing end portion; and synthesize voice of the response based on the voice data of the response, while shifting pitch of the voice data in accordance with the one shift amount.
-
Specification