Speech processing apparatus having a speech synthesis unit that performs speech synthesis while selectively changing recorded-speech-playback and text-to-speech and method
First Claim
1. A speech processing apparatus which generates guidance speech corresponding to user operation using a speech synthesis unit configured to perform speech synthesis while selectively changing recorded-speech-playback and text-to-speech, the apparatus comprising:
- a guidance holding unit that holds(i) a first guidance including fixed portions indicating fixed messages, and a first variable portion, and a second variable portion, wherein the first variable portion and the second variable portion are located between the fixed portions and indicate that a first message and a second message corresponding to user operation are inserted,(ii) a second guidance that has the second variable portion located at the end of a fixed portion and is synonymous with the first guidance, and(iii) a third guidance which has the first variable portion located at the end of a fixed portion and is synonymous with the first guidance;
an entry holding unit that holds a set of entries in which spellings, pronunciations of the spellings, and pieces of speech based on the pronunciations which are associated with user operation are configured to be registered; and
an acquisition unit that acquires an entry corresponding to operation performed by a user from said entry holding unit,wherein, based on a number of times of changing between the recorded-speech-playback and the text-to-speech when performing speech synthesis, said speech synthesizer unit applies;
the first guidance if the number of times of changing when performing speech synthesis using the first guidance is less than a predetermined number,the second guidance if the number of times of changing when performing speech synthesis using the first guidance is not less than the predetermined number, and if the number of times of changing when performing speech synthesis using the second guidance is less than the predetermined number, andthe third guidance if the number of times of changing when performing speech synthesis using the first guidance is not less than the predetermined number, and if the number of times of changing when performing speech synthesis using the second guidance is not less than the predetermined number, and if the number of times of changing when performing speech synthesis using the third guidance is less than the predetermined number.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech processing apparatus which can playback a sentence using recorded-speech-playback or text-to-speech is provided. It is determined whether each of a plurality of words or phrases constituting a sentence is a word or phrase to be played back by recorded-speech-playback or a word or phrase to be played back by text-to-speech. When each of the plurality of words or phrases is to be played back in a first sequence using the determined synthesis method, it is selected whether to playback each of the plurality of words or phrases in the first sequence or a sequence different from the first sequence, based on the number of times of reversing playback using recorded-speech-playback and playback using text-to-speech. Each of the plurality of words or phrases is played back in the selected sequence using the selected synthesis method.
-
Citations
4 Claims
-
1. A speech processing apparatus which generates guidance speech corresponding to user operation using a speech synthesis unit configured to perform speech synthesis while selectively changing recorded-speech-playback and text-to-speech, the apparatus comprising:
-
a guidance holding unit that holds (i) a first guidance including fixed portions indicating fixed messages, and a first variable portion, and a second variable portion, wherein the first variable portion and the second variable portion are located between the fixed portions and indicate that a first message and a second message corresponding to user operation are inserted, (ii) a second guidance that has the second variable portion located at the end of a fixed portion and is synonymous with the first guidance, and (iii) a third guidance which has the first variable portion located at the end of a fixed portion and is synonymous with the first guidance; an entry holding unit that holds a set of entries in which spellings, pronunciations of the spellings, and pieces of speech based on the pronunciations which are associated with user operation are configured to be registered; and an acquisition unit that acquires an entry corresponding to operation performed by a user from said entry holding unit, wherein, based on a number of times of changing between the recorded-speech-playback and the text-to-speech when performing speech synthesis, said speech synthesizer unit applies; the first guidance if the number of times of changing when performing speech synthesis using the first guidance is less than a predetermined number, the second guidance if the number of times of changing when performing speech synthesis using the first guidance is not less than the predetermined number, and if the number of times of changing when performing speech synthesis using the second guidance is less than the predetermined number, and the third guidance if the number of times of changing when performing speech synthesis using the first guidance is not less than the predetermined number, and if the number of times of changing when performing speech synthesis using the second guidance is not less than the predetermined number, and if the number of times of changing when performing speech synthesis using the third guidance is less than the predetermined number. - View Dependent Claims (2)
-
-
3. A speech processing method of generating guidance speech corresponding to user operation by controlling a speech processing apparatus having
a guidance holding unit that holds (i) a first guidance including fixed portions indicating fixed messages, and a first variable portion, and a second variable portion, wherein the first variable portion and the second variable portion are located between the fixed portions and indicate that a first message and a second message corresponding to user operation are inserted, (ii) a second guidance that has the second variable portion located at the end of a fixed portion and is synonymous with the first guidance, and (iii) a third guidance which has the first variable portion located at the end of a fixed portion and is synonymous with the first guidance; -
an entry holding unit that holds a set of entries in which spellings, pronunciations of the spellings, and pieces of speech based on the pronunciations which are associated with user operation are configured to be registered, and a speech synthesis unit that performs speech synthesis while selectively changing recorded-speech-playback and text-to-speech, the method comprising the steps of; acquiring an entry corresponding to an operation performed by a user from the entry holding unit; and applying, based on a number of times of changing between the recorded-speech-playback and the text-to-speech when performing speech synthesis; the first guidance if the number of times of changing when performing speech synthesis using the first guidance is less than a predetermined number, the second guidance if the number of times of changing when performing speech synthesis using the first guidance is not less than the predetermined number, and if the number of times of changing when performing speech synthesis using the second guidance is less than the predetermined number, and the third guidance if the number of times of changing when performing speech synthesis using the first guidance is not less than the predetermined number, and if the number of times of changing when performing speech synthesis using the second guidance is not less than the predetermined number, and if the number of times of changing when performing speech synthesis using the third guidance is less than the predetermined number. - View Dependent Claims (4)
-
Specification