Voice prompts for use in speech-to-speech translation system
Abstract
Techniques for employing improved prompts in a speech-to-speech translation system are disclosed. By way of example, a technique for use in indicating a dialogue turn in an automated speech-to-speech translation system comprises the following steps/operations. One or more text-based scripts are obtained. The one or more text-based scripts are synthesizable into one or more voice prompts. At least one of the one or more voice prompts is synthesized for playback from at least one of the one or more text-based scripts, the at least one synthesized voice prompt comprising an audible message in a language understandable to a speaker interacting with the speech-to-speech translation system, the audible message indicating a dialogue turn in the automated speech-to-speech translation system.
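The obtain, synthesize, and play steps summarized in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `PROMPT_SCRIPTS` table, its wording and language codes, the `TurnPrompter` class, and `fake_synthesize`, which stands in for a real text-to-speech engine by returning tagged bytes rather than audio.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical previously-generated text scripts, one per speaker language.
PROMPT_SCRIPTS: Dict[str, str] = {
    "en": "Please speak now.",
    "es": "Por favor, hable ahora.",
    "fr": "Veuillez parler maintenant.",
}


def fake_synthesize(text: str, language: str) -> bytes:
    """Stand-in for a TTS engine: returns tagged bytes, not real audio."""
    return f"[{language}] {text}".encode("utf-8")


@dataclass
class TurnPrompter:
    """Obtains a script, synthesizes it, and plays it to signal a turn."""

    scripts: Dict[str, str]
    synthesize: Callable[[str, str], bytes] = fake_synthesize
    played: List[bytes] = field(default_factory=list)

    def obtain_script(self, language: str) -> str:
        # Step 1: look up the pre-authored script for the speaker's language.
        return self.scripts[language]

    def play(self, audio: bytes) -> None:
        # Step 3: a real system would route this to a loudspeaker;
        # this sketch only records what was played.
        self.played.append(audio)

    def prompt_turn(self, language: str) -> bytes:
        script = self.obtain_script(language)
        # Step 2: synthesize the voice prompt from the text script.
        audio = self.synthesize(script, language)
        self.play(audio)
        return audio
```

Calling `TurnPrompter(PROMPT_SCRIPTS).prompt_turn("es")` would cue a Spanish speaker. Keeping the prompts as text rather than recorded audio means a new language needs only a new script entry, provided the synthesizer supports that language.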
73 Citations
22 Claims
1. A method for use in indicating a dialogue turn in an automated speech-to-speech translation system, comprising the steps of:
translating speech input between a plurality of speakers having a multilingual conversation using an automated speech-to-speech translation system; and
providing an indication to each speaker of the plurality of speakers of when it is a turn of each speaker to commence speaking in a dialog interaction between the plurality of speakers and provide speech input to the automated speech-to-speech translation system, wherein providing an indication comprises:
obtaining one or more previously-generated text-based scripts, the one or more text-based scripts being synthesizable into one or more voice prompts in different languages of the plurality of speakers, wherein the voice prompts are audible messages that notify a given speaker when it is a turn of the given speaker for inputting speech to the automated speech-to-speech translation system;
synthesizing for playback at least one of the one or more voice prompts from at least one of the one or more text-based scripts, the at least one synthesized voice prompt comprising an audible message in a language understandable to the given speaker to notify the given speaker when it is a turn of the given speaker for inputting speech to the automated speech-to-speech translation system; and
playing the at least one synthesized voice prompt to provide the audible message to the given speaker to notify the given speaker that it is the given speaker's turn for inputting speech to the automated speech-to-speech translation system.
Dependent claims: 2, 3, 4, 5, 6, 7, 8

9. A method of providing an interface for use in an automated speech-to-speech translation system, the automated speech-to-speech translation system being operated by a system user and interacted with by a foreign language speaker, the method comprising the steps of:
translating speech input between the foreign language speaker and the system user having a multilingual conversation using the automated speech-to-speech translation system; and
utilizing an interface of the automated speech-to-speech translation system to provide an indication to the foreign language speaker of when it is a turn of the foreign language speaker to commence speaking in a dialog interaction with the system user and provide speech input to the automated speech-to-speech translation system by the foreign language speaker, wherein utilizing an interface comprises:
the system user enabling a microphone of the automated speech-to-speech translation system via the interface;
synthesizing at least one previously-generated text-based script into a voice prompt for playback to the foreign language speaker, the voice prompt comprising an audible message in a language understandable to the foreign language speaker to notify the foreign language speaker when it is a turn of the foreign language speaker to input speech to the automated speech-to-speech translation system;
playing the audible message to the foreign language speaker to notify the foreign language speaker that it is the foreign language speaker's turn for inputting speech to the automated speech-to-speech translation system; and
receiving speech uttered into the microphone by the foreign language speaker for translation by the automated speech-to-speech translation system.
Dependent claims: 10, 11

12. An apparatus for use in indicating a dialogue turn in an automated speech-to-speech translation system, comprising:
a memory; and
at least one processor coupled to the memory and operative to:
(i) translate speech input from a plurality of speakers having a multilingual conversation using an automated speech-to-speech translation system;
(ii) provide an indication to each speaker of the plurality of speakers of when it is a turn of each speaker to commence speaking in a dialog interaction between the plurality of speakers and provide speech input to the automated speech-to-speech translation system, wherein the at least one processor is operative to provide an indication by:
obtaining one or more previously-generated text-based scripts, the one or more text-based scripts being synthesizable into one or more voice prompts in different languages of the plurality of speakers, wherein the voice prompts are audible messages that notify a given speaker when it is a turn of the given speaker for inputting speech to the automated speech-to-speech translation system;
synthesizing for playback at least one of the one or more voice prompts from at least one of the one or more text-based scripts, the at least one synthesized voice prompt comprising an audible message in a language understandable to the given speaker to notify the given speaker when it is a turn of the given speaker for inputting speech to the automated speech-to-speech translation system; and
playing the at least one synthesized voice prompt to provide the audible message to the given speaker to notify the given speaker that it is the given speaker's turn for inputting speech to the automated speech-to-speech translation system.
Dependent claims: 13, 14, 15, 16, 17, 18, 19

20. An interface for use in an automated speech-to-speech translation system, the automated speech-to-speech translation system being operated by a system user and interacted with by a foreign language speaker, the interface comprising:
a display to display a graphical user interface of the automated speech-to-speech translation system, wherein the graphical user interface comprises:
a first field for use by the system user to enable a microphone of the automated speech-to-speech translation system;
a second field for use by the system user for at least one of displaying text of speech uttered by the system user and displaying text of translated speech uttered by the foreign language speaker; and
a third field for use by the foreign language speaker for at least one of displaying text of speech uttered by the speaker and displaying text of translated speech uttered by the system user;
wherein the automated speech-to-speech translation system synthesizes for audible output at least one previously-generated voice prompt to the foreign language speaker in a language understandable to the foreign language speaker to notify the foreign language speaker when it is a turn of the foreign language speaker for inputting speech to the automated speech-to-speech translation system, and
wherein the automated speech-to-speech translation system receives speech uttered into the microphone by the foreign language speaker for translation by the automated speech-to-speech translation system.
Dependent claims: 21

22. An article of manufacture for use in indicating a dialogue turn in an automated speech-to-speech translation system, comprising a non-transitory computer readable storage medium containing one or more programs which when executed implement the steps of:
translating speech input between a plurality of speakers having a multilingual conversation using an automated speech-to-speech translation system; and
providing an indication to each speaker of the plurality of speakers of when it is a turn of each speaker to commence speaking in a dialog interaction between the plurality of speakers and provide speech input to the automated speech-to-speech translation system, wherein providing an indication comprises:
obtaining one or more previously-generated text-based scripts, the one or more text-based scripts being synthesizable into one or more voice prompts in different languages of the plurality of speakers, wherein the voice prompts are audible messages that notify a given speaker when it is a turn of the given speaker for inputting speech to the automated speech-to-speech translation system;
synthesizing for playback at least one of the one or more voice prompts from at least one of the one or more text-based scripts, the at least one synthesized voice prompt comprising an audible message in a language understandable to the given speaker to notify the given speaker when it is a turn of the given speaker for inputting speech to the automated speech-to-speech translation system; and
playing the at least one synthesized voice prompt to provide the audible message to the given speaker to notify the given speaker that it is the given speaker's turn for inputting speech to the automated speech-to-speech translation system.
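
The operator-driven sequence recited in claim 9 (microphone enabled by the system user, prompt synthesized and played, speech received and passed on for translation) can be sketched as a single function. This is only an illustration of the ordering of the steps: the function name `run_foreign_speaker_turn`, its parameters, and the event labels written to `log` are hypothetical, and the callables stand in for real microphone, synthesis, playback, and translation components.

```python
from typing import Callable, List


def run_foreign_speaker_turn(
    prompt_script: str,
    synthesize: Callable[[str], bytes],
    play: Callable[[bytes], None],
    record: Callable[[], str],
    translate: Callable[[str], str],
    log: List[str],
) -> str:
    """Run one foreign-speaker turn and return the translated utterance."""
    # The system user enables the microphone via the interface.
    log.append("mic_enabled")

    # The previously-generated text script is synthesized into a voice prompt.
    audio = synthesize(prompt_script)
    log.append("prompt_synthesized")

    # The prompt is played so the foreign language speaker knows to begin.
    play(audio)
    log.append("prompt_played")

    # Speech uttered into the microphone is received (a string stands in
    # for recognized speech in this sketch).
    utterance = record()
    log.append("speech_received")

    # The received speech is handed to the translation component.
    return translate(utterance)
```

Passing the components in as callables keeps the sketch independent of any particular speech engine and makes the step order easy to check in isolation.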
Specification