Voice user interface with personality
Abstract
The present invention provides a voice user interface with personality. In one embodiment, a method includes executing a voice user interface, and controlling the voice user interface to provide the voice user interface with a personality. The method includes selecting a prompt based on various context situations, such as a previously selected prompt and the user's experience with using the voice user interface.
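As an illustration only, the context-based prompt selection summarized in the abstract might be sketched as follows. The `UserContext` fields, the prompt wording, and the experience thresholds are all assumptions made for this example; the source does not specify them.

```python
import random
from dataclasses import dataclass
from typing import Optional


@dataclass
class UserContext:
    """Hypothetical per-user state; the field names are assumptions."""
    sessions_completed: int = 0        # experience across sessions
    turns_this_session: int = 0        # experience within the current session
    last_prompt: Optional[str] = None  # previously selected prompt


# Hypothetical prompt pools for a single response ("ask what to do next").
LONG_PROMPTS = [
    "What would you like to do? You can say 'read messages' or 'call someone'.",
    "I'm ready. You can ask me to read messages or to call someone.",
]
SHORT_PROMPTS = ["What next?", "Go ahead.", "Yes?"]


def select_prompt(ctx: UserContext, rng: random.Random) -> str:
    """Pick a prompt from context: shorter for experienced users, and
    different from the previously selected prompt when possible."""
    # The thresholds (3 sessions, 5 turns) are arbitrary illustrative values.
    experienced = ctx.sessions_completed >= 3 or ctx.turns_this_session >= 5
    pool = SHORT_PROMPTS if experienced else LONG_PROMPTS
    # Avoid repeating the last prompt; fall back to the full pool if needed.
    candidates = [p for p in pool if p != ctx.last_prompt] or pool
    choice = rng.choice(candidates)
    ctx.last_prompt = choice
    ctx.turns_this_session += 1
    return choice
```

A new user hears one of the longer prompts, and consecutive calls do not repeat the same wording; a user with several completed sessions hears a shorter prompt.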
142 Claims
-
1. An apparatus for a voice user interface with personality, the apparatus comprising:
-
logic that provides a voice user interface, the voice user interface outputting first voice signals, and the voice user interface recognizing speech signals; and
logic that provides a personality, the personality emulating human verbal behavior for a particular personality;
the logic that provides the personality interfacing with the logic that provides the voice user interface to provide the voice user interface with personality.
a telephone line card in communication with a telephone line for receiving speech input data from a user and speech output data from the logic that provides the voice user interface.
-
-
7. The apparatus as recited in claim 6 wherein the computer system further comprises:
a network card, the network card in communication with a network.
-
8. The apparatus as recited in claim 1 wherein the voice user interface with personality comprises a virtual assistant with personality.
-
9. The apparatus as recited in claim 8 wherein the personality of the virtual assistant comprises a predetermined tone or cadence of the first voice signals, a predetermined vocabulary, a predetermined sentence construction, or a predetermined degree of assertiveness.
-
10. The apparatus as recited in claim 8 wherein the first voice signals comprise a prompt, the prompt being scripted for the personality of the virtual assistant.
-
11. The apparatus as recited in claim 8 wherein the first voice signals comprise a recorded prompt, the recorded prompt being acted and recorded for the personality of the virtual assistant.
-
12. The apparatus as recited in claim 1 wherein the logic that provides the personality comprises randomly selecting a prompt from multiple prompts available to provide a predetermined response.
-
13. The apparatus as recited in claim 1 wherein the logic that provides the personality comprises selecting a prompt, the prompt comprising an appropriate temporal prompt.
-
14. The apparatus as recited in claim 1 wherein the logic that provides the personality comprises selecting a prompt, the prompt comprising a term that was previously spoken by the user in a recognized command.
-
15. The apparatus as recited in claim 1 wherein the logic that provides the voice user interface comprises recognizing multiple spoken commands as a predetermined command.
-
16. The apparatus as recited in claim 1 wherein the logic that provides the voice user interface comprises generating second voice signals, the second voice signals comprising synthesized voice signals that correspond to text.
-
17. The apparatus as recited in claim 1 wherein the logic that provides the personality selects a smooth hand-off prompt to provide a smooth hand-off between the first voice signals and second voice signals, the second voice signals being output by the logic that provides the voice user interface.
-
18. The apparatus as recited in claim 17 wherein the logic that provides the personality selects a first complete sentence for output by the first voice signals and a second complete sentence for output by the second voice signals.
-
19. The apparatus as recited in claim 18 wherein the first voice signals comprise the voice of a virtual assistant, and the second voice signals comprise the voice of a helper to the virtual assistant.
-
20. The apparatus as recited in claim 19 wherein the second voice signals comprise recorded voice signals.
-
21. The apparatus as recited in claim 19 wherein the second voice signals comprise synthesized voice signals.
-
22. The apparatus as recited in claim 1 wherein the logic that provides the personality comprises controlling the selection of the first voice signals output by the voice user interface such that the voice user interface with personality behaves consistently with social and emotional norms, including politeness, while interacting with the user.
-
23. The apparatus as recited in claim 22 wherein the first voice signals comprise a word that is within a recognition grammar of the voice user interface.
-
24. The apparatus as recited in claim 22 wherein the logic that provides the personality comprises controlling the selection of the first voice signals output by the voice user interface such that the voice user interface with personality addresses a user by a proper name of the user.
-
25. The apparatus as recited in claim 22 wherein the voice user interface with personality suggests responses available to the user that are polite and socially appropriate, thereby permitting the user to be polite and socially appropriate while interacting with the voice user interface with personality.
-
26. The apparatus as recited in claim 1 wherein the logic that provides the personality comprises controlling the voice user interface in situations in which negative comments are needed.
-
27. The apparatus as recited in claim 26 wherein the logic that provides the personality comprises selecting a negative prompt, the negative prompt comprising a concise and plain statement of a problem without blaming a user, and outputting a recording of the brief negative prompt at a lower volume.
-
28. The apparatus as recited in claim 1 wherein the logic that provides the personality comprises selecting a prompt based on a user's experience with using the voice user interface during a current session and across sessions.
-
29. The apparatus as recited in claim 28 wherein the logic that provides the personality comprises selecting a shorter prompt based on the user's experience with using the voice user interface during the current session and across sessions.
-
30. The apparatus as recited in claim 28 wherein the logic that provides the personality comprises selecting a longer help prompt if the user's input indicates a problem with increasing frequency during the current session and across sessions.
-
31. The apparatus as recited in claim 28 wherein the logic that provides the personality comprises selecting a prompt that provides the available options to the user if the voice user interface does not recognize a command spoken by the user or if the user has not spoken for a defined period of time.
-
32. The apparatus as recited in claim 28 wherein the logic that provides the personality comprises selecting a coaching prompt that provides a current state of interaction, provides commands that the user can say at the current state of interaction, and provides the actions that would be taken in response to each of the commands.
-
33. The apparatus as recited in claim 1 wherein the logic that provides the personality comprises selecting an approximation prompt.
-
34. The apparatus as recited in claim 1 wherein the logic that provides the personality comprises providing the voice user interface with a first personality and a second personality.
-
35. The apparatus as recited in claim 34 wherein the first voice signals comprise the voice of the first personality, and second voice signals comprise the voice of the second personality.
-
36. The apparatus as recited in claim 34 wherein a first virtual assistant comprises the first personality, and a second virtual assistant comprises the second personality.
-
37. The apparatus as recited in claim 36 wherein the first virtual assistant and the second virtual assistant comprise different genders.
-
38. The apparatus as recited in claim 36 wherein a user selects the first virtual assistant or the second virtual assistant based on descriptions of the first virtual assistant and the second virtual assistant or based on interacting with the first virtual assistant and the second virtual assistant.
-
39. The apparatus as recited in claim 1 wherein the logic that provides the personality comprises selecting a humorous prompt.
-
40. The apparatus as recited in claim 1 wherein the voice user interface with personality comprises a virtual assistant for a voice-based desktop environment.
-
41. The apparatus as recited in claim 40 wherein the desktop environment comprises multiple objects, the virtual assistant being navigated among the multiple objects by a user.
-
42. The apparatus as recited in claim 1 further comprising:
a recognition grammar stored in a memory, the recognition grammar comprising multiple phrases that a virtual assistant with a personality can recognize when spoken by a user, and the recognition grammar being selected based on the personality of the virtual assistant.
-
43. The apparatus as recited in claim 1 wherein the logic that provides the voice user interface comprises:
echo cancellation software, barge-in software, signal processing software, automatic speech recognition/natural language software, request for services software, and text-to-speech/recorded speech software.
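To make two of the apparatus claims more concrete, the following is a minimal sketch of recognizing multiple spoken commands as one predetermined command (claim 15) using a recognition grammar selected by personality (claim 42). The personality names, phrases, command identifiers, and the `resolve_command` helper are hypothetical, and a plain dictionary lookup on already-transcribed text stands in for a real speech recognizer.

```python
from typing import Optional

# Hypothetical grammars: each personality recognizes its own set of phrases,
# and several phrases resolve to the same predetermined command.
GRAMMARS = {
    "formal": {
        "read my messages": "READ_MESSAGES",
        "please read my messages": "READ_MESSAGES",
        "place a call": "PLACE_CALL",
    },
    "casual": {
        "what's new": "READ_MESSAGES",
        "read my messages": "READ_MESSAGES",
        "call somebody": "PLACE_CALL",
    },
}


def resolve_command(personality: str, utterance: str) -> Optional[str]:
    """Map a transcribed utterance to a command using the grammar chosen
    for the given personality; return None if the phrase is not in it."""
    grammar = GRAMMARS[personality]
    # Normalize case, surrounding punctuation, and repeated whitespace.
    phrase = " ".join(utterance.lower().strip(" .,!?").split())
    return grammar.get(phrase)
```

For example, `resolve_command("casual", "What's new?")` and `resolve_command("formal", "Please read my messages")` both resolve to the same command, while the formal grammar does not accept the casual phrase.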
-
44. A method for a voice user interface with personality, the method comprising:
-
executing a voice user interface, the voice user interface outputting first voice signals, the voice user interface recognizing speech signals; and
controlling the voice user interface to provide the voice user interface with a personality, the personality emulating human verbal behavior for a particular personality.
storing a recognition grammar in a memory, the recognition grammar comprising multiple phrases that the voice user interface can recognize when spoken by a user, and the grammar being selected based on the personality of the voice user interface.
-
-
81. The method as recited in claim 44 wherein the voice user interface comprises:
echo cancellation software, barge-in software, signal processing software, automatic speech recognition/natural language software, request for services software, and text-to-speech/recorded speech software.
-
82. A data signal in a carrier wave for a voice user interface with personality, the data signal in a carrier wave comprising:
-
first voice signals, the first voice signals being output by a voice user interface with personality, the personality emulating human verbal behavior for a particular personality; and
speech signals, the voice user interface with personality recognizing the speech signals.
second voice signals, the second voice signals being output by the voice user interface with personality.
-
-
90. The data signal in a carrier wave as recited in claim 89 further comprising:
third voice signals, the third voice signals being output by the voice user interface with personality, the third voice signals comprising a smooth hand-off between the first voice signals and the second voice signals.
-
91. The data signal in a carrier wave as recited in claim 89 wherein the first voice signals comprise a first complete sentence and the second voice signals comprise a second complete sentence.
-
92. The data signal in a carrier wave as recited in claim 89 wherein the first voice signals comprise the voice of a virtual assistant, and the second voice signals comprise the voice of a helper to the virtual assistant.
-
93. The data signal in a carrier wave as recited in claim 89 wherein the second voice signals comprise synthesized voice signals.
-
94. The data signal in a carrier wave as recited in claim 82 wherein the first voice signals comprise a word that is within a recognition grammar of the voice user interface with personality.
-
95. The data signal in a carrier wave as recited in claim 82 wherein the first voice signals comprise a proper name of the user.
-
96. The data signal in a carrier wave as recited in claim 82 wherein the first voice signals comprise a negative prompt, the negative prompt comprising a concise and plain statement of a problem without blaming a user, and the first voice signals being output at a lower volume.
-
97. The data signal in a carrier wave as recited in claim 82 wherein the first voice signals comprise a prompt that is selected based on a user's experience with using the voice user interface with personality during a current session and across sessions.
-
98. The data signal in a carrier wave as recited in claim 97 wherein the first voice signals comprise a longer help prompt if the user's input indicates a problem with increasing frequency during the current session and across sessions.
-
99. The data signal in a carrier wave as recited in claim 82 wherein the first voice signals comprise a coaching prompt that provides a current state of interaction, lists commands that the user can say at the current state of interaction, and lists the actions that would be taken in response to each of the commands.
-
100. The data signal in a carrier wave as recited in claim 82 wherein the first voice signals comprise an approximation prompt.
-
101. The data signal in a carrier wave as recited in claim 82 wherein the voice user interface with personality comprises a first personality and a second personality.
-
102. The data signal in a carrier wave as recited in claim 101 wherein the first voice signals comprise the voice of the first personality, and second voice signals comprise the voice of the second personality, the second voice signals being output by the voice user interface with personality.
-
103. The data signal in a carrier wave as recited in claim 101 wherein a first virtual assistant comprises the first personality, and a second virtual assistant comprises the second personality.
-
104. The data signal in a carrier wave as recited in claim 82 wherein the first voice signals comprise a humorous prompt.
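Claim 96 recites a negative prompt that is output at a lower volume. One possible way to do this, sketched here under the assumption that the recorded prompt is raw signed 16-bit PCM in native byte order, is to scale each sample by a gain below 1. The function names and the 0.5 gain are hypothetical and are not taken from the source.

```python
import array

NEGATIVE_PROMPT_GAIN = 0.5  # assumed attenuation factor for negative prompts


def attenuate_pcm16(samples: bytes, gain: float) -> bytes:
    """Scale signed 16-bit PCM samples by `gain`, clamped to the int16 range."""
    pcm = array.array("h")
    pcm.frombytes(samples)
    scaled = array.array(
        "h", (max(-32768, min(32767, int(s * gain))) for s in pcm)
    )
    return scaled.tobytes()


def render_prompt(recording: bytes, negative: bool) -> bytes:
    """Return the audio to play: negative prompts are played more quietly."""
    if negative:
        return attenuate_pcm16(recording, NEGATIVE_PROMPT_GAIN)
    return recording
```

With this sketch, a negative prompt is played back at half amplitude, while other prompts pass through unchanged.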
-
105. A computer-readable medium having a computer program accessible therefrom, the computer program comprising instructions for:
-
executing a voice user interface, the voice user interface outputting first voice signals;
the voice user interface recognizing speech signals; and
controlling the voice user interface to provide the voice user interface with a personality;
wherein the personality emulates human verbal behavior for a particular personality.
the second voice signals being output by the voice user interface.
-
-
117. The computer-readable medium as recited in claim 116 wherein the controlling the voice user interface comprises selecting a first complete sentence for output by the first voice signals and a second complete sentence for output by the second voice signals.
-
118. The computer-readable medium as recited in claim 117 wherein the first voice signals comprise the voice of a virtual assistant and the second voice signals comprise the voice of a helper to the virtual assistant.
-
119. The computer-readable medium as recited in claim 118 wherein the second voice signals comprise recorded voice signals.
-
120. The computer-readable medium as recited in claim 118 wherein the second voice signals comprise synthesized voice signals.
-
121. The computer-readable medium as recited in claim 105 wherein the controlling the voice user interface comprises controlling the selection of the first voice signals output by the voice user interface such that the voice user interface behaves consistently with social and emotional norms, including politeness, while interacting with the user.
-
122. The computer-readable medium as recited in claim 121 wherein the controlling the voice user interface comprises selecting a word that is within a recognition grammar to be output by the first voice signals.
-
123. The computer-readable medium as recited in claim 121 wherein the controlling the voice user interface comprises controlling the selection of the first voice signals output by the voice user interface such that the voice user interface addresses a user by a proper name of the user.
-
124. The computer-readable medium as recited in claim 121 wherein the speech signals comprise user commands spoken politely.
-
125. The computer-readable medium as recited in claim 105 wherein the controlling the voice user interface comprises controlling the voice user interface in situations in which negative comments are needed.
-
126. The computer-readable medium as recited in claim 125 wherein the controlling the voice user interface comprises selecting a negative prompt, and outputting a recording of the negative prompt at a lower volume.
-
127. The computer-readable medium as recited in claim 105 wherein the controlling the voice user interface comprises selecting a prompt based on a user's experience with using the voice user interface during a current session and across sessions.
-
128. The computer-readable medium as recited in claim 127 wherein the controlling the voice user interface comprises selecting a shorter prompt based on the user's experience with using the voice user interface during the current session and across sessions.
-
129. The computer-readable medium as recited in claim 127 wherein the controlling the voice user interface comprises selecting a longer help prompt if the user's input indicates a problem with increasing frequency during the current session and across sessions.
-
130. The computer-readable medium as recited in claim 127 wherein the controlling the voice user interface comprises selecting a prompt that lists the available options to the user if the voice user interface does not recognize a command spoken by the user or if the user has not spoken for a defined period of time.
-
131. The computer-readable medium as recited in claim 127 wherein the controlling the voice user interface comprises selecting a coaching prompt that provides a current state of interaction, lists commands that the user can say at the current state of interaction, and lists the actions that would be taken in response to each of the commands.
-
132. The computer-readable medium as recited in claim 105 wherein the controlling the voice user interface comprises selecting an approximation prompt.
-
133. The computer-readable medium as recited in claim 105 wherein the controlling the voice user interface comprises providing the voice user interface with a first personality and a second personality.
-
134. The computer-readable medium as recited in claim 133 wherein the first voice signals comprise the voice of the first personality, and second voice signals comprise the voice of the second personality.
-
135. The computer-readable medium as recited in claim 133 wherein a first virtual assistant comprises the first personality, and a second virtual assistant comprises the second personality.
-
136. The computer-readable medium as recited in claim 135 wherein the first virtual assistant and the second virtual assistant comprise different genders.
-
137. The computer-readable medium as recited in claim 135 wherein a user selects the first virtual assistant or the second virtual assistant based on descriptions of the first virtual assistant and the second virtual assistant or based on interacting with the first virtual assistant and the second virtual assistant.
-
138. The computer-readable medium as recited in claim 105 wherein the voice user interface with personality comprises selecting a humorous prompt.
-
139. The computer-readable medium as recited in claim 105 wherein the voice user interface with personality comprises a virtual assistant for a voice-based desktop environment.
-
140. The computer-readable medium as recited in claim 139 wherein the desktop environment comprises multiple objects, the virtual assistant being navigated among the multiple objects by a user.
-
141. The computer-readable medium as recited in claim 105 further comprising instructions for:
storing a recognition grammar in a memory, the recognition grammar comprising multiple phrases that the voice user interface can recognize when spoken by a user, and the grammar being selected based on the personality of the voice user interface.
-
142. The computer-readable medium as recited in claim 105 wherein the voice user interface comprises:
echo cancellation software, barge-in software, signal processing software, automatic speech recognition/natural language software, request for services software, and text-to-speech/recorded speech software.
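The coaching prompt of claim 131 and the options prompt of claim 130 can be illustrated with a small sketch. The `STATES` table, its wording, and both function names are assumptions made for this example; the claims do not prescribe any particular data structure.

```python
# Hypothetical dialog states: state name -> {command: resulting action}.
STATES = {
    "inbox": {
        "next": "read the next message",
        "delete": "delete this message",
        "go back": "return to the main menu",
    },
}


def coaching_prompt(state: str) -> str:
    """Build a prompt that gives the current state, the commands the user
    can say there, and the action taken in response to each command."""
    parts = [f"You are in the {state}."]
    for command, action in STATES[state].items():
        parts.append(f"Say '{command}' to {action}.")
    return " ".join(parts)


def options_prompt(state: str) -> str:
    """Build a shorter prompt that only lists the available options, for use
    after an unrecognized command or a period of silence."""
    return "You can say: " + ", ".join(STATES[state]) + "."
```

Here `coaching_prompt("inbox")` names the state and pairs each command with its effect, while `options_prompt("inbox")` only lists the commands.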
Specification