Synthesized voice selection for computational agents
First Claim
Patent Images
1. A method comprising:
- receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device;
selecting, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents;
responsive to determining that the selected agent comprises a first party agent, selecting a reserved voice from a plurality of voices, wherein the reserved voice is associated with the first party agent;
obtaining, by the first party agent and based on the utterance, a plurality of search results, including a first sub-set of the search results and a second sub-set of the search results; and
outputting, for playback by one or more speakers of the computing device, to satisfy the utterance;
synthesized audio data, of the first party agent, that represents the first sub-set of the search results using the selected reserved voice, andsynthesized audio data, of the first party agent, that represents the second sub-set of the search results using an additional voice from the plurality of voices, wherein the additional voice is distinct from the selected reserved voice.
3 Assignments
0 Petitions
Accused Products
Abstract
An example method includes receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; selecting, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents; responsive to determining that the selected agent comprises a first party agent, selecting a reserved voice from a plurality of voices; and outputting synthesized audio data using the selected voice to satisfy the utterance.
16 Citations
15 Claims
-
1. A method comprising:
-
receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; selecting, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents; responsive to determining that the selected agent comprises a first party agent, selecting a reserved voice from a plurality of voices, wherein the reserved voice is associated with the first party agent; obtaining, by the first party agent and based on the utterance, a plurality of search results, including a first sub-set of the search results and a second sub-set of the search results; and outputting, for playback by one or more speakers of the computing device, to satisfy the utterance; synthesized audio data, of the first party agent, that represents the first sub-set of the search results using the selected reserved voice, and synthesized audio data, of the first party agent, that represents the second sub-set of the search results using an additional voice from the plurality of voices, wherein the additional voice is distinct from the selected reserved voice. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computing device comprising:
-
at least one processor; and at least one memory comprising instructions that when executed, cause the at least one processor to execute an assistant configured to; receive, from one or more microphones operably connected to the computing device, a representation of an utterance spoken at the computing device; select, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents, the memory further comprising instructions that when executed, cause the at least one processor to; select, in response to determining that the selected agent comprises a first party agent, a reserved voice from a plurality of voices, wherein the reserved voice is associated with the first party agent; obtain, by the first party agent and based on the utterance, a plurality of search results, including a first sub-set of the search results and a second sub-set of the search results; and output, for playback by one or more speakers operably connected to the computing device, to satisfy the utterance; synthesized audio data, of the first party agent, that represents the first sub-set of the search results using the selected reserved voice, and synthesized audio data, of the first party agent, that represents the second sub-set of the search results using an additional voice from the plurality of voices, wherein the additional voice is distinct from the selected reserved voice. - View Dependent Claims (11)
-
-
12. A computing system comprising:
-
one or more communication units; at least one processor; and at least one memory comprising instructions that when executed, cause the at least one processor to execute an assistant configured to; receive, from a computing device and via the one or more communication units, a representation of an utterance spoken at the computing device; and select, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents, the memory further comprising instructions that when executed, cause the at least one processor to; select, in response to determining that the selected agent comprises a first party agent, a reserved voice from a plurality of voices, wherein the reserved voice is associated with the first party agent; obtain, by the first party agent and based on the utterance, a plurality of search results, including a first sub-set of the search results and a second sub-set of the search results; and output, for playback, to satisfy the utterance; synthesized audio data, of the first party agent, that represents the first sub-set of the search results using the selected reserved voice, and synthesized audio data, of the first party agent, that represents the second sub-set of the search results using an additional voice from the plurality of voices, wherein the additional voice is distinct from the selected reserved voice. - View Dependent Claims (13)
-
-
14. A non-transitory computer-readable storage medium storing instructions that, when executed, cause one or more processors to execute an assistant configured to:
-
receive a representation of an utterance spoken at a computing device; select, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents, the storage medium further comprising instructions that when executed, cause the one or more processors to; select, in response to determining that the selected agent comprises a first party agent, a reserved voice from a plurality of voices, wherein the reserved voice is associated with the first party agent; obtain, by the first agent and based on the utterance, a plurality of search results, including a first sub-set of the search results and a second sub-set of the search results; and output, for playback, to satisfy the utterance; synthesized audio data, of the first party agent, that represents the first sub-set of the search results using the selected reserved voice, and synthesized audio data, of the first party agent, that represents the second sub-set of the search results using an additional voice from the plurality of voices, wherein the additional voice is distinct from the selected reserved voice. - View Dependent Claims (15)
-
Specification