Synthesized voice selection for computational agents

US 10,311,856 B2
Filed: 11/16/2017
Issued: 06/04/2019
Est. Priority Date: 10/03/2016
Status: Active Grant

First Claim

Patent Images

1. A method comprising:

receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device;

selecting, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents;

responsive to determining that the selected agent comprises a first party agent, selecting a reserved voice from a plurality of voices, wherein the reserved voice is associated with the first party agent;

obtaining, by the first party agent and based on the utterance, a plurality of search results, including a first sub-set of the search results and a second sub-set of the search results; and

outputting, for playback by one or more speakers of the computing device, to satisfy the utterance;

synthesized audio data, of the first party agent, that represents the first sub-set of the search results using the selected reserved voice, andsynthesized audio data, of the first party agent, that represents the second sub-set of the search results using an additional voice from the plurality of voices, wherein the additional voice is distinct from the selected reserved voice.

View all claims

3 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

An example method includes receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; selecting, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents; responsive to determining that the selected agent comprises a first party agent, selecting a reserved voice from a plurality of voices; and outputting synthesized audio data using the selected voice to satisfy the utterance.

16 Citations

View as Search Results

15 Claims

1. A method comprising:
- receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device;
  
  selecting, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents;
  
  responsive to determining that the selected agent comprises a first party agent, selecting a reserved voice from a plurality of voices, wherein the reserved voice is associated with the first party agent;
  
  obtaining, by the first party agent and based on the utterance, a plurality of search results, including a first sub-set of the search results and a second sub-set of the search results; and
  
  outputting, for playback by one or more speakers of the computing device, to satisfy the utterance;
  
  synthesized audio data, of the first party agent, that represents the first sub-set of the search results using the selected reserved voice, andsynthesized audio data, of the first party agent, that represents the second sub-set of the search results using an additional voice from the plurality of voices, wherein the additional voice is distinct from the selected reserved voice.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
- - 2. The method of claim 1, wherein the utterance comprises a first utterance, the method further comprising:
    - receiving a representation of a second utterance spoken at the computing device;
      
      selecting, based on the second utterance, a second agent from the plurality of agents;
      
      responsive to determining that the selected second agent comprises a third-party agent, selecting a voice from the plurality of voices other than the reserved voice; and
      
      outputting synthesized audio data using the selected voice to satisfy the second utterance.
  - 3. The method of claim 2, wherein selecting the second agent from the plurality of agents comprises:
    - determining the second utterance includes one or more tasks to be performed by at least one of the plurality of agents;
      
      determining a capability level for each of the plurality of agents to perform one or more of the tasks to be performed by the at least one of the plurality of agents; and
      
      responsive to determining that the capability level for each of the plurality agents, selecting the second agent from the plurality of agents based on the second agent having the highest capability level.
  - 4. The method of claim 2, further comprising:
    - determining the second utterance includes a multi-element task to be performed by at least one of the plurality of agents, wherein the multi-element task includes at least a first sub-set of elements and a second sub-set of elements;
      
      performing, by the selected second agent, the first sub-set of elements of the multi-element task;
      
      determining the selected second agent cannot perform the second sub-set of elements of the multi-element task;
      
      selecting an additional agent to perform the second sub-set of elements of the multi-element task based on determining the additional agent is capable of performing the second sub-set of elements of the multi-element task; and
      
      performing, by the additional agent, the second sub-set of elements of the multi-element task.
  - 5. The method of claim 1, wherein the one or more processors are included in the computing device.
  - 6. The method of claim 1, wherein the one or more processors are included in a computing system.
  - 7. The method of claim 1, further comprising:
    - subsequent to outputting both the synthesized audio data that represents the first sub-set of the search results and the synthesized audio data that represents the second sub-set of the search results;
      
      outputting, by one or more of the speakers of the computing device, a request for feedback, from a user of the computing device, about the second sub-set of search results; and
      
      in response to outputting the request for feedback, receiving a representation of a user sentiment toward the second sub-set of search results.
  - 8. The method of claim 7, further comprising:
    - based on the user sentiment toward the second sub-set of search results, adjusting a ranking of the second sub-set of search results.
  - 9. The method of claim 1, wherein the outputting to satisfy the utterance further comprises:
    - outputting, by one or more user interfaces of the computing device, an indication of the first sub-set of the search results in a first font; and
      
      outputting, by one or more of the user interfaces of the computing device, an indication of the second sub-set of the search results in a second font.

10. A computing device comprising:
- at least one processor; and
  
  at least one memory comprising instructions that when executed, cause the at least one processor to execute an assistant configured to;
  
  receive, from one or more microphones operably connected to the computing device, a representation of an utterance spoken at the computing device;
  
  select, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents, the memory further comprising instructions that when executed, cause the at least one processor to;
  
  select, in response to determining that the selected agent comprises a first party agent, a reserved voice from a plurality of voices, wherein the reserved voice is associated with the first party agent;
  
  obtain, by the first party agent and based on the utterance, a plurality of search results, including a first sub-set of the search results and a second sub-set of the search results; and
  
  output, for playback by one or more speakers operably connected to the computing device, to satisfy the utterance;
  
  synthesized audio data, of the first party agent, that represents the first sub-set of the search results using the selected reserved voice, andsynthesized audio data, of the first party agent, that represents the second sub-set of the search results using an additional voice from the plurality of voices, wherein the additional voice is distinct from the selected reserved voice.
- View Dependent Claims (11)
- - 11. The device of claim 10, wherein the utterance comprises a first utterance, the assistant further configured to:
    - receive a representation of a second utterance spoken at the computing device;
      
      select, based on the second utterance, a second agent from the plurality of agents;
      
      select, responsive to determining that the selected second agent comprises a third-party agent, a voice from the plurality of voices other than the reserved voice; and
      
      output synthesized audio data using the selected voice to satisfy the second utterance.

12. A computing system comprising:
- one or more communication units;
  
  at least one processor; and
  
  at least one memory comprising instructions that when executed, cause the at least one processor to execute an assistant configured to;
  
  receive, from a computing device and via the one or more communication units, a representation of an utterance spoken at the computing device; and
  
  select, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents, the memory further comprising instructions that when executed, cause the at least one processor to;
  
  select, in response to determining that the selected agent comprises a first party agent, a reserved voice from a plurality of voices, wherein the reserved voice is associated with the first party agent;
  
  obtain, by the first party agent and based on the utterance, a plurality of search results, including a first sub-set of the search results and a second sub-set of the search results; and
  
  output, for playback, to satisfy the utterance;
  
  synthesized audio data, of the first party agent, that represents the first sub-set of the search results using the selected reserved voice, andsynthesized audio data, of the first party agent, that represents the second sub-set of the search results using an additional voice from the plurality of voices, wherein the additional voice is distinct from the selected reserved voice.
- View Dependent Claims (13)
- - 13. The system of claim 12, wherein the utterance comprises a first utterance, the assistant further configured to:
    - receive a representation of a second utterance spoken at the computing device;
      
      select, based on the second utterance, a second agent from the plurality of agents;
      
      select, responsive to determining that the selected second agent comprises a third-party agent, a voice from the plurality of voices other than the reserved voice; and
      
      output synthesized audio data using the selected voice to satisfy the second utterance.

14. A non-transitory computer-readable storage medium storing instructions that, when executed, cause one or more processors to execute an assistant configured to:
- receive a representation of an utterance spoken at a computing device;
  
  select, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents, the storage medium further comprising instructions that when executed, cause the one or more processors to;
  
  select, in response to determining that the selected agent comprises a first party agent, a reserved voice from a plurality of voices, wherein the reserved voice is associated with the first party agent;
  
  obtain, by the first agent and based on the utterance, a plurality of search results, including a first sub-set of the search results and a second sub-set of the search results; and
  
  output, for playback, to satisfy the utterance;
  
  synthesized audio data, of the first party agent, that represents the first sub-set of the search results using the selected reserved voice, andsynthesized audio data, of the first party agent, that represents the second sub-set of the search results using an additional voice from the plurality of voices, wherein the additional voice is distinct from the selected reserved voice.
- View Dependent Claims (15)
- - 15. The non-transitory computer-readable storage medium of claim 14, wherein the utterance comprises a first utterance, the assistant further configured to:
    - receive a representation of a second utterance spoken at the computing device;
      
      select, based on the second utterance, a second agent from the plurality of agents;
      
      select, responsive to determining that the selected second agent comprises a third-party agent, a voice from the plurality of voices other than the reserved voice; and
      
      output synthesized audio data using the selected voice to satisfy the second utterance.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Google LLC (Alphabet Inc.)
Original Assignee
Google LLC (Alphabet Inc.)
Inventors
Nygaard, Valerie, Caprita, Bogdan, Stets, Robert, Krishnakumaran, Saisuresh, Douglas, Jason Brant
Primary Examiner(s)
Pullias, Jesse S

Application Number

US15/815,375
Publication Number

US 20180096675A1
Time in Patent Office

565 Days
Field of Search

704257-275
US Class Current
CPC Class Codes

G06F 16/951   Indexing; Web crawling tech...

G06F 2209/5017   Task decomposition

G06F 3/167   Audio in a user interface, ...

G06F 9/5027   the resource being a machin...

G10L 13/00   Speech synthesis; Text to s...

G10L 13/04   Details of speech synthesis...

G10L 13/08   Text analysis or generation...

G10L 15/22   Procedures used during a sp...

G10L 15/26   Speech to text systems G10L...

G10L 2015/223   Execution procedure of a sp...

Y02D 10/00   Energy efficient computing,...

Synthesized voice selection for computational agents

First Claim

3 Assignments

0 Petitions

Accused Products

Abstract

16 Citations

15 Claims

Specification

Solutions

Use Cases

Quick Links

Synthesized voice selection for computational agents

First Claim

3 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

16 Citations

15 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links