In-Call Virtual Assistants
First Claim
Patent Images
1. A system comprising:
- one or more processors; and
one or more computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising;
receiving, from a first user during a voice communication established between a device of the first user and a device of a second user, a request to invoke a virtual assistant during the voice communication, the virtual assistant performing speech recognition on an audio signal representing audio of the voice communication between the first and second users upon invocation, the performing of the speech recognition for identifying a voice command from at least one of the first user or the second user;
performing speech recognition on the audio signal representing the voice communication between the first and second users for identifying a voice command at least partly in response to receiving the request;
identifying a voice command from the audio of the voice communication between the first and second users responsive to the performing of the speech recognition;
performing a task corresponding to the voice command at least partly in response to the identifying of the voice command; and
providing an output audio signal to at least one of the device of the first user or the device of the second user during the voice communication, the output audio signal configured to cause audible output associated with the performing of the task on at least one of the device of the user or the device of the second user.
2 Assignments
0 Petitions
Accused Products
Abstract
Techniques for providing virtual assistants to assist users during a voice communication between the users. For instance, a first user operating a device may establish a voice communication with respective devices of one or more additional users, such as with a device of a second user. For instance, the first user may utilize her device to place a telephone call to the device of the second user. A virtual assistant may also join the call and, upon invocation by a user on the call, may identify voice commands from the call and may perform corresponding tasks for the users in response.
307 Citations
24 Claims
-
1. A system comprising:
-
one or more processors; and one or more computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising; receiving, from a first user during a voice communication established between a device of the first user and a device of a second user, a request to invoke a virtual assistant during the voice communication, the virtual assistant performing speech recognition on an audio signal representing audio of the voice communication between the first and second users upon invocation, the performing of the speech recognition for identifying a voice command from at least one of the first user or the second user; performing speech recognition on the audio signal representing the voice communication between the first and second users for identifying a voice command at least partly in response to receiving the request; identifying a voice command from the audio of the voice communication between the first and second users responsive to the performing of the speech recognition; performing a task corresponding to the voice command at least partly in response to the identifying of the voice command; and providing an output audio signal to at least one of the device of the first user or the device of the second user during the voice communication, the output audio signal configured to cause audible output associated with the performing of the task on at least one of the device of the user or the device of the second user. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A method comprising:
-
receiving a request to invoke a virtual assistant during a voice communication between a device of a first user and a device of a second user; performing speech recognition on an audio signal representing audio of the voice communication at least partly in response to the receiving; responsive to the performing of the speech recognition, identifying a voice command from at least one of the first user or the second user; and providing, by the virtual assistant, an output audio signal to at least one of the device of the first user or the device of the second user, the output audio signal for outputting audible content during the voice communication. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. One or more computer-readable media storing computer-executable instructions that, when executed by one or more processors, cause the one or more processors to perform acts comprising:
-
joining a computing device to a voice communication between two user devices, the computing device being remote from the two user devices; upon a user of one of the two user devices invoking the computing device, performing speech recognition by the computing device on an audio signal representing audio of the voice communication; and identifying a voice command from a user of one of the two devices responsive to performing the speech recognition on the audio signal representing the audio. - View Dependent Claims (20, 21, 22, 23, 24)
-
Specification