×

In-call virtual assistants

  • US 10,134,395 B2
  • Filed: 09/25/2013
  • Issued: 11/20/2018
  • Est. Priority Date: 09/25/2013
  • Status: Active Grant
First Claim
Patent Images

1. A system comprising:

  • one or more processors; and

    one or more computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising;

    receiving an indication that a first device of a first user and a second device of a second user are exchanging first voice-communication data;

    establishing a connection between the first device, the second device, and a computing device hosting at least a portion of a virtual assistant based at least in part on the indication;

    receiving, by the computing device, the first voice-communication data exchanged between the first device and the second device, the computing device being remote from both the first device and the second device, wherein the computing device is configured to communicate with the first device over one or more networks and configured to communicate with the second device over the one or more networks;

    performing speech recognition on a first part of a first audio signal to generate first text, the first audio signal representing first audio of the first voice-communication data exchanged between the first device and the second device;

    identifying a predefined utterance in the first text;

    invoking the virtual assistant based at least in part on identifying the predefined utterance;

    receiving identity information associated with the first device;

    transmitting, to the first device and at least partially in response to invoking the virtual assistant, information indicating that second voice-communication data will not be transmitted to the second device;

    transmitting, to the first device, a request for a password, the request based at least in part on the identity information;

    receiving the second voice-communication data from the first device including a representation of the password;

    preventing, based at least in part on the request for the password, the second voice-communication data from being transmitted to the second device;

    determining that the representation of the password is associated with an identity of the first user;

    receiving third voice-communication data exchanged between the first device and the second device, the third voice-communication data received subsequent to the first voice-communication data and the second voice-communication data;

    at least partly in response to identifying the predefined utterance, performing speech recognition on a second audio signal to generate second text, the second audio signal representing second audio of the third voice-communication data exchanged between the first device and the second device;

    identifying a voice command in the second text, the voice command being separate from and occurring after the predefined utterance;

    performing a task corresponding to the voice command at least partly in response to identifying the voice command; and

    sending an output audio signal to at least one of the first device or the second device, the output audio signal configured to cause audible output associated with the performing of the task on at least one of the first device or the second device.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×