Contextual voice user interface

  • US 10,446,147 B1
  • Filed: 06/27/2017
  • Issued: 10/15/2019
  • Est. Priority Date: 06/27/2017
  • Status: Active Grant
First Claim

1. A computer-implemented method comprising:

  • during a first time period at one or more remote devices:

    receiving, from a device, first input audio data corresponding to a first utterance;

    generating an identifier;

    associating the identifier with the first utterance;

    performing speech recognition processing on the first input audio data to generate first text data;

    associating the first text data with the identifier;

    performing natural language processing on the first text data to determine a first intent corresponding to the first utterance;

    associating the first intent with the identifier;

    performing natural language processing on the first text data to determine at least a portion of the first text data that potentially corresponds to an entity;

    associating, with the identifier, the at least a portion of the first text data and an indication of the entity;

    determining an application associated with the first intent;

    associating application data representing the application with the identifier;

    sending, to a remote device associated with the application, a signal requesting content responsive to the first utterance;

    receiving, from the remote device, content data representing the content; and

    causing the device to emit the content data; and

  • during a second time period subsequent to the first time period at the one or more remote devices:

    receiving, from the device, second input audio data corresponding to a second utterance;

    performing speech recognition processing on the second input audio data to generate second text data;

    performing natural language processing on the second text data to determine a second intent corresponding to the second utterance, the second intent being to determine an explanation for processing of the first utterance and to receive previous speech processing results corresponding to the first utterance;

    determining the identifier associated with the first utterance;

    determining, based on the identifier, at least one of the first text data, the first intent, the at least a portion of the first text data, the indication of the entity, or the application data;

    determining an output data format associated with the second intent;

    generating output data using the output data format, wherein the output data includes the first text data and at least one of the first intent, the indication of the entity, or the application data with at least a first portion of the output data format; and

    sending the output data to the device.
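
The two time periods recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation: every name here (UtteranceRecord, SpeechService, recognize_speech, understand, the intent labels, the explanation template, and the fetch_content callback) is a hypothetical stand-in, speech recognition is faked by treating the audio bytes as a transcript, and natural language processing is reduced to prefix matching. The sketch shows only the bookkeeping: results from the first utterance are stored under a generated identifier, and a later "explain" utterance looks them up and fills an output format tied to that second intent.

```python
import uuid
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class UtteranceRecord:
    """Speech processing results associated with one identifier (hypothetical)."""
    identifier: str
    text: Optional[str] = None
    intent: Optional[str] = None
    entity_text: Optional[str] = None
    entity_type: Optional[str] = None
    application: Optional[str] = None


def recognize_speech(audio: bytes) -> str:
    # Stand-in for speech recognition: pretend the audio is already a transcript.
    return audio.decode("utf-8")


def understand(text: str) -> dict:
    # Stand-in for natural language processing: simple prefix matching.
    lowered = text.lower()
    if lowered.startswith("why did you"):
        return {"intent": "ExplainPrevious"}
    if lowered.startswith("play "):
        return {"intent": "PlayMusic", "entity_text": text[5:], "entity_type": "SongName"}
    return {"intent": "Unknown"}


# Assumed lookup tables: intent -> application, and intent -> output data format.
APPLICATIONS = {"PlayMusic": "music_app"}
OUTPUT_FORMATS = {
    "ExplainPrevious": (
        'I heard "{text}". I interpreted the intent as {intent}, '
        'treated "{entity_text}" as a {entity_type}, '
        "and sent the request to {application}."
    )
}


class SpeechService:
    def __init__(self, fetch_content: Callable[[Optional[str], str], str]):
        self.records: Dict[str, UtteranceRecord] = {}
        self.last_id_by_device: Dict[str, str] = {}
        self.fetch_content = fetch_content  # stands in for the application's remote device

    def handle(self, device_id: str, audio: bytes) -> str:
        # Generate an identifier, then attach each processing result to it.
        record = UtteranceRecord(identifier=str(uuid.uuid4()))
        record.text = recognize_speech(audio)
        nlu = understand(record.text)
        record.intent = nlu["intent"]
        if record.intent == "ExplainPrevious":
            return self._explain(device_id, record.intent)
        record.entity_text = nlu.get("entity_text")
        record.entity_type = nlu.get("entity_type")
        record.application = APPLICATIONS.get(record.intent)
        content = self.fetch_content(record.application, record.text)
        self.records[record.identifier] = record
        self.last_id_by_device[device_id] = record.identifier
        return content

    def _explain(self, device_id: str, intent: str) -> str:
        # Second time period: find the earlier identifier and fill the output format.
        identifier = self.last_id_by_device.get(device_id)
        if identifier is None:
            return "There is no earlier request to explain."
        previous = self.records[identifier]
        return OUTPUT_FORMATS[intent].format(
            text=previous.text,
            intent=previous.intent,
            entity_text=previous.entity_text,
            entity_type=previous.entity_type,
            application=previous.application,
        )


service = SpeechService(fetch_content=lambda app, text: f"[{app}] now handling: {text}")
first = service.handle("device-1", b"play Yesterday")
second = service.handle("device-1", b"why did you do that")
```

In this sketch the "most recent utterance per device" lookup is one assumed way of determining the identifier associated with the first utterance; the claim does not say how that determination is made.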
