Low-latency intelligent automated assistant

  • US 10,789,945 B2
  • Filed: 08/17/2017
  • Issued: 09/29/2020
  • Est. Priority Date: 05/12/2017
  • Status: Active Grant
  • ×
    • Pin Icon | RPX Insight
    • Pin
First Claim
Patent Images

1. An electronic device, comprising:

  • one or more processors; and

    memory storing one or more programs configured to be executed by the one or more processors, the one or more programs including instructions for;

    receiving a stream of audio, comprising;

    receiving, from a first time to a second time, a first portion of the stream of audio containing at least a portion of a user utterance, wherein one or more candidate text representations are determined based on the at least a portion of the user utterance while receiving the first portion of the stream of audio; and

    receiving, from the second time to a third time, a second portion of the stream of audio, wherein the electronic device stops receiving the stream of audio at the third time;

    after determining the one or more candidate text representations, determining whether the first portion of the stream of audio satisfies a predetermined condition;

    in response to determining that the first portion of the stream of audio satisfies the predetermined condition, performing, at least partially between the second time and the third time, operations comprising;

    determining, based on the one or more candidate text representations of the at least a portion of the user utterance, a plurality of candidate user intents for the at least a portion of the user utterance, wherein each candidate user intent of the plurality of candidate user intents corresponds to a respective candidate task flow of a plurality of candidate task flows;

    selecting a first candidate task flow of the plurality of candidate task flows; and

    executing the first candidate task flow without providing an output to a user of the device;

    determining whether a speech end-point condition is detected between the second time and the third time; and

    in response to determining that a speech end-point condition is detected between the second time and the third time, presenting, to the user, results from executing the selected first candidate task flow.

View all claims