×

Localized speech recognition with offload

  • US 8,880,398 B1
  • Filed: 01/21/2013
  • Issued: 11/04/2014
  • Est. Priority Date: 07/13/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • receiving, by a local computing device, an utterance from a user device, wherein the user device and the local computing device are part of a local network;

    in response to receiving the utterance, obtaining a text string transcription of the utterance from an automatic speech recognition (ASR) module of the local computing device, and selecting a response mode for the utterance from among a text-based response mode and a non-text-based response mode, wherein obtaining the text string transcription of the utterance comprises transcribing, by the ASR module of the local computing device, the utterance into the text string transcription, wherein the text string transcription includes a representation of the utterance, and wherein transcribing the utterance into the text string transcription comprises determining that the utterance matches a speaker adaptation profile, applying speaker adaptation parameters to the utterance, and updating the speaker adaptation parameters based at least on characteristics of the utterance, wherein the speaker adaptation parameters are associated with the speaker adaptation profile, and wherein the text string transcription is based on the speaker adaptation parameters;

    if the selected response mode is the text-based response mode, providing, by the local computing device, the text string transcription to a target device;

    if the selected response mode is the non-text-based response mode, (i) converting the text string transcription into one or more non-text, device-executable commands from a non-text, device-executable command set supported by the target device, and (ii) providing, by the local computing device, the one or more non-text, device-executable commands to the target device;

    receiving, by the local computing device, a second utterance from the user device; and

    in response to receiving the second utterance, obtaining a second text string transcription of the second utterance, wherein the second text string transcription is based on the updated speaker adaptation profile.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×