Device Selection for Providing a Response

US 20180210703A1
Filed: 01/22/2018
Published: 07/26/2018
Est. Priority Date: 09/21/2015
Status: Active Grant

First Claim

Patent Images

1. A system, comprising;

a first speech processing pipeline instance that receives a first audio signal from a first speech interface device, the first audio signal representing a speech utterance, the first speech processing pipeline instance also receiving a first timestamp indicating a first time at which a wakeword was detected by the first speech interface device;

a second speech processing pipeline instance that receives a second audio signal from a second speech interface device, the second audio signal representing the speech utterance, the second speech processing pipeline also receiving a second timestamp indicating a second time at which the wakeword was detected by the second speech interface device;

the first speech processing pipeline instance having a series of processing components comprising;

an automatic speech recognition (ASR) component configured to analyze the first audio signal to determine words of the speech utterance;

a natural language understanding (NLU) component positioned in the first speech processing pipeline instance after the ASR component, the NLU component being configured to analyze the words of the speech utterance to determine an intent expressed by the speech utterance;

a response dispatcher positioned in the first speech processing pipeline instance after the NLU component, the response dispatcher being configured to specify a speech response to the speech utterance;

a first source arbiter positioned in the first speech processing pipeline instance before the ASR component, the first source arbiter being configured to determine (a) that an amount of time represented by a difference between the first timestamp and the second timestamp is less than a threshold;

(b) to determine that the first timestamp is greater than the second timestamp; and

(c) to abort the first speech processing pipeline instance.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A system may use multiple speech interface devices to interact with a user by speech. All or a portion of the speech interface devices may detect a user utterance and may initiate speech processing to determine a meaning or intent of the utterance. Within the speech processing, arbitration is employed to select one of the multiple speech interface devices to respond to the user utterance. Arbitration may be based in part on metadata that directly or indirectly indicates the proximity of the user to the devices, and the device that is deemed to be nearest the user may be selected to respond to the user utterance.

42 Citations

View as Search Results

1 Claim

1. A system, comprising;
- a first speech processing pipeline instance that receives a first audio signal from a first speech interface device, the first audio signal representing a speech utterance, the first speech processing pipeline instance also receiving a first timestamp indicating a first time at which a wakeword was detected by the first speech interface device;
  
  a second speech processing pipeline instance that receives a second audio signal from a second speech interface device, the second audio signal representing the speech utterance, the second speech processing pipeline also receiving a second timestamp indicating a second time at which the wakeword was detected by the second speech interface device;
  
  the first speech processing pipeline instance having a series of processing components comprising;
  
  an automatic speech recognition (ASR) component configured to analyze the first audio signal to determine words of the speech utterance;
  
  a natural language understanding (NLU) component positioned in the first speech processing pipeline instance after the ASR component, the NLU component being configured to analyze the words of the speech utterance to determine an intent expressed by the speech utterance;
  
  a response dispatcher positioned in the first speech processing pipeline instance after the NLU component, the response dispatcher being configured to specify a speech response to the speech utterance;
  
  a first source arbiter positioned in the first speech processing pipeline instance before the ASR component, the first source arbiter being configured to determine (a) that an amount of time represented by a difference between the first timestamp and the second timestamp is less than a threshold;
  
  (b) to determine that the first timestamp is greater than the second timestamp; and
  
  (c) to abort the first speech processing pipeline instance.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Amazon Technologies, Inc. (Amazon.com, Inc.)
Original Assignee
Amazon Technologies, Inc. (Amazon.com, Inc.)
Inventors
Meyers, James David, Pravinchandra, Shah Samir, Liu, Yue, Dean, Arlen, Miller, Daniel, Mandal, Arindam

Granted Patent

US 11,922,095 B2
Time in Patent Office

Days
Field of Search
US Class Current
CPC Class Codes

G06F 3/167   Audio in a user interface, ...

G10L 15/00   Speech recognition G10L17/0...

G10L 15/063   Training

G10L 15/1815   Semantic context, e.g. disa...

G10L 15/22   Procedures used during a sp...

G10L 15/222   Barge in, i.e. overridable ...

G10L 15/26   Speech to text systems G10L...

G10L 15/32   Multiple recognisers used i...

G10L 2015/088   Word spotting

G10L 2015/223   Execution procedure of a sp...

G10L 2015/226   using non-speech characteri...

Device Selection for Providing a Response

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

42 Citations

1 Claim

Specification

Solutions

Use Cases

Quick Links

Device Selection for Providing a Response

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

42 Citations

1 Claim

Specification

Subscription Required

Solutions

Use Cases

Quick Links