Voice controlled assistant with non-verbal user input
First Claim
Patent Images
1. A device comprising:
- a housing comprising at least one side and a top surface, the top surface comprising a first portion associated with a first zone and a second portion associated with a second zone;
one or more processors,memory;
one or more speakers configured to output an audible prompt to enter an input via one or more taps on the top surface of the device;
one or more microphones configured to capture audio and generate an audio signal based on the captured audio, the audio signal representing a series of taps, the series of taps comprising, in order, a first tap on the top surface of the housing and a second tap on the top surface of the housing, the first tap being at a first location in the first portion and the second tap being at a second location in the second portion;
a pattern recognition module, stored in the memory and executable by the one or more processors to analyze the audio signal generated by the one or more microphones to;
detect, within the audio signal, the first tap and the second tap,determine the first location of the first tap and the second location of the second tap,compare a tap sequence corresponding to the first tap at the first location and the second tap at the second tap location with a predetermined tap sequence associated with the input, anddetermine that the tap sequence corresponds to the predetermined tap sequence; and
a light indicator configured to emit light from the housing proximate the top surface of the housing, the light indicator being controlled to provide;
at least in part in response to determining the first location by analyzing the audio signal, a first visual cue at the first zone at a first time, and at least in part in response to determining the second location by analyzing the audio signal, a second visual cue at the second zone at a second time subsequent to the first time.
2 Assignments
0 Petitions
Accused Products
Abstract
A voice controlled assistant having a housing to hold one or more microphones, one or more speakers, and various computing components. The voice controlled assistant facilitates transactions and other functions primarily through verbal interactions with a user. In some situations, a transaction may require entry of a code, which the user may wish to enter in a non-verbal way. The voice controlled assistant is configured to analyze an audio signal to detect user interactions with the surface of the voice controlled assistant and to interpret the detected interactions as entry of the code.
27 Citations
25 Claims
-
1. A device comprising:
-
a housing comprising at least one side and a top surface, the top surface comprising a first portion associated with a first zone and a second portion associated with a second zone; one or more processors, memory; one or more speakers configured to output an audible prompt to enter an input via one or more taps on the top surface of the device; one or more microphones configured to capture audio and generate an audio signal based on the captured audio, the audio signal representing a series of taps, the series of taps comprising, in order, a first tap on the top surface of the housing and a second tap on the top surface of the housing, the first tap being at a first location in the first portion and the second tap being at a second location in the second portion; a pattern recognition module, stored in the memory and executable by the one or more processors to analyze the audio signal generated by the one or more microphones to; detect, within the audio signal, the first tap and the second tap, determine the first location of the first tap and the second location of the second tap, compare a tap sequence corresponding to the first tap at the first location and the second tap at the second tap location with a predetermined tap sequence associated with the input, and determine that the tap sequence corresponds to the predetermined tap sequence; and a light indicator configured to emit light from the housing proximate the top surface of the housing, the light indicator being controlled to provide; at least in part in response to determining the first location by analyzing the audio signal, a first visual cue at the first zone at a first time, and at least in part in response to determining the second location by analyzing the audio signal, a second visual cue at the second zone at a second time subsequent to the first time. - View Dependent Claims (2, 3, 4, 5, 19, 20, 25)
-
-
6. A method comprising:
under control of a computing device configured with executable instructions, outputting an audible prompt to enter a code via a series of taps on a surface of the computing device; illuminating a first visual designation on the surface of the computing device and a second visual designation on the surface of the computing device, the first visual designation identifying a first zone on the surface and the second visual designation identifying a second zone on the surface; generating an audio signal from sound captured from an environment by one or more microphones; identifying a first tap within the audio signal, the first tap corresponding to a first contact with the surface; based at least in part on characteristics of the audio signal; determining a first location of the first tap; determining that the first location corresponds to the first zone; and illuminating a first visual cue at the first zone; identifying a second tap within the audio signal, the second tap corresponding to a second contact with the surface subsequent to the first tap; based at least in part on characteristics of the audio signal; determining a second location of the second tap; determining that the second location corresponds to the second zone; and illuminating a second visual cue at the second zone subsequent to the first visual cue; and interpreting the first tap and the second tap correspond to the predetermined series. - View Dependent Claims (7, 8, 9, 10, 11, 21, 22, 23)
-
12. A device comprising:
-
one or more microphones configured to capture audio and generate an audio signal based on the captured audio; one or more speakers configured to output audio; one or more processors; and one or more computer-readable media having computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising; causing to be output, via the one or more speakers, an audible prompt to enter information via one or more contacts with a surface of the device; detecting a first tap within the audio signal, the first tap corresponding to a first contact with the device after the audible prompt; based on characteristics of the audio signal; determining a first location on the device associated with the first tap; and illuminating a first indicator zone on the device from among multiple indicator zones, the first indicator zone being proximate the first location; interpreting the tap as a user input, a value of the user input based at least in part on the location; and upon receiving a second tap at a second location on the device, illuminating a second indicator zone from the multiple indicator zones, the second indicator zone being proximate the second location. - View Dependent Claims (13, 14, 15, 16, 17, 18, 24)
-
Specification