Ambient assistant device
First Claim
Patent Images
1. A home assistant device, comprising:
- a speaker;
a microphone;
a display screen;
one or more processors; and
memory storing instructions, wherein the processor is configured to execute the instructions such that the processor and memory are configured to;
detect first speech spoken within an environment of the home assistant device using the microphone;
determine that the first speech includes content having one or more words or phrases included in a local dictionary of the home assistant device;
provide a first interaction opportunity with the home assistant device based on the one or more words or phrases of the first speech corresponding to the local dictionary, the first interaction opportunity providing a speech response using the speaker based on the content of the first speech and based on the first speech including the content having the one or more words or phrases included in the local dictionary of the home assistant device;
detect second speech spoken within the environment of the home assistant device, the first speech being different than the second speech;
determine that the content of the second speech does not include the one or more words or phrases corresponding to the local dictionary;
provide the second speech to a cloud server to determine content related to the second speech;
receive response data from the cloud server based on the second speech; and
provide a second interaction opportunity with the home assistant device based on the response data received from the cloud server, the second interaction opportunity different than the first interaction opportunity, the second interaction opportunity providing a visual response on the display screen based on the content of the second speech and based on the second speech being provided to the cloud server.
1 Assignment
0 Petitions
Accused Products
Abstract
Ambient assistance is described. An assistant device can detect speech in its environment and determine that the speech includes words or phrases of a local dictionary of the assistant device. The assistant device can then generate an interaction opportunity based on the words or phrases of the speech.
-
Citations
22 Claims
-
1. A home assistant device, comprising:
-
a speaker; a microphone; a display screen; one or more processors; and memory storing instructions, wherein the processor is configured to execute the instructions such that the processor and memory are configured to; detect first speech spoken within an environment of the home assistant device using the microphone; determine that the first speech includes content having one or more words or phrases included in a local dictionary of the home assistant device; provide a first interaction opportunity with the home assistant device based on the one or more words or phrases of the first speech corresponding to the local dictionary, the first interaction opportunity providing a speech response using the speaker based on the content of the first speech and based on the first speech including the content having the one or more words or phrases included in the local dictionary of the home assistant device; detect second speech spoken within the environment of the home assistant device, the first speech being different than the second speech; determine that the content of the second speech does not include the one or more words or phrases corresponding to the local dictionary; provide the second speech to a cloud server to determine content related to the second speech; receive response data from the cloud server based on the second speech; and provide a second interaction opportunity with the home assistant device based on the response data received from the cloud server, the second interaction opportunity different than the first interaction opportunity, the second interaction opportunity providing a visual response on the display screen based on the content of the second speech and based on the second speech being provided to the cloud server.
-
-
2. A method, comprising:
-
detecting first speech spoken within an environment of an assistant device; determining, by a processor, that the first speech includes content having one or more words or phrases corresponding to a local dictionary of the assistant device; generating a first interaction opportunity with the assistant device based on the one or more words or phrases of the first speech corresponding to the local dictionary, the first interaction opportunity being a first type of interaction opportunity based on the first speech including content having the one or more words or phrases corresponding to the local dictionary of the assistant device; detecting second speech spoken within the environment of the assistant device, the first speech being different than the second speech; determining that the second speech does not include the one or more words or phrases corresponding to the local dictionary; providing the second speech to a cloud server to determine content or interactions related to the second speech; receiving response data from the cloud server based on the content of the second speech; and generating a second interaction opportunity with the assistant device based on the response data received from the cloud server, the second interaction opportunity being a second type of interaction opportunity that is different than the first type of interaction opportunity, the second interaction opportunity being the second type based on providing the second speech to the cloud server. - View Dependent Claims (3, 4, 5, 6, 7, 8)
-
-
9. An assistant device, comprising:
-
one or more processors; and memory storing instructions, wherein the processor is configured to execute the instructions such that the processor and memory are configured to; detect first speech spoken within an environment of the assistant device; determine that the first speech includes content having one or more words or phrases corresponding to a local dictionary of the assistant device; generate a first interaction opportunity with the assistant device based on the one or more words or phrases of the first speech corresponding to the local dictionary, the first interaction opportunity being a first type of interaction opportunity based on the first speech including content having the one or more words or phrases corresponding to the local dictionary of the assistant device; detect second speech spoken within the environment of the assistant device, the first speech being different than the second speech; determine that the second speech does not include the one or more words or phrases corresponding to the local dictionary; provide the second speech to a cloud server to determine content or interactions related to the second speech; receive response data from the cloud server based on the content of the second speech; and generate a second interaction opportunity with the assistant device based on the response data received from the cloud server, the second interaction opportunity being a second type of interaction opportunity that is different than the first type of interaction opportunity, the second interaction opportunity being the second type based on providing the second speech to the cloud server. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. A computer program product, comprising one or more non-transitory computer-readable media having computer program instructions stored therein, the computer program instructions being configured such that, when executed by one or more computing devices, the computer program instructions cause the one or more computing devices to:
-
detect first speech spoken within an environment of an assistant device; determine that the first speech includes content having one or more words or phrases corresponding to a local dictionary of the assistant device; generate a first interaction opportunity with the assistant device based on the one or more words or phrases of the first speech corresponding to the local dictionary, the first interaction opportunity being a first type of interaction opportunity based on the first speech including content having the one or more words or phrases corresponding to the local dictionary of the assistant device; detect second speech spoken within the environment of the assistant device, the first speech being different than the second speech; determine that the second speech does not include the one or more words or phrases corresponding to the local dictionary; provide the second speech to a cloud server to determine content or interactions related to the second speech; receive response data from the cloud server based on the content of the second speech; and generate a second interaction opportunity with the assistant device based on the response data received from the cloud server, the second interaction opportunity being a second type of interaction opportunity that is different than the first type of interaction opportunity, the second interaction opportunity being the second type based on providing the second speech to the cloud server. - View Dependent Claims (17, 18, 19, 20, 21, 22)
-
Specification