Methods and systems for detecting and processing speech signals
First Claim
1. A computer-implemented method comprising:
- receiving, by a computing device that (i) is operating in a low power mode, (ii) is configured to exit the low power mode upon determining that a particular hotword has likely been spoken, and (iii) is in proximity of other computing devices that are each also configured to exit the low power mode upon determining that the particular hotword has been spoken, audio data corresponding to a user uttering the particular hotword;
based on an estimated position of the user in relation to the computing device, determining, by the computing device, to remain operating in the low power mode despite determining that the particular hotword has likely been spoken.
3 Assignments
0 Petitions
Accused Products
Abstract
Provided are methods, systems, and apparatuses for detecting, processing, and responding to audio signals, including speech signals, within a designated area or space. A platform for multiple media devices connected via a network is configured to process speech, such as voice commands, detected at the media devices, and respond to the detected speech by causing the media devices to simultaneously perform one or more requested actions. The platform is capable of scoring the quality of a speech request, handling speech requests from multiple end points of the platform using a centralized processing approach, a de-centralized processing approach, or a combination thereof, and also manipulating partial processing of speech requests from multiple end points into a coherent whole when necessary.
-
Citations
20 Claims
-
1. A computer-implemented method comprising:
-
receiving, by a computing device that (i) is operating in a low power mode, (ii) is configured to exit the low power mode upon determining that a particular hotword has likely been spoken, and (iii) is in proximity of other computing devices that are each also configured to exit the low power mode upon determining that the particular hotword has been spoken, audio data corresponding to a user uttering the particular hotword; based on an estimated position of the user in relation to the computing device, determining, by the computing device, to remain operating in the low power mode despite determining that the particular hotword has likely been spoken. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving, by a computing device that (i) is operating in a low power mode, (ii) is configured to exit the low power mode upon determining that a particular hotword has likely been spoken, and (iii) is in proximity of other computing devices that are each also configured to exit the low power mode upon determining that the particular hotword has been spoken, audio data corresponding to a user uttering the particular hotword; based on an estimated position of the user in relation to the computing device, determining, by the computing device, to remain operating in the low power mode despite determining that the particular hotword has likely been spoken. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising:
-
one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; receiving, by a computing device that (i) is operating in a low power mode, (ii) is configured to exit the low power mode upon determining that a particular hotword has likely been spoken, and (iii) is in proximity of other computing devices that are each also configured to exit the low power mode upon determining that the particular hotword has been spoken, audio data corresponding to a user uttering the particular hotword; based on an estimated position of the user in relation to the computing device, determining, by the computing device, to remain operating in the low power mode despite determining that the particular hotword has likely been spoken. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification