Dynamic audio optimization
First Claim
Patent Images
1. An audio optimization system comprising:
- one or more processors;
one or more sensors communicatively coupled to the one or more processors; and
one or more computer-readable storage media storing one or more computer-executable instructions that are executable by the one or more processors to;
receive data from the one or more sensors;
analyze the data to determine objects within an environment, the environment including a plurality of speakers that output audio;
detect audio characteristics of the audio being output;
determine locations based on the data, wherein each location is associated with a corresponding object of the objects;
identify, based on the objects, a first human and a second human, the first human being at a first location and the second human being at a second location;
select a new location based on the first location and the second location, wherein the new location is between the first location and the second location;
determine, from the data, user authentication information associated with at least one of the first human or the second human, the user authentication information comprising at least one of a pattern tapped onto a surface of the environment or user interaction with a reference object;
determine, based on at least one of the audio characteristics or the user authentication, that the first human or the second human is likely to have a higher interest in the audio; and
adjust audio output from a plurality of speakers to optimize the audio at the new location by instructing a first speaker of the plurality of speakers to output the audio at a first volume level and instructing a second speaker of the plurality of speakers to output the audio at a second volume level that is different than the first volume level such that a first detected volume level of the audio received from the first speaker at the new location is substantially equal to a second detected volume level of the audio received from the second speaker at the new location.
2 Assignments
0 Petitions
Accused Products
Abstract
An environment detection node supports dynamic audio optimization by receiving data from sensors and analyzing the received data to detect objects such as furniture and/or humans within an environment. Based on locations within the environment of the detected objects, the environment detection node determines an optimized target location and adjusts audio output to be optimized when heard at the target location.
-
Citations
38 Claims
-
1. An audio optimization system comprising:
-
one or more processors; one or more sensors communicatively coupled to the one or more processors; and one or more computer-readable storage media storing one or more computer-executable instructions that are executable by the one or more processors to; receive data from the one or more sensors; analyze the data to determine objects within an environment, the environment including a plurality of speakers that output audio; detect audio characteristics of the audio being output; determine locations based on the data, wherein each location is associated with a corresponding object of the objects; identify, based on the objects, a first human and a second human, the first human being at a first location and the second human being at a second location; select a new location based on the first location and the second location, wherein the new location is between the first location and the second location; determine, from the data, user authentication information associated with at least one of the first human or the second human, the user authentication information comprising at least one of a pattern tapped onto a surface of the environment or user interaction with a reference object; determine, based on at least one of the audio characteristics or the user authentication, that the first human or the second human is likely to have a higher interest in the audio; and adjust audio output from a plurality of speakers to optimize the audio at the new location by instructing a first speaker of the plurality of speakers to output the audio at a first volume level and instructing a second speaker of the plurality of speakers to output the audio at a second volume level that is different than the first volume level such that a first detected volume level of the audio received from the first speaker at the new location is substantially equal to a second detected volume level of the audio received from the second speaker at the new location. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method comprising:
-
receiving, from one or more sensors, data captured by the one or more sensors from an environment, the environment including at least a first speaker at a first physical location within the environment, a second speaker outputting audio, and a plurality of inanimate objects; determining at least one audio characteristic of the audio being output; based at least in part on the data, determining a plurality of locations, wherein individual locations of the plurality of locations are associated with an inanimate object of the plurality of inanimate objects; based at least in part on the plurality locations, determining a target location within the environment; determining a second physical location within the environment from which to output audio to optimize audio output at the target location; based at least in part on the data, determining user authentication information associated with at least one of a first human or a second human within the environment, the user authentication information comprising at least one of a pattern tapped onto a surface of the environment or user interaction with a reference object; determining, based at least in part on at least one of the at least one audio characteristic or the user authentication information, that the first human or the second human is likely to have a higher interest in the audio; causing a motor associated with the first speaker to move the first speaker from the first physical location to the second physical location; and substantially equalizing a first detected volume level of the audio at the target location from the first speaker with a second detected volume level of the audio at the target location from the second speaker by instructing the first speaker to output the audio at a first modified volume level and instructing the second speaker to output the audio at a second modified volume level that is different than the first modified volume level. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method comprising:
-
receiving, from one or more sensors, data captured by the one or more sensors from an environment, the environment including at least a first speaker and a second speaker outputting audio; based at least in part on the data, determining that a first human and a second human reside within the environment, the first human having a first location within the environment and the second human having a second location within the environment; determining that the first location is more frequently occupied than the second location; based at least in part on determining that the first location is more frequently occupied than the second location, substantially equalizing a first detected volume level of the audio at the first location from the first speaker with a second detected volume level of the audio at the first location from the second speaker by instructing the first speaker to output the audio at a first modified volume level and instructing the second speaker to output the audio at a second modified volume level that is different than the first modified volume level; based at least in part on the data, determining that at least one of the first human or the second human is changing location within the environment to a new location; and substantially equalizing a third detected volume level of the audio at the new location from the first speaker with a fourth detected volume level of the audio at the new location from the second speaker by instructing the first speaker to output the audio at a third modified volume level and instructing the second speaker to output the audio at a fourth modified volume level that is different than the third modified volume level. - View Dependent Claims (22, 23, 24, 25, 26, 27)
-
-
28. A method comprising:
-
receiving, from one or more sensors, data captured by the one or more sensors from an environment, the environment including at least a first speaker and a second speaker outputting audio; based at least in part on the data, detecting a plurality humans within the environment, individual humans of the plurality of humans having an associated location within the environment; based at least in part on the data, determining user authentication information associated with at least one human of the plurality of humans within the environment, the user authentication information comprising at least one of a pattern tapped onto a surface of the environment or user interaction with a reference object; identifying audio characteristics of the audio being output; based at least in part on the audio characteristics or the user authentication information, determining a set of humans of the plurality of humans that are likely to have a higher interest in the audio; based at least on locations associated with the set of humans, determining an optimized target location within the environment; and substantially equalizing a first detected volume level of the audio at the optimized target location from the first speaker with a second detected volume level of the audio at the optimized target location from the second speaker by instructing the first speaker to output the audio at a first modified volume level and instructing the second speaker to output the audio at a second modified volume level that is different than the first modified volume level. - View Dependent Claims (29, 30, 31)
-
-
32. A method comprising:
-
receiving, from one or more sensors, data captured by the one or more sensors from an environment, the data including user authentication information and the environment including at least a first speaker and a second speaker outputting audio, the user authentication information comprising at least one of a pattern tapped onto a surface or user interaction with a reference object; based at least in part on the data, determining that a human is present within the environment; based at least in part on the user authentication information, determining an identity of the human; determining an audio profile associated with the identity of the human; based at least on the audio profile and at least one audio characteristic of the environment, adjusting at least one audio characteristic of audio that is being output, wherein the at least one audio characteristic includes at least one of an echo level, a reverb level, a sound-absorbing level, or a background noise level; based at least in part on the data, determining a location of the human within the environment; substantially equalizing a first detected volume level of the audio at the location from the first speaker with a second detected volume level of the audio at the location from the second speaker by instructing the first speaker to output the audio at a first modified volume level and instructing the second speaker to output the audio at a second modified volume level that is different than the first modified volume level; determining that the human is changing or has changed location within the environment from the location to a new location; and substantially equalizing a third detected volume level of the audio at the new location from the first speaker with a fourth detected volume level of the audio at the new location from the second speaker by instructing the first speaker to output the audio at a third modified volume level and instructing the second speaker to output the audio at a fourth modified volume level that is different than the third modified volume level. - View Dependent Claims (33, 34, 35)
-
-
36. A method comprising:
-
receiving, from one or more sensors, data captured by the one or more sensors from an environment, the environment including at least a first speaker and a second speaker outputting audio; based at least in part on the data, detecting audio characteristics of the environment, wherein the audio characteristics of the environment include at least one of an echo level, a reverb level, a sound-absorbing level, or a background noise level; based at least in part on the data, determining user authentication information associated with at least one of a first human or a second human within the environment, the user authentication information comprising at least one of a pattern tapped onto a surface of the environment or user interaction with a reference object, the first human having a first location within the environment and the second human having a second location within the environment; determining that the first location is more frequently occupied than the second location; based at least in part on the audio characteristics or the user authentication information, adjusting the audio output from the first speaker and the second speaker; determining that at least one of the first human or the second human is changing or has changed location within the environment to a target location; and based at least in part on at least one of the first location being occupied more frequently than the second location or that the at least one of the first human or the second human is changing or has changed location, substantially equalizing a first detected volume level of the audio at the target location from the first speaker with a second detected volume level of the audio at the target location from the second speaker by instructing the first speaker to output the audio at a first modified volume level and instructing the second speaker to output the audio at a modified second volume level that is different than the first modified volume level. - View Dependent Claims (37, 38)
-
Specification