Exploiting visual information for enhancing audio signals via source separation and beamforming
First Claim
1. A system, comprising:
a memory that stores instructions;
a processor that executes the instructions to perform operations, the operations comprising:
obtaining visual content associated with an environment of a user;
extracting, from the visual content, first metadata associated with the environment of the user, wherein the first metadata indicates location information, information that identifies an interferer generating noise in the environment, information that identifies a location of the interferer, and an action being performed by the user;
determining a location of the user based on the first metadata;
loading, based on the location, an audio profile corresponding to the location of the user;
loading a user profile of the user, wherein the user profile comprises audio data associated with the user;
cancelling, based on the audio profile, the first metadata, and the user profile, the noise from the environment of the user; and
adjusting, based on the audio profile and the user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.
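The claim lists what the first metadata indicates and how the location drives the audio-profile lookup, so a small data-structure sketch may help make those elements concrete. Everything named here (`FirstMetadata`, `Interferer`, `AudioProfile`, `determine_location`, `load_audio_profile`, the field names, and the sample values) is a hypothetical illustration, not something the claim or the patent defines; the claim does not say how metadata is represented or how profiles are stored.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class Interferer:
    """A noise source identified in the visual content (hypothetical shape)."""
    label: str                     # what the interferer is, e.g. "luggage cart"
    position: Tuple[float, float]  # assumed (x, y) offset from the user, in metres


@dataclass(frozen=True)
class FirstMetadata:
    """The four kinds of information the claim says the first metadata indicates."""
    location_info: str                 # location information
    interferer: Optional[Interferer]   # identity and location of the interferer
    user_action: str                   # action being performed by the user


@dataclass(frozen=True)
class AudioProfile:
    """Audio characteristics stored per location (fields are assumptions)."""
    location: str
    expected_noise_db: float


def determine_location(metadata: FirstMetadata) -> str:
    # Assumption: the location is derived by normalising the location information.
    return metadata.location_info.strip().lower()


def load_audio_profile(
    location: str, profiles: Dict[str, AudioProfile]
) -> Optional[AudioProfile]:
    # Returns None when no profile is stored for the location.
    return profiles.get(location)


# Illustrative values only.
profiles = {"airport": AudioProfile("airport", 75.0)}
metadata = FirstMetadata(" Airport ", Interferer("luggage cart", (2.0, -1.5)), "walking")
profile = load_audio_profile(determine_location(metadata), profiles)
```

Keeping the interferer optional reflects that some visual content may show no identifiable noise source; whether the claimed system handles that case this way is not stated.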
1 Assignment
0 Petitions
Abstract
A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may adjust, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.
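The title names beamforming, but the abstract does not say which beamformer is used. As a rough illustration only, the following is a textbook delay-and-sum beamformer for integer sample delays, not the patented method. It assumes the per-microphone delays toward the user are already known (for example, derived from location metadata); the function names `delayed` and `delay_and_sum` and all signal values are hypothetical.

```python
from typing import List, Sequence


def delayed(signal: Sequence[float], d: int) -> List[float]:
    """Return `signal` delayed by `d` samples, zero-padded, same length."""
    return [0.0] * d + list(signal[: len(signal) - d])


def delay_and_sum(channels: Sequence[Sequence[float]], delays: Sequence[int]) -> List[float]:
    """Advance channel i by delays[i] samples, then average across channels.

    Sound arriving with exactly these delays (the steered source) adds
    coherently; sound arriving with other delays is attenuated.
    """
    n = min(len(ch) - d for ch, d in zip(channels, delays))
    return [
        sum(ch[t + d] for ch, d in zip(channels, delays)) / len(channels)
        for t in range(n)
    ]


# Illustrative two-microphone scene (values are made up).
user = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]   # the user's signal
noise = [1.0, -1.0] * 4                            # an alternating interferer

# The user reaches mic 1 two samples late; the interferer reaches it one sample late.
mic0 = [u + v for u, v in zip(user, noise)]
mic1 = [u + v for u, v in zip(delayed(user, 2), delayed(noise, 1))]

# Steer toward the user: advance mic 1 by the user's two-sample delay.
enhanced = delay_and_sum([mic0, mic1], delays=[0, 2])
```

In this contrived case the interferer's two copies end up one sample apart after alignment and cancel exactly, so `enhanced` matches the first six samples of `user`. Real interferers are only attenuated, which is presumably why the abstract pairs beamforming with stored audio and user profiles.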
30 Citations
20 Claims
1. A system, comprising:
a memory that stores instructions;
a processor that executes the instructions to perform operations, the operations comprising:
obtaining visual content associated with an environment of a user;
extracting, from the visual content, first metadata associated with the environment of the user, wherein the first metadata indicates location information, information that identifies an interferer generating noise in the environment, information that identifies a location of the interferer, and an action being performed by the user;
determining a location of the user based on the first metadata;
loading, based on the location, an audio profile corresponding to the location of the user;
loading a user profile of the user, wherein the user profile comprises audio data associated with the user;
cancelling, based on the audio profile, the first metadata, and the user profile, the noise from the environment of the user; and
adjusting, based on the audio profile and the user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A method, comprising:
capturing visual content associated with an environment of a user;
extracting, from the visual content, first metadata associated with the environment of the user, wherein the first metadata indicates location information, information that identifies an interferer generating noise in the environment, information that identifies a location of the interferer, and an action being performed by the user;
determining, by utilizing instructions from memory that are executed by a processor, a location of the user based on the first metadata;
loading, based on the location, an audio profile corresponding to the location of the user;
loading a user profile of the user, wherein the user profile comprises audio data associated with the user;
cancelling, based on the audio profile, the first metadata, and the user profile, noise from the environment of the user; and
adjusting, based on the audio profile and the user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
19. A non-transitory computer-readable device comprising instructions, which when executed by a processor, cause the processor to perform operations comprising:
obtaining visual content associated with an environment of a user;
extracting, from the visual content, first metadata associated with the environment of the user, wherein the first metadata indicates location information, information that identifies an interferer generating noise in the environment, information that identifies a location of the interferer, and an action being performed by the user;
determining a location of the user based on the first metadata;
loading, based on the location, an audio profile corresponding to the location of the user;
loading a user profile of the user, wherein the user profile comprises audio data associated with the user;
cancelling, based on the audio profile, the first metadata, and the user profile, noise from the environment of the user; and
adjusting, based on the audio profile and the user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.
Dependent claim: 20
Specification