Speaker and Person Backlighting For Improved AEC and AGC
Abstract
Regions of interest in video image capture for communication purposes are selected using one or more of sound source localization, multi-person detection, and active speaker detection, each relying on audio and/or visual cues. Exposure and/or gain for the selected region are automatically enhanced to improve video quality, focusing on people or inanimate objects of interest.
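The selection step described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the names `Rect`, `union`, and `select_roi` are hypothetical, each detector is assumed to report its result as a rectangle (or nothing), and the priority order among the three cues is an assumption made only for this sketch.

```python
from typing import List, Optional, Tuple

# A rectangle as (left, top, right, bottom) in pixel coordinates.
Rect = Tuple[int, int, int, int]


def union(rects: List[Rect]) -> Rect:
    """Smallest rectangle that contains every rectangle in `rects`."""
    return (
        min(r[0] for r in rects),
        min(r[1] for r in rects),
        max(r[2] for r in rects),
        max(r[3] for r in rects),
    )


def select_roi(
    active_speaker: Optional[Rect],
    sound_source: Optional[Rect],
    people: List[Rect],
) -> Optional[Rect]:
    """Pick a region of interest from whichever cues are available.

    Assumed priority (not taken from the source): active speaker
    detection, then sound source localization, then the union of all
    rectangles from multi-person detection.
    """
    if active_speaker is not None:
        return active_speaker
    if sound_source is not None:
        return sound_source
    if people:
        return union(people)
    return None  # no cue available; a caller could meter the whole frame
```

With only multi-person detection available, the region of interest in this sketch is the bounding rectangle around all detected people.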
20 Claims
1. A method to be executed at least in part in a computing device for improving image quality of a selected region in a video frame, the method comprising:
- receiving a captured video frame;
- determining a region of interest based on input through at least one from a set of: sound source localization, multi-person detection, and active speaker detection;
- automatically adjusting at least one of an exposure parameter and a gain parameter for the determined region of interest such that the image quality of the region of interest is improved; and
- encoding the video frame for at least one of transmission and storage.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A computing device for improving image quality of a region of interest in a video communication application, comprising:
- a memory;
- a video capture device configured to capture frames of video;
- a processor coupled to the memory and the video capture device, and configured to execute a video processing application, the video processing application comprising:
  - a pre-processing module for: receiving a captured video frame;
  - a selection module for: determining the region of interest based on input through at least one from a set of: sound source localization, multi-person detection, and active speaker detection;
  - an automatic gain/exposure control module for: adjusting at least one of a gain and an exposure for a portion of the video frame containing the region of interest by computing an image mean for pixel values of the portion of the video frame weighted based on fixed backlighting for the portion of the video frame and comparing the computed image mean to a threshold value for determining at least one of a new gain parameter and a new exposure parameter; and
  - an encoding module for: encoding the processed video frame for subsequent transmission to a video rendering application; and
- a communication device configured to transmit encoded frames to another computing device over a network for one of rendering and storage.
Dependent claims: 11, 12, 13, 14, 15, 16, 17
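The gain/exposure control step recited in claim 10 (a weighted image mean compared against a threshold) can be sketched as below. This is an illustrative sketch under stated assumptions, not the claimed implementation: the function names, the target of 128, the threshold of 16, the integer gain and exposure levels, and the policy of raising exposure before gain are all hypothetical choices made for the example.

```python
from typing import List, Tuple


def weighted_mean(pixels: List[List[float]], weights: List[List[float]]) -> float:
    """Weighted mean of luminance values; `weights` has the same shape as `pixels`."""
    total = sum(p * w for prow, wrow in zip(pixels, weights)
                for p, w in zip(prow, wrow))
    norm = sum(w for wrow in weights for w in wrow)
    return total / norm


def update_gain_exposure(
    mean: float,
    gain: int,
    exposure: int,
    target: float = 128.0,     # assumed target luminance
    threshold: float = 16.0,   # assumed dead band around the target
    max_gain: int = 8,         # assumed number of gain levels
    max_exposure: int = 8,     # assumed number of exposure levels
) -> Tuple[int, int]:
    """One control step; returns (new_gain, new_exposure)."""
    if mean < target - threshold:        # region too dark
        if exposure < max_exposure:      # assumed policy: exposure first
            exposure += 1
        elif gain < max_gain:
            gain += 1
    elif mean > target + threshold:      # region too bright
        if gain > 0:                     # assumed policy: back off gain first
            gain -= 1
        elif exposure > 0:
            exposure -= 1
    return gain, exposure                # unchanged inside the dead band
```

Raising exposure before gain is a common choice because gain also amplifies sensor noise; the claim itself only requires that at least one of the two parameters be determined from the comparison.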
18. A computer-readable storage medium with instructions stored thereon for improving image quality of a selected person in a video conference application, the instructions comprising:
- receiving a captured video frame;
- dividing the video frame into at least two backlighting bands;
- assigning different weight factors to the backlighting bands; and
- adjusting a backlighting of each band based on the assigned weight factors such that the backlighting band containing at least the selected person is rendered more prominent than other bands;
- determining the selected person based on input through at least one from a set of: sound source localization, multi-person detection, and active speaker detection;
- determining at least one of a new gain parameter and a new exposure parameter for a portion of the video frame containing the selected person by computing an image mean for pixel values of the video frame and the portion of the video frame weighted based on corresponding backlighting bands and comparing the computed image mean to a target value and threshold value such that the selected person becomes prominent within the video frame based on the application of the new gain parameter and the new exposure parameter;
- encoding the processed video frame; and
- transmitting encoded frames to another computing device over a network for one of rendering and storage.
Dependent claims: 19, 20
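The band-weighted image mean recited in claim 18 can be sketched as follows. This is a rough illustration only: the claim does not say how the bands are laid out or how the frame mean and the person's region are combined, so the horizontal equal-height bands, the `roi_share` blending factor, and all function names here are assumptions.

```python
from typing import List, Tuple


def band_row_weights(height: int, band_weights: List[float]) -> List[float]:
    """Split `height` rows into len(band_weights) horizontal bands of
    near-equal size and return the weight that applies to each row."""
    n = len(band_weights)
    return [band_weights[row * n // height] for row in range(height)]


def band_weighted_mean(rows: List[List[float]], row_weights: List[float]) -> float:
    """Mean luminance in which each row counts according to its band weight."""
    total = sum(w * sum(row) for row, w in zip(rows, row_weights))
    norm = sum(w * len(row) for row, w in zip(rows, row_weights))
    return total / norm


def metering_mean(
    frame: List[List[float]],
    roi: Tuple[int, int, int, int],   # (left, top, right, bottom)
    band_weights: List[float],
    roi_share: float = 0.7,           # assumed blend between ROI and frame
) -> float:
    """Blend the band-weighted mean of the whole frame with that of the
    portion of the frame containing the selected person."""
    left, top, right, bottom = roi
    weights = band_row_weights(len(frame), band_weights)
    frame_mean = band_weighted_mean(frame, weights)
    roi_rows = [row[left:right] for row in frame[top:bottom]]
    roi_mean = band_weighted_mean(roi_rows, weights[top:bottom])
    return roi_share * roi_mean + (1.0 - roi_share) * frame_mean
```

Giving the band that contains the selected person a larger weight pulls the computed mean toward that band's brightness, so the subsequent comparison against the target and threshold values is driven mainly by how the person is lit rather than by a bright background.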
Specification