Noise reduction based on mouth area movement recognition
Abstract
A computing device can capture video data of at least a portion of a mouth area (e.g., mouth, lips, tongue, chin, jaw) of a user of the device. The computing device can also capture sound data including a voice of the user as well as noise (e.g., background noise). The video data can be processed to detect a movement of the portion of the mouth area. The movement of the portion of the mouth area can be analyzed and compared with mouth area movement models characteristic of oral communication (e.g., speech, song). If the movement of the portion of the mouth area corresponds to at least one model characteristic of oral communication, then the movement indicates that the user is likely engaging in oral communication. Noise reduction can be applied and/or increased on the captured sound data to reduce noise and in turn enhance the user's voice.
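The comparison described in the abstract can be illustrated with a minimal sketch. It assumes that some upstream step (not shown) has already reduced each video frame to a single mouth-opening measurement, and it uses plain Pearson correlation against stored templates as a stand-in for the comparison with movement models, which the abstract does not specify. The function names, the threshold, and the two noise reduction levels are hypothetical.

```python
import math
from typing import Sequence


def correlation(a: Sequence[float], b: Sequence[float]) -> float:
    """Pearson correlation of two series, truncated to the shorter length."""
    n = min(len(a), len(b))
    if n == 0:
        return 0.0
    a, b = a[:n], b[:n]
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # A motionless mouth (or a flat template) matches nothing.
        return 0.0
    return cov / math.sqrt(var_a * var_b)


def matches_oral_communication(
    openings: Sequence[float],
    models: Sequence[Sequence[float]],
    threshold: float = 0.8,  # assumed cutoff, not taken from the patent
) -> bool:
    """True if the observed movement corresponds to at least one model."""
    return any(correlation(openings, m) >= threshold for m in models)


def choose_noise_reduction(
    openings: Sequence[float],
    models: Sequence[Sequence[float]],
    base_level: float = 0.2,     # assumed level when no speech is detected
    boosted_level: float = 0.8,  # assumed level when speech is likely
) -> float:
    """Increase noise reduction when the mouth movement looks like speech."""
    if matches_oral_communication(openings, models):
        return boosted_level
    return base_level
```

A real system would likely use richer features (lip contours, jaw position) and a trained classifier; the sketch only shows the decision flow from observed movement, to model match, to noise reduction level.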
25 Claims
1. A computer-implemented method, comprising:
capturing video information using a camera of a computing device, the video information showing at least a portion of a mouth area of a user of the computing device;
capturing audio information using a microphone of the computing device, the audio information including voice data generated by the user and an amount of noise;
processing the video information to determine a movement of the portion of the mouth area of the user;
applying noise reduction to the audio information to generate modified audio information that corresponds to a reduction of at least a portion of the noise;
transmitting, over a communication network, the modified audio information;
determining that the movement of the portion of the mouth area does not correspond to user speech; and
causing at least one of capturing the audio information, applying the noise reduction, or transmitting the modified audio information to cease being performed for at least a period of time.
Dependent claims: 2, 3

4. A computer-implemented method, comprising:
receiving image information showing at least a portion of a face of a user of a computing device;
receiving audio information corresponding to the image information;
processing the image information to determine a movement of the portion of the face of the user;
applying noise reduction to the audio information to generate modified audio information;
determining that the movement of the portion of the face of the user does not correspond to communication; and
causing at least one of receiving the audio information or applying the noise reduction to cease being performed for at least a period of time.
Dependent claims: 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16

17. A computing device, comprising:
at least one image capture component configured to capture image information;
at least one audio capture component configured to capture audio information;
a processor; and
a memory device including instructions that, upon being executed by the processor, cause the computing device to:
receive image information showing at least a portion of a face of a user of the computing device from the at least one image capture component;
receive audio information corresponding to the image information from the at least one audio capture component;
process the image information to determine a movement of the portion of the face of the user;
apply noise reduction to the audio information to generate modified audio information;
determine that the movement of the portion of the face of the user does not correspond to oral communication; and
cause at least one of ceasing to receive the audio information or ceasing to apply the noise reduction for at least a period of time.
Dependent claims: 18, 19, 20, 21

22. A non-transitory computer-readable storage medium including instructions that, upon being executed by a processor of a computing device, cause the computing device to:
receive image information showing at least a portion of a face of a user of the computing device;
receive audio information corresponding to the image information;
process the image information to determine a movement of the portion of the face of the user;
apply noise reduction to the audio information to generate modified audio information;
determine that the movement of the portion of the face of the user does not correspond to oral communication; and
cause at least one of ceasing to receive the audio information or ceasing to apply the noise reduction for at least a period of time.
Dependent claims: 23, 24, 25

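The final step shared by the independent claims, ceasing audio handling "for at least a period of time" once the detected movement does not correspond to communication, can be sketched as a small gate. This is one possible reading only: the class name `ProcessingGate`, the `min_pause_s` parameter, and the choice to keep processing suspended until the pause has fully elapsed are illustrative assumptions, not details taken from the claims.

```python
from dataclasses import dataclass, field


@dataclass
class ProcessingGate:
    """Decides whether capture / noise reduction / transmission should run.

    Hypothetical helper: the pause length and the policy are assumptions.
    """

    min_pause_s: float = 0.5  # assumed minimum suspension after non-speech
    _suspended_until: float = field(default=float("-inf"), init=False)

    def update(self, movement_is_communication: bool, now_s: float) -> bool:
        """Return True if audio processing should be performed at now_s."""
        if not movement_is_communication:
            # Non-speech movement: (re)start the suspension window.
            self._suspended_until = max(
                self._suspended_until, now_s + self.min_pause_s
            )
        # Processing resumes only once the suspension window has elapsed.
        return now_s >= self._suspended_until
```

In use, a caller would invoke `update` once per analyzed video frame and skip receiving audio, applying noise reduction, or transmitting whenever it returns `False`. A production design might resume immediately when speech-like movement returns; the sketch deliberately holds the pause for its full duration to mirror the "at least a period of time" wording.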
Specification