Speechreading using facial feature parameters from a non-direct frontal view of the speaker
Abstract
A system for performing recognition having a telephone transmitter, a camera, a data channel and recognition processing logic, in which the camera is directly mounted to and positioned with respect to the telephone housing to obtain video information from a non-direct frontal view of the speaker corresponding to at least one facial feature for use in speechreading. The facial features that may be obtained include the position of the tongue, separation of the teeth and the rounding protrusion of the lips. Using this data, recognition processing logic performs speechreading recognition of the video information.
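The data flow in the abstract (camera, data channel, recognition processing logic) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `FacialFeatures` fields are hypothetical normalized measurements of the three features the abstract lists, the `TEMPLATES` values are invented, and the nearest-template matcher is a simple stand-in for whatever recognition technique the specification actually describes.

```python
import math
from dataclasses import dataclass
from queue import Queue


@dataclass(frozen=True)
class FacialFeatures:
    """Hypothetical per-frame measurements, each normalized to 0..1."""
    tongue_position: float
    teeth_separation: float
    lip_rounding: float

    def as_tuple(self):
        return (self.tongue_position, self.teeth_separation, self.lip_rounding)


# Invented reference templates for three mouth shapes (illustrative only).
TEMPLATES = {
    "oo": FacialFeatures(0.2, 0.1, 0.9),
    "ee": FacialFeatures(0.6, 0.2, 0.1),
    "ah": FacialFeatures(0.3, 0.8, 0.3),
}


def recognize(features):
    """Stand-in recognition logic: return the label of the nearest template."""
    return min(
        TEMPLATES,
        key=lambda label: math.dist(features.as_tuple(), TEMPLATES[label].as_tuple()),
    )


def run_pipeline(frames):
    """Camera side puts frames on the channel; recognition side drains it."""
    channel = Queue()  # stand-in for the data channel
    for frame in frames:
        channel.put(frame)
    results = []
    while not channel.empty():
        results.append(recognize(channel.get()))
    return results
```

Calling `run_pipeline` with a list of `FacialFeatures` values returns one label per frame. A real system would first have to extract these measurements from camera images, which this sketch does not attempt.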
22 Claims
1. A system for performing recognition comprising:
- a telephone transmitter contained in a movable telephone transmitter housing;
- a camera directly mounted to and positioned with respect to the telephone transmitter to obtain video information, corresponding to at least one facial feature for speechreading, from a non-direct frontal view of a speaker;
- a data channel coupled to the camera to transfer the video information from the camera; and
- a recognition processing logic coupled to the data channel to perform speechreading recognition of the video information.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 21
12. An apparatus for obtaining data for use by recognition processing logic, said apparatus comprising:
- a telephone transmitter;
- a camera coupled to the telephone transmitter and positioned with respect to the telephone transmitter to obtain video information, corresponding to at least one facial feature for speechreading, from a non-direct frontal view of a speaker, wherein the video information comprises position of a tongue of a user and the rounding protrusion of the lips;
- a data channel coupled to the camera to transfer the video information from the camera to the recognition processing logic to enable speechreading recognition of the video information.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 22
20. A method for performing recognition comprising the steps of:
- receiving audio information from a speaker using a telephone transmitter;
- receiving video information, corresponding to at least one facial feature of the speaker, from a non-direct frontal view of the speaker, using a camera coupled to the telephone transmitter;
- transferring the audio and video information by a data channel to recognition logic for speech and pattern recognition.
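The transfer step of claim 20, in which audio and video information travel together over a data channel to the recognition logic, might be sketched as follows. This is an illustration under stated assumptions, not the patented method: the `Packet` layout, the millisecond timestamp, and the JSON encoding are all hypothetical choices, and the claim does not specify any of them.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class Packet:
    """Hypothetical unit sent over the data channel."""
    timestamp_ms: int  # assumed capture time, used to keep audio and video aligned
    audio: list        # audio samples from the telephone transmitter
    video: dict        # facial feature measurements from the camera


def encode(packet):
    """Sender side: serialize one packet to bytes (JSON is an assumed format)."""
    return json.dumps(asdict(packet)).encode("utf-8")


def decode(data):
    """Receiver side: rebuild the packet for the recognition logic."""
    return Packet(**json.loads(data.decode("utf-8")))


def transfer(packets):
    """Stand-in for the data channel: encode on one end, decode on the other."""
    return [decode(encode(p)) for p in packets]
```

Carrying both streams in one timestamped packet is only one possible design. It keeps the audio and video aligned for whatever speech and pattern recognition follows, at the cost of coupling the two capture rates.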
Specification