Image encoding and synthesis

US 4,841,575 A
Filed: 11/14/1986
Issued: 06/20/1989
Est. Priority Date: 11/14/1985
Status: Expired due to Term

Abstract

Visual images of the face of a speaker are processed to extract during a learning sequence a still frame of the image and a set of typical mouth shapes. Encoding of a sequence to be transmitted, recorded etc. is then achieved by matching the changing mouth shapes to those of the set and generating codewords identifying them. Alternatively, the codewords may be generated to accompany real or synthetic speech using a look-up table relating speech parameters to codewords. In a receiver, the still frames and set of mouth shapes are stored and received codewords used to select successive mouth shapes to be incorporated in the still frame.

104 Citations

17 Claims

1. An apparatus for encoding a moving image including a human face comprising:
- means for receiving video input data;
  
  means for output of data representing one frame of the image;
  
  identification means arranged in operation for each frame of the image to identify that part of the input data corresponding to the mouth of the face represented and
  
  (a) in a first phase of operation to compare the mouth data parts of each frame with those of other frames to select a representative set of mouth data parts, to store the selected parts and to output the selected parts;
  
  (b) in a second phase to compare the mouth data part of each frame with those of the stored set and to generate a codeword indicating which member of the set the mouth data part of that frame most closely resembles.
- Dependent claims (2, 3, 4, 5, 6, 7, 8)
  - 2. An apparatus according to claim 1 in which the identification means is arranged in operation firstly to identify that part of one frame of input data corresponding to the mouth of the face represented and to identify the mouth part of successive frames by auto-correlation with data from the said one frame.
  - 3. An apparatus according to claim 1 or 2 in which the identification means is arranged in operation during the first phase to store a first mouth data part and then for the mouth data parts of each successive frame to compare it with the first and any other stored mouth data part and if the result of the comparison exceeds a threshold value, to store it.
  - 4. An apparatus according to claim 1 or 2 in which the comparison of mouth data is carried out by subtraction of individual picture element values and summing the absolute values of the differences.
  - 5. An apparatus according to claim 1 or 2 including means for tracking the face position within the image and generating coded data representing that position.
  - 6. An apparatus according to any one of the preceding claims 1 or 2, in which the identification means is arranged during the second phase in the event that the result of the comparison between a mouth data part and that one of the stored set which it most closely resembles exceeds a predetermined threshold to output that data part and store it as part of the stored set.
  - 7. An apparatus according to any one of the preceding claims 1 or 2 further including identification means arranged in operation for each frame of the image to identify that part of the input data corresponding to the eyes of the face represented and
    - (a) in the first phase of operation to compare, for each frame, the eye data part thus identified with those of other frames to select a representative set of eye data parts, to store the selected parts and to output the selected parts;
    - (b) in the second phase to compare the eye data part of each frame with those of the stored set of eye data parts and to generate a codeword indicating which member of the set the eye data part of that frame most closely resembles.
  - 8. An image transmission system comprising an encoding apparatus according to any one of claims 1 or 2 and a receiver for decoding a moving image including a human face, the receiver comprising:
    - frame store means for receiving and storing data representing one frame of the image;
      
      means for repetitive readout of the frame store to produce a video signal;
      
      further store means receiving and storing a set of selected mouth data parts; and
      
      control means arranged in operation to receive input codewords and in response to each codeword to read out the corresponding mouth data part from the further store and to effect insertion of that data into the data supplied to the readout means.
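
The two phases of claim 1, with the threshold test of claim 3 and the sum-of-absolute-differences comparison of claim 4, can be sketched as follows. This is an illustrative sketch only, not the patented implementation: the names `sad`, `learn_codebook` and `encode`, the threshold value, and the reading of claim 3 as "store a patch when its smallest difference from the stored set exceeds the threshold" are all assumptions. Mouth regions are assumed to be already located and cropped to equal-sized grey-level patches.

```python
from typing import List

Patch = List[List[int]]  # grey-level mouth region as rows of pixel values


def sad(a: Patch, b: Patch) -> int:
    """Sum of absolute pixel differences between two equally sized patches."""
    return sum(abs(pa - pb) for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def learn_codebook(patches: List[Patch], threshold: int) -> List[Patch]:
    """First phase (assumed reading of claim 3): store the first patch, then
    store any later patch whose smallest difference from the stored set
    exceeds the threshold."""
    codebook: List[Patch] = []
    for patch in patches:
        if not codebook or min(sad(patch, c) for c in codebook) > threshold:
            codebook.append(patch)
    return codebook


def encode(patch: Patch, codebook: List[Patch]) -> int:
    """Second phase: the codeword is the index of the closest stored patch."""
    return min(range(len(codebook)), key=lambda i: sad(patch, codebook[i]))


# Toy 2x2 "mouth" patches, invented for illustration.
closed = [[0, 0], [0, 0]]
slightly_open = [[0, 1], [0, 0]]
wide_open = [[9, 9], [9, 9]]

codebook = learn_codebook([closed, slightly_open, wide_open], threshold=4)
codewords = [encode(p, codebook) for p in (slightly_open, wide_open)]
print(len(codebook), codewords)  # 2 [0, 1]
```

In this toy run the slightly open mouth is too close to the closed one to earn its own entry, so it is later encoded with the closed mouth's codeword.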

9. A receiver for decoding a moving image including a human face, the receiver comprising:
- frame store means for receiving and storing data representing one frame of the image;
  
  means for repetitive readout of the frame store to produce a video signal;
  
  further store means receiving and storing a set of selected mouth data parts; and
  
  control means arranged in operation to receive input codewords and in response to each codeword to read out the corresponding mouth data part from the further store and to effect insertion of that data into the data supplied to the readout means.
- Dependent claims (10, 11, 12, 13)
  - 10. A receiver according to claim 9 in which the control means is arranged to overwrite the frame store with the mouth data.
  - 11. A receiver according to claim 9 in which the control means is arranged to supply the mouth data to the readout means.
  - 12. A receiver according to claim 9, 10 or 11 including means responsive to input data to effect corresponding movement of the face within the area of the image.
  - 13. A receiver according to claim 9, 10 or 11 including means arranged to effect random movement of the face within the area of the image.
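
The receiver of claims 9 and 10 might be sketched along these lines. The `Receiver` class, its method names, and the fixed mouth position given by `mouth_top` and `mouth_left` are hypothetical and not taken from the patent. The sketch follows the claim 10 variant, in which the selected mouth data overwrites the frame store.

```python
from typing import Dict, List

Frame = List[List[int]]  # rows of grey-level pixel values


class Receiver:
    """Holds one still frame plus a store of mouth patches keyed by codeword."""

    def __init__(self, still_frame: Frame, mouth_top: int, mouth_left: int) -> None:
        self.frame_store: Frame = [row[:] for row in still_frame]
        self.mouth_store: Dict[int, Frame] = {}
        # Assumed: the mouth sits at a fixed position within the still frame.
        self.mouth_top = mouth_top
        self.mouth_left = mouth_left

    def store_mouth(self, codeword: int, patch: Frame) -> None:
        """Store one member of the set of selected mouth data parts."""
        self.mouth_store[codeword] = [row[:] for row in patch]

    def decode(self, codeword: int) -> Frame:
        """Overwrite the mouth area of the frame store and read the frame out."""
        patch = self.mouth_store[codeword]
        for dy, row in enumerate(patch):
            y = self.mouth_top + dy
            self.frame_store[y][self.mouth_left:self.mouth_left + len(row)] = row
        return [row[:] for row in self.frame_store]


# Toy 3x4 frame of zeros with a 1x2 mouth patch, invented for illustration.
rx = Receiver([[0, 0, 0, 0] for _ in range(3)], mouth_top=1, mouth_left=1)
rx.store_mouth(0, [[5, 6]])
rx.store_mouth(1, [[7, 8]])
frames = [rx.decode(cw) for cw in (0, 1, 0)]
```

Each received codeword costs only a table lookup and a small copy, which is why a stream of codewords can stand in for full video frames once the still frame and mouth set have been sent.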

14. A speech synthesiser including means for synthesis of a moving image including a human face, comprising:
- (a) means for storage and output of the image of a face;
  
  (b) means for storage and output of a set of mouth data blocks each corresponding to the mouth area of the face and representing a respective different mouth shape;
  
  (c) speech synthesis means responsive to input information identifying words or parts of words to be spoken;
  
  (d) means storing a table relating words or parts of words to codewords identifying mouth data blocks or sequences thereof;
  
  (e) control means responsive to the said input information to select for output the corresponding codewords or codeword sequences from the table.
- Dependent claims (15, 17)
  - 15. A synthesiser according to claim 14 in which the speech synthesis means includes means arranged in operation for processing and queuing the input information, the queue including flag codes indicating changes in mouth shape, and responsive to each flag code to transmit to the control means an indication following generation of the speech preceding that code in the queue, whereby the control means may synchronise the codeword output to the synthesised speech.
  - 17. An apparatus according to claim 14, 15 or 16 including a receiver for decoding a moving image including a human face, the receiver comprising:
    - frame store means for receiving and storing data representing one frame of the image;
      
      means for repetitive readout of the frame store to produce a video signal;
      
      further store means receiving and storing a set of selected mouth data parts; and
      
      control means arranged in operation to receive input codewords and in response to each codeword to read out the corresponding mouth data part from the further store and to effect insertion of that data into the data supplied to the readout means.
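
Elements (d) and (e) of claim 14 amount to a table lookup from words or parts of words to mouth-shape codewords. A minimal sketch, assuming the input has already been split into phoneme-like units: the table contents, the unit labels, and the fallback to a rest shape for unknown units are invented for illustration and do not come from the patent.

```python
from typing import Dict, List

# Hypothetical table: part of a word -> sequence of mouth-shape codewords.
MOUTH_TABLE: Dict[str, List[int]] = {
    "m": [0],      # lips closed
    "oo": [1],     # rounded
    "ah": [2],     # wide open
    "w": [1, 2],   # a unit may map to a sequence of shapes
}
REST_CODEWORD = 0  # assumed fallback when a unit is not in the table


def codewords_for(units: List[str]) -> List[int]:
    """Select the codeword sequence for each unit, in speaking order."""
    out: List[int] = []
    for unit in units:
        out.extend(MOUTH_TABLE.get(unit, [REST_CODEWORD]))
    return out


sequence = codewords_for(["m", "ah", "w", "zz"])
print(sequence)  # [0, 2, 1, 2, 0]
```

The resulting codewords are the same kind of value that the receiver's control means consumes, so the synthesiser can drive the display without any camera input.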

16. An apparatus for synthesis of a moving image, comprising:
- (a) means for storage and output of the image of a face;
  
  (b) means for storage and output of a set of mouth data blocks each corresponding to the mouth area of the face and representing a respective different mouth shape;
  
  (c) an audio input for receiving speech signals and frequency analysis means responsive to such signals to produce sequences of spectral parameters;
  
  (d) means storing a table relating spectral parameter sequences to codewords identifying mouth data blocks or sequences thereof;
  
  (e) control means responsive to the said spectral parameters to select for output the corresponding codewords or codeword sequences from the table.
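
For claim 16, the table is indexed by spectral parameters derived from real speech rather than by text. The claim does not say how an observed parameter set is matched against the table. The sketch below assumes a nearest-entry match by squared Euclidean distance, treats a single three-band vector rather than a sequence, and uses invented table values throughout.

```python
from typing import List, Sequence, Tuple

# Hypothetical table: (reference spectral parameters, mouth-shape codeword).
SPECTRAL_TABLE: List[Tuple[Tuple[float, ...], int]] = [
    ((0.9, 0.1, 0.0), 0),
    ((0.2, 0.7, 0.1), 1),
    ((0.1, 0.2, 0.7), 2),
]


def codeword_for(params: Sequence[float]) -> int:
    """Return the codeword of the table entry closest to the observed parameters.

    Nearest-entry matching is an assumption made for this sketch.
    """
    def squared_distance(entry: Tuple[Tuple[float, ...], int]) -> float:
        reference, _ = entry
        return sum((p - r) ** 2 for p, r in zip(params, reference))

    return min(SPECTRAL_TABLE, key=squared_distance)[1]


# One analysed speech frame per vector; the values are made up.
speech_frames = [(0.8, 0.2, 0.0), (0.0, 0.3, 0.7), (0.3, 0.6, 0.1)]
codewords = [codeword_for(frame) for frame in speech_frames]
print(codewords)  # [0, 2, 1]
```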

Current Assignee: British Telecommunications PLC (BT Group PLC)
Original Assignee: British Telecommunications PLC (BT Group PLC)
Inventors: Welsh, William J.; Fenn, Brian A.; Challener, Paul
Primary Examiner(s): Wong, Peter S.
Assistant Examiner(s): VOELTZ, EMANUEL T
Application Number: US06/930,473
Time in Patent Office: 949 Days
Field of Search: 381/36-51, 364/513, 364/513.5, 379/52-54, 382/1, 382/2, 382/16, 382/18, 382/19
US Class Current: 704/260
CPC Class Codes:
G06T 9/001   Model-based coding, e.g. wi...
G10L 2021/105   Synthesis of the lips movem...
H04N 19/20   using video object coding
