Three-dimensional image coding by merger of left and right images
First Claim
1. A method of encoding a signal with a three dimensional image sequence using left and right video image sequences, each of the video images being a picture constituted by image lines obtained by scanning, said method comprising the steps of:
merging left line images contained in a left picture with right line images contained in a right picture continuously to produce a plurality of alternately arranged left and right line merged pictures;
encoding one of said merged pictures as a first merged picture to produce an I-picture;
predictively encoding one of said merged pictures as a second merged picture by using said first merged picture, producing a P-picture; and
predictively encoding one of said merged pictures as a third merged picture located between said first and second merged pictures, by using at least one of said first and second merged pictures, producing a B-picture,
wherein each of said predictively encoding steps for the P-picture and B-picture comprises the steps of:
separating the corresponding merged picture into a plurality of 16×16 pixel macroblocks;
calculating a first value indicative of the predictive encoding efficiency performed on each of said 16×16 pixel macroblocks by using a target macroblock and one of said first and second merged pictures;
separating each of said plurality of 16×16 pixel macroblocks into left and right 16×8 pixel macroblock halves, each of said left halves containing left line images, and each of said right halves containing right line images;
separating at least one of said first and second merged pictures into a right field and a left field, said right field containing right line images and said left field containing left line images;
calculating a second value indicative of the predictive encoding efficiency performed on each of the left and right 16×8 pixel macroblock halves, by using one of said right and left 16×8 pixel macroblock halves in combination with said right and left fields, wherein said first value is higher than said second value when the correlation between the left and right line images in each of said 16×16 pixel macroblocks is higher than the correlation between one of said left and right 16×8 pixel macroblock halves and one of said left and right fields;
selecting the most efficient predictive encoding process between said first and second values, wherein the predictive encoding process of each of said 16×16 pixel macroblocks is selected when said first value is higher than said second value; and
predictively encoding the corresponding merged picture to produce the P-picture or B-picture, based on the most efficient of the predictive encoding processes.
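The per-macroblock selection recited in claim 1 can be illustrated with a minimal Python sketch. Several things here are assumptions rather than anything the claim states: efficiency is measured as a negated sum of absolute differences (SAD), only the co-located reference block is examined (no motion search), left lines are taken to occupy the even rows, and `FIELD_MODE_PENALTY` is a hypothetical stand-in for the extra side information needed when the two halves are predicted separately. All function names are illustrative.

```python
# Hypothetical cost of the extra side information used when the two
# macroblock halves are predicted separately (assumption, not from the claim).
FIELD_MODE_PENALTY = 16


def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(
        abs(a - b)
        for row_a, row_b in zip(block_a, block_b)
        for a, b in zip(row_a, row_b)
    )


def split_halves(block):
    """Split merged lines into (left lines, right lines).

    Assumes left lines sit on even rows and right lines on odd rows.
    """
    return block[0::2], block[1::2]


def select_prediction(target_mb, reference_mb):
    """Return (mode, first_value, second_value) for one macroblock."""
    # First value: predict the whole merged macroblock from the merged reference.
    first_value = -sad(target_mb, reference_mb)

    # Second value: predict each half from whichever reference field fits best.
    target_left, target_right = split_halves(target_mb)
    left_field, right_field = split_halves(reference_mb)
    cost_left = min(sad(target_left, left_field), sad(target_left, right_field))
    cost_right = min(sad(target_right, left_field), sad(target_right, right_field))
    second_value = -(cost_left + cost_right + FIELD_MODE_PENALTY)

    mode = "macroblock" if first_value > second_value else "halves"
    return mode, first_value, second_value
```

With a target identical to its reference the whole-macroblock mode wins, because the halves gain nothing and still pay the penalty. When the target's left lines resemble the reference's right lines, predicting the halves separately wins. The sketch works on any block with an even number of rows, so small blocks are enough to try it out.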
Abstract
A method is disclosed for encoding a signal with a three dimensional image sequence using a series of left and right images. Each image in the left image series is a picture formed by non-interlaced or interlaced scanned left line images, and each image in the right image series is a picture formed by non-interlaced or interlaced scanned right line images. The left line images contained in the left picture are merged with the right line images contained in the right picture to produce a merged picture in which left and right lines alternate. The merged picture is encoded using an MPEG-2 compliant encoder.
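As a rough illustration of the merging step described in the abstract, the following Python sketch interleaves the lines of a left picture and a right picture. It assumes each picture is a list of scan lines, that both pictures have the same number of lines, and that the left line comes first in each pair; the abstract does not fix the starting view, and the function name `merge_pictures` is hypothetical.

```python
def merge_pictures(left_picture, right_picture):
    """Interleave scan lines: left line 0, right line 0, left line 1, ...

    Each picture is a list of lines; each line is a list of pixel values.
    """
    if len(left_picture) != len(right_picture):
        raise ValueError("left and right pictures must have the same number of lines")
    merged = []
    for left_line, right_line in zip(left_picture, right_picture):
        merged.append(left_line)   # even row of the merged picture
        merged.append(right_line)  # odd row of the merged picture
    return merged
```

The merged picture has twice as many lines as either input. Under the left-first assumption, slicing it with `merged[0::2]` and `merged[1::2]` recovers the left and right pictures, which is the same layout as the two fields of an interlaced frame.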
17 Claims
1. A method of encoding a signal with a three dimensional image sequence using left and right video image sequences, each of the video images being a picture constituted by image lines obtained by scanning, said method comprising the steps of:
merging left line images contained in a left picture with right line images contained in a right picture continuously to produce a plurality of alternately arranged left and right line merged pictures;
encoding one of said merged pictures as a first merged picture to produce an I-picture;
predictively encoding one of said merged pictures as a second merged picture by using said first merged picture, producing a P-picture; and
predictively encoding one of said merged pictures as a third merged picture located between said first and second merged pictures, by using at least one of said first and second merged pictures, producing a B-picture,
wherein each of said predictively encoding steps for the P-picture and B-picture comprises the steps of:
separating the corresponding merged picture into a plurality of 16×16 pixel macroblocks;
calculating a first value indicative of the predictive encoding efficiency performed on each of said 16×16 pixel macroblocks by using a target macroblock and one of said first and second merged pictures;
separating each of said plurality of 16×16 pixel macroblocks into left and right 16×8 pixel macroblock halves, each of said left halves containing left line images, and each of said right halves containing right line images;
separating at least one of said first and second merged pictures into a right field and a left field, said right field containing right line images and said left field containing left line images;
calculating a second value indicative of the predictive encoding efficiency performed on each of the left and right 16×8 pixel macroblock halves, by using one of said right and left 16×8 pixel macroblock halves in combination with said right and left fields, wherein said first value is higher than said second value when the correlation between the left and right line images in each of said 16×16 pixel macroblocks is higher than the correlation between one of said left and right 16×8 pixel macroblock halves and one of said left and right fields;
selecting the most efficient predictive encoding process between said first and second values, wherein the predictive encoding process of each of said 16×16 pixel macroblocks is selected when said first value is higher than said second value; and
predictively encoding the corresponding merged picture to produce the P-picture or B-picture, based on the most efficient of the predictive encoding processes.
5. An apparatus for encoding a signal with a three dimensional image sequence using left and right video image sequences, each of the video images being a picture constituted by line images obtained by scanning, said method comprising the step of using an encoding apparatus, wherein said encoding apparatus includes:
a mixer for merging left line images contained in a left picture with right line images contained in a right picture to produce an alternately arranged left and right line merged picture, wherein said merged picture includes a plurality of 16×16 pixel macroblocks;
an encoder for executing the encoding on each of said macroblocks of the merged picture, wherein said encoder includes:
first encoder for performing inter-frame predictive encoding as well as motion compensation on each of said macroblocks;
second encoder for separating each macroblock into left and right 16×8 pixel macroblock halves, each of said left halves containing the left line images and each of said right halves containing the right line images, and for performing inter-field predictive encoding as well as motion compensation on each of said 16×8 macroblock halves; and
selector for selecting one of encoded data produced by said first encoder and data produced by said second encoder, wherein said selector selects said inter-frame predictive encoding as well as motion compensation of said first encoder when the correlation between the left and right line images in each of said 16×16 pixel macroblocks is higher than one of the correlation between the adjacent left line images in said left 16×8 pixel macroblock half and the correlation between the adjacent right line images in said right 16×8 pixel macroblock half.
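The selector condition in claim 5 compares line-to-line correlations inside a macroblock. The Python sketch below is one possible reading, under stated assumptions: correlation is approximated by the negated mean absolute difference between adjacent lines, left lines occupy the even rows, and "higher than one of" is read as "higher than at least one of" the two same-view correlations. The function names and returned labels are hypothetical.

```python
def line_difference(line_a, line_b):
    """Sum of absolute pixel differences between two lines."""
    return sum(abs(a - b) for a, b in zip(line_a, line_b))


def adjacent_similarity(lines):
    """Negated mean difference between neighbouring lines (higher = more alike).

    Requires at least two lines.
    """
    diffs = [line_difference(lines[i], lines[i + 1]) for i in range(len(lines) - 1)]
    return -sum(diffs) / len(diffs)


def choose_encoder(macroblock):
    """Pick 'first encoder' (inter-frame) or 'second encoder' (inter-field)."""
    left_half = macroblock[0::2]   # left line images (assumed even rows)
    right_half = macroblock[1::2]  # right line images (assumed odd rows)

    # Neighbouring rows of the merged macroblock are always a left/right pair.
    left_right = adjacent_similarity(macroblock)
    left_left = adjacent_similarity(left_half)
    right_right = adjacent_similarity(right_half)

    # "Higher than one of" is read here as "higher than at least one of".
    if left_right > min(left_left, right_right):
        return "first encoder"
    return "second encoder"
```

When the two views are nearly identical, neighbouring merged lines are more alike than lines two rows apart within one view, so the sketch picks the first encoder. When the views differ strongly, such as with large disparity, each half is smoother on its own and the second encoder is chosen.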
7. An apparatus for encoding a signal with a three dimensional image sequence using left and right video image sequences, each of the video images being a picture constituted by image lines obtained by scanning, said apparatus comprising:
a mixer for merging left line images contained in a left picture with right line images contained in a right picture continuously to produce a plurality of alternately arranged left and right line merged pictures; and
an encoder for encoding one of said merged pictures as a first merged picture to produce an I-picture, for predictively encoding one of said merged pictures as a second merged picture by using said first merged picture to produce a P-picture, and for predictively encoding one of said merged pictures as a third merged picture located between said first and second merged pictures by using at least one of said first and second merged pictures to produce a B-picture, wherein said encoder includes:
a macroblock converter for separating a merged picture into a plurality of 16×16 pixel macroblocks;
a first value calculating circuit for calculating a first value indicative of the predictive encoding efficiency performed on each of said 16×16 pixel macroblocks by using a target macroblock and one of said first and second merged pictures;
a first separator for separating each of said plurality of 16×16 pixel macroblocks into left and right 16×8 pixel macroblock halves, each of said left halves containing left line images, and each of said right halves containing right line images;
a second separator for separating at least one of said first and second merged pictures into a right field and a left field, said right field containing right line images and said left field containing left line images;
a second value calculating circuit for calculating a second value indicative of the predictive encoding efficiency performed on each of the left and right macroblocks, by using one of said right and left macroblock halves in combination with said right and left fields, wherein said first value is higher than said second value when the correlation between the left and right line images in each of said 16×16 pixel macroblocks is higher than the correlation between one of said left and right 16×8 pixel macroblock halves and one of said left and right fields; and
a value comparator for selecting the most efficient predictive encoding process between said first and second values, wherein said value comparator selects the predictive encoding process of each of said 16×16 pixel macroblocks when said first value is higher than said second value, whereby said encoder predictively encodes a merged picture to produce the P-picture or B-picture, based on the most efficient of the predictive encoding processes.
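Because the B-picture recited above lies between the first and second merged pictures and may be predicted from both, an encoder of this kind generally has to code the later reference picture before the B-picture. The Python sketch below shows that reordering in the usual MPEG style; it illustrates the general technique, not a procedure taken from the claims, and the function name and the picture-type strings are assumptions.

```python
def coding_order(picture_types):
    """Map display-order picture types ('I', 'P', 'B') to coding-order indices.

    B-pictures are held back until the next I- or P-picture (their later
    reference) has been emitted.
    """
    order = []
    pending_b = []
    for index, picture_type in enumerate(picture_types):
        if picture_type == "B":
            pending_b.append(index)
        else:
            order.append(index)      # reference picture is coded first
            order.extend(pending_b)  # then the B-pictures that precede it
            pending_b = []
    order.extend(pending_b)          # trailing B-pictures with no later reference
    return order
```

For the three-picture case in the claim, a display order of I, B, P gives `coding_order(["I", "B", "P"])`, which returns `[0, 2, 1]`: the I-picture, then the P-picture, then the B-picture between them.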
12. A method of encoding a signal with a three dimensional image sequence using left and right video image sequences, each of the video images being a picture constituted by image lines obtained by scanning, said method comprising the steps of:
merging the left line images contained in said left picture with the right line images contained in said right picture to produce an alternately arranged left and right line merged blocks for use in a first encoding process;
producing first blocks, each constituted by left line images and second blocks, each constituted by right line images for use in a second encoding process;
estimating a first predictive encoding efficiency of said first encoding process and a second predictive encoding efficiency of said second encoding process;
comparing the first and second predictive encoding efficiencies;
selecting the most efficient predictive encoding process based on the comparing result; and
predictively encoding the corresponding blocks to produce an encoded picture based on the most efficient of the predictive encoding processes.
15. A method of encoding a signal with a three dimensional image sequence using left and right video image sequences, each of the video images being a picture constituted by image lines obtained by scanning, said method comprising the steps of:
merging the left line images contained in said left picture with the right line images contained in said right picture to produce an alternately arranged left and right line merged blocks for use in a first encoding process;
producing first blocks, each constituted by left line images and second blocks each constituted by right line images for use in a second encoding process;
estimating a first predictive encoding efficiency of said first encoding process and a second predictive encoding efficiency of said second encoding process, wherein said first predictive encoding efficiency is higher than said second predictive encoding efficiency when the correlation between the left and right line images in said each merged block is higher than one of the correlation between the adjacent left lines in said each first block and the correlation between the adjacent right lines in said each second block;
comparing the first and second predictive encoding efficiencies;
selecting the most efficient predictive encoding process based on the comparing result, wherein the predictive encoding process of said merged blocks is selected when said first predictive encoding efficiency is higher than said second predictive encoding efficiency; and
predictively encoding the corresponding blocks to produce an encoded picture based on the most efficient of the predictive encoding processes.
Specification