Video image formatting technique
Abstract
The formatting of a video image frame (10) having a first aspect ratio to yield an image frame having a different aspect ratio is facilitated by establishing a pan and scan pixel coordinate (P) within the image that defines an image capture window for formatting purposes. A predictor (22) determines the pan and scan pixel coordinate by examining the image frame to determine the location therein of the most pertinent activity. The predictor (22) determines the location of the most pertinent activity by examining at least one of: video data, audio data, and other data such as closed captioning information.
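The claims of this patent recite a weighted relationship, P=X1*F(V)+X2*F(A)+X3*F(D), for combining the three predictions. The following is a minimal sketch of that arithmetic, not the patented implementation. It assumes each prediction is a single horizontal pixel position; the function name `pan_scan_coordinate`, its parameter names, and the tolerance `tol` are hypothetical and do not come from the patent.

```python
def pan_scan_coordinate(f_v, f_a, f_d, x1, x2, x3, tol=1e-9):
    """Combine three predicted coordinates into one pan and scan coordinate P.

    f_v, f_a, f_d: coordinates predicted from video, audio and other data
                   (assumed here to be horizontal pixel positions).
    x1, x2, x3:    weighting factors, each strictly between 0 and 1,
                   summing to 1 as the claims require.
    tol:           hypothetical tolerance for the floating-point sum check.
    """
    weights = (x1, x2, x3)
    if not all(0.0 < w < 1.0 for w in weights):
        raise ValueError("each weighting factor must satisfy 0 < X < 1")
    if abs(sum(weights) - 1.0) > tol:
        raise ValueError("weighting factors must sum to 1")
    # P = X1*F(V) + X2*F(A) + X3*F(D)
    return x1 * f_v + x2 * f_a + x3 * f_d


# Illustrative values only: video is weighted twice as heavily as the others.
p = pan_scan_coordinate(f_v=800, f_a=1000, f_d=600, x1=0.5, x2=0.25, x3=0.25)
print(p)  # 800.0
```

Because the weights sum to 1, P always lies between the smallest and largest of the three predicted coordinates.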
4 Claims
1. A method for establishing a pan and scan pixel coordinate (P) that defines an image capture window useful for formatting an image having a first aspect ratio to yield an image having a second aspect ratio, comprising the steps of:
examining an incoming image frame by examining video, audio and image data associated with the image frame to ascertain a location therein of most pertinent activity; and
establishing the pan and scan pixel coordinate in accordance with the location in the incoming image frame of the most pertinent activity in accordance with a predetermined combination of the video, audio and image data wherein the pan and scan pixel coordinate (P) is determined from the video, audio, and image data in accordance with the relationship:
P = X1*F(V) + X2*F(A) + X3*F(D)
where
X1 = weighting factor assigned to the video data (0 < X1 < 1);
X2 = weighting factor assigned to the audio data (0 < X2 < 1);
X3 = weighting factor assigned to the image data (0 < X3 < 1);
F(V) = coordinate predicted using video data;
F(A) = coordinate predicted using audio data;
F(D) = coordinate predicted using image data;
with X1 + X2 + X3 = 1.
2. Apparatus for establishing a pan and scan pixel coordinate (P) that defines an image capture window useful for formatting an image from a first aspect ratio to a second aspect ratio, comprising:
means for examining the image frame to ascertain a location therein of most pertinent activity by examining video, audio and image data associated with the image frame;
means for establishing the pan and scan pixel coordinate in accordance with the location in the image of the most pertinent activity in accordance with a predetermined combination of the video, audio and image data;
wherein the determining means establishes the pan and scan pixel coordinate (P) in accordance with the relationship:
P = X1*F(V) + X2*F(A) + X3*F(D)
where
X1 = weighting factor assigned to the video data (0 < X1 < 1)
X2 = weighting factor assigned to the audio data (0 < X2 < 1)
X3 = weighting factor assigned to the image data (0 < X3 < 1)
F(V) = coordinate predicted using video data
F(A) = coordinate predicted using audio data
F(D) = coordinate predicted using image data
with X1 + X2 + X3 = 1.
3. A method for formatting an incoming image frame having a first aspect ratio to yield a formatted image frame having a second aspect ratio, comprising the step of:
establishing within the image frame a pan and scan pixel coordinate (P) that lies substantially coincident with a location in the image frame associated with most pertinent activity by examining video, audio and image data associated with the image frame, and
panning and scanning the incoming image frame with an image capture window centered about the pan and scan pixel location, to format incoming image frame to yield the formatted image data frame in accordance with a predetermined combination of the video, audio and image data wherein the pan and scan pixel coordinate (P) is determined in accordance with the relationship:
P = X1*F(V) + X2*F(A) + X3*F(D)
where
X1 = weighting factor assigned to the video data (0 < X1 < 1)
X2 = weighting factor assigned to the audio data (0 < X2 < 1)
X3 = weighting factor assigned to the image data (0 < X3 < 1)
F(V) = coordinate predicted using video data
F(A) = coordinate predicted using audio data
F(D) = coordinate predicted using image data
with X1 + X2 + X3 = 1.
4. Apparatus for formatting an incoming image frame having a first aspect ratio to yield a formatted image frame having a second aspect ratio,
means for establishing within the image frame a pan and scan pixel coordinate (P) that lies substantially coincident with a location in the image frame associated with most pertinent activity by examining video, audio and image data associated with the image frame, and
means for panning and scanning the incoming image frame with an image capture window centered about the pan and scan pixel coordinate to format incoming image frame to yield the formatted image frame in accordance with a predetermined combination of the video, audio and image data wherein establishing means establishes the pan and scan pixel coordinate (P) in accordance with the relationship:
P = X1*F(V) + X2*F(A) + X3*F(D)
where
X1 = weighting factor assigned to the video data (0 < X1 < 1)
X2 = weighting factor assigned to the audio data (0 < X2 < 1)
X3 = weighting factor assigned to the image data (0 < X3 < 1)
F(V) = coordinate predicted using video data
F(A) = coordinate predicted using audio data
F(D) = coordinate predicted using image data
with X1 + X2 + X3 = 1.
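Claims 3 and 4 recite panning and scanning with an image capture window centered about the pan and scan pixel coordinate. The sketch below illustrates one way such a window could be placed; it is not taken from the patent. It assumes a full-height window that moves only horizontally, and it clamps the window to the frame edges, which the claims do not recite. The function name `capture_window` and all parameter names are hypothetical.

```python
def capture_window(p_x, frame_w, frame_h, ratio_w, ratio_h):
    """Return (left, right) columns of a capture window centered about p_x.

    p_x:              horizontal pan and scan pixel coordinate.
    frame_w, frame_h: size of the incoming frame in pixels.
    ratio_w, ratio_h: target aspect ratio, e.g. 4 and 3 for 4:3.
    The right bound is exclusive, matching Python slice conventions.
    """
    # Full-height window whose width gives the target aspect ratio.
    win_w = frame_h * ratio_w // ratio_h
    if win_w > frame_w:
        raise ValueError("target window is wider than the incoming frame")
    # Center the window about p_x ...
    left = round(p_x - win_w / 2)
    # ... then clamp so it stays inside the frame (an assumption of this
    # sketch; near the edges the window is no longer centered on p_x).
    left = max(0, min(left, frame_w - win_w))
    return left, left + win_w


# Illustrative values: cropping a 1920x1080 (16:9) frame to 4:3.
print(capture_window(800, 1920, 1080, 4, 3))   # (80, 1520)
print(capture_window(100, 1920, 1080, 4, 3))   # (0, 1440)
print(capture_window(1900, 1920, 1080, 4, 3))  # (480, 1920)
```

With the frame held as a row-major array, the formatted frame would then be the column slice `left:right` of each row.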
Specification