Apparatus and method for editing content
First Claim
Patent Images
1. A method of a content editing apparatus for editing moving image content, the method comprising:
- acquiring moving image content;
generating a face database (DB) of face images based on face data extracted from the moving image content;
generating a speech DB of speech based on speech data extracted from the moving image content;
mapping an image of a person included in the face DB with speech data of the person included in the speech DB;
selecting at least one frame among frames included in the moving image content; and
creating edited image content of the moving image content using the mapped image and speech data, and the selected at least one frame,wherein the selecting of the at least one frame among the frames included in the moving image content comprises;
determining at least one scene among scenes of the moving image content based on a voice level variation and a voice frequency variation of voice data in the moving image content, andselecting at least one frame among frames included in the determined scene, the at least one frame comprising fewer than all of the frames included in the determined scene,wherein the edited image content includes at least one image segment corresponding to the selected at least one frame, andwherein the image segment includes an image of the selected frame and speech data corresponding to the selected frame.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and a method for editing moving image content are provided. The method includes acquiring moving image content, mapping an image of a person included in the moving image content and speech data of the person, selecting at least one frame among frames included in the moving image content, and creating edited content of the moving image content using the mapped image and speech data, and the selected at least one frame.
79 Citations
22 Claims
-
1. A method of a content editing apparatus for editing moving image content, the method comprising:
-
acquiring moving image content; generating a face database (DB) of face images based on face data extracted from the moving image content; generating a speech DB of speech based on speech data extracted from the moving image content; mapping an image of a person included in the face DB with speech data of the person included in the speech DB; selecting at least one frame among frames included in the moving image content; and creating edited image content of the moving image content using the mapped image and speech data, and the selected at least one frame, wherein the selecting of the at least one frame among the frames included in the moving image content comprises; determining at least one scene among scenes of the moving image content based on a voice level variation and a voice frequency variation of voice data in the moving image content, and selecting at least one frame among frames included in the determined scene, the at least one frame comprising fewer than all of the frames included in the determined scene, wherein the edited image content includes at least one image segment corresponding to the selected at least one frame, and wherein the image segment includes an image of the selected frame and speech data corresponding to the selected frame. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 22)
-
-
12. A content editing apparatus for editing moving image content, the content editing apparatus comprising:
-
a memory configured to; store a face database (DB) of face images included in the moving image content, store a speech DB of speech included in the moving image content, and store a mapping of an image of a person included in the face DB with speech data of the person included in the speech DB; and at least one processor configured to; generate the face DB based on face data extracted from the moving image content, generate the speech DB based on speech data extracted from the moving image content, map the image of a person included in the face DB with the speech data of the person included in the speech DB, select at least one frame among frames included in the moving image content, and create edited image content of the moving image content using the mapped image and speech data, and the selected at least one frame, wherein the at least one processor is further configured to; determine at least one scene among scenes of the moving image content based on a voice level variation and a voice frequency variation of voice data in the moving image content, and select at least one frame among frames included in the determined scene, the at least one frame comprising fewer than all of the frames included in the determined scene, wherein the edited image content includes at least one image segment corresponding to the selected at least one frame, and wherein the image segment includes an image of the selected frame and speech data corresponding to the selected frame. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20, 21)
-
Specification