Method and apparatus for detecting text associated with video
First Claim
1. A method of detecting a text associated with video, the method comprising:
- reading a t-th frame (where t is a positive integer) among frames forming the video as a current frame;
determining whether there is a text area detected from a previous frame which is a (t−
N)-th (where N is a positive integer) frame among the frames forming the video, in the current frame;
upon determining that there is no text area detected from the previous frame in the current frame, detecting the text area in the entire current frame; and
upon determining that there is the text area detected from the previous frame in the current frame, detecting the text area from a remaining area obtained by excluding from the current frame an area corresponding to the text area detected from the previous frame and verifying whether there is a text area in a next frame which is a (t+N)-th frame among the frames forming the video,wherein the detecting of the text area from the current frame comprisesdetecting a character block from the current frame,determining whether the current frame is a text frame, using the detected character block, andupon determining that the current frame is the text frame, post-processing the text frame and determining a text area from the text frame.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for detecting text associated with video are provided. The method of detecting the text of the video includes reading a t-th frame (where t is a positive integer) among frames forming the video as a current frame, determining whether there is a text area detected from a previous frame which is a (t−N)-th (where N is a positive integer) frame among the frames forming the video, in the current frame, and upon determining that there is no text area detected from the previous frame in the current frame, detecting the text area in the entire current frame. Upon determining that there is the text area detected from the previous frame in the current frame, the text area is detected from a remaining area obtained by excluding from the current frame an area corresponding to the text area detected from the previous frame. Whether there is a text area in a next frame which is a (t+N)-th frame among the frames forming the video is verified.
-
Citations
42 Claims
-
1. A method of detecting a text associated with video, the method comprising:
-
reading a t-th frame (where t is a positive integer) among frames forming the video as a current frame; determining whether there is a text area detected from a previous frame which is a (t−
N)-th (where N is a positive integer) frame among the frames forming the video, in the current frame;upon determining that there is no text area detected from the previous frame in the current frame, detecting the text area in the entire current frame; and upon determining that there is the text area detected from the previous frame in the current frame, detecting the text area from a remaining area obtained by excluding from the current frame an area corresponding to the text area detected from the previous frame and verifying whether there is a text area in a next frame which is a (t+N)-th frame among the frames forming the video, wherein the detecting of the text area from the current frame comprises detecting a character block from the current frame, determining whether the current frame is a text frame, using the detected character block, and upon determining that the current frame is the text frame, post-processing the text frame and determining a text area from the text frame. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A method of detecting a text associated with video, the method comprising:
-
reading a t-th frame (where t is a positive integer) among frames forming the video as a current frame; determining whether there is a text area detected from a previous frame which is a (t−
N)-th (where N is a positive integer) frame among the frames forming the video, in the current frame;upon determining that there is no text area detected from the previous frame in the current frame, detecting the text area in the entire current frame; and upon determining that there is the text area detected from the previous frame in the current frame, detecting the text area from a remaining area obtained by excluding from the current frame an area corresponding to the text area detected from the previous frame and verifying whether there is a text area in a next frame which is a (t+N)-th frame among the frames forming the video, wherein the detecting of the text area from the remaining area of the current frame comprises; detecting a character block from the remaining area of the current frame, determining whether the current frame is a text frame based on the remaining area of the current frame, and upon determining that the current frame is the text frame, post-processing the remaining area and determining the text area from the remaining area.
-
-
19. A method of detecting a text associated with video, the method comprising:
-
reading a t-th frame (where t is a positive integer) among frames forming the video as a current frame; determining whether there is a text area detected from a previous frame which is a (t−
N)-th (where N is a positive integer) frame among the frames forming the video, in the current frame;upon determining that there is no text area detected from the previous frame in the current frame, detecting the text area in the entire current frame; and upon determining that there is the text area detected from the previous frame in the current frame, detecting the text area from a remaining area obtained by excluding from the current frame an area corresponding to the text area detected from the previous frame and verifying whether there is a text area in a next frame which is a (t+N)-th frame among the frames forming the video, wherein the determining whether there is the text area in the next frame comprises; obtaining the number of brightness differences greater than an eleventh threshold value among brightness differences which are differences between brightness values of pixels in the text area determined in the t-th frame and brightness values of pixels in the text area of the (t+N)-th frame corresponding to the text area determined in the t-th frame, and upon determining that the number of the brightness differences greater than the eleventh threshold value is greater than a twelfth threshold value if there is no text area in the next frame, setting the number of the brightness differences greater than the eleventh threshold value as greater than the twelfth threshold value. - View Dependent Claims (20, 21)
-
-
22. An apparatus for detecting text associated with video, the apparatus comprising:
-
a frame inputting unit reading a t-th frame (where t is a positive integer) among frames forming the video as a current frame; a text area checking unit checking whether there is a text area detected from a previous frame which is a (t−
N)-th (where N is a positive integer) frame among the frames forming the video, in the current frame and outputting the checked results as a first control signal;a first text area detecting unit detecting text area from the current frame in response to the first control signal; a second text area detecting unit detecting text area from a remaining area obtained by excluding an area corresponding to the text area detected from the previous frame from the current frame in response to the first control signal; and a text area verifying unit verifying whether there is a text area in a next frame which is a (t+N)-th frame among the frames forming the video. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35)
-
-
36. A method of detecting a text associated with a video, comprising:
-
detecting the text area from a remaining area obtained by excluding from a current frame an area corresponding to the text area detected from a previous frame; and verifying whether there is a text area in a next frame which is a (t+N)-th frame among the frames forming the video, wherein the detecting the text area comprises; detecting a character block from the remaining area of the current frame, determining whether the current frame is a text frame based on the remaining area of the current frame, and upon determining that the current frame is the text frame, post-processing the remaining area and determining the text area from the remaining area.
-
-
37. An apparatus for detecting text associated with video, the apparatus comprising:
-
a first text area detecting unit detecting a first text area from a current frame in response to a first control signal; a second text area detecting unit detecting a second text area from a remaining area obtained by excluding an area corresponding to the first text area detected from the previous frame from the current frame in response to a first control signal; and a text area verifying unit verifying whether there is a text area in a next frame which is a (t+N)-th frame among the frames forming the video, wherein the first text area detecting unit comprises; a first character block detecting portion detecting a character block from the current frame in response to the first control signal, a first text frame checking portion checking whether the current frame is a text frame using the detected character block and outputting the checked results as a third control signal, and a first text area determining portion post-processing the text frame and determining a text area from the text frame in response to the third control signal.
-
-
38. An apparatus for detecting a text associated with video, comprising:
-
an isolated character block removing part removing an isolated character block in which there is no adjacent character block, from the character blocks included in a text frame in response to a control signal; a bounding box determining part determining a group of adjacent character blocks from the text frame, from which the isolated character block is removed, as a bounding box; a bounding box extracting part obtaining a number of sub-blocks included in each bounding box and extracting the bounding box having the number of sub-blocks greater than a first threshold value from the text frame when the current frame is divided into sub-blocks having a certain size; and a text area extracting part obtaining an average edge density with respect to each of extracted bounding boxes and determining a bounding box having an average edge density greater than a second threshold value as the text area. - View Dependent Claims (39, 40)
-
-
41. A computer-readable storage storing a computer-readable program which controls a computer system to execute detecting text associated with video, by:
-
reading a t-th frame (where t is a positive integer) among frames forming the video as a current frame; determining whether there is a text area detected from a previous frame which is a (t−
N)-th (where N is a positive integer) frame among the frames forming the video, in the current frame;upon determining that there is no text area detected from the previous frame in the current frame, detecting the text area in the entire current frame; and upon determining that there is the text area detected from the previous frame in the current frame, detecting the text area from a remaining area obtained by excluding from the current frame an area corresponding to the text area detected from the previous frame and verifying whether there is a text area in a next frame which is a (t+N)-th frame among the frames forming the video, wherein after the text area is detected or verified, determining whether the current frame is an end frame of the video, and upon determining that the current frame is not the end frame of the video, increasing t by a value of N and reading the current frame.
-
-
42. A computer-readable storage storing a computer-readable program which controls a computer system to execute detecting text associated with video, by:
-
reading a t-th frame (where t is a positive integer) among frames forming the video as a current frame; determining whether there is a text area detected from a previous frame which is a (t−
N)-th (where N is a positive integer) frame among the frames forming the video, in the current frame;upon determining that there is no text area detected from the previous frame in the current frame, detecting the text area in the entire current frame; upon determining that there is the text area detected from the previous frame in the current frame, detecting the text area from a remaining area obtained by excluding from the current frame an area corresponding to the text area detected from the previous frame and verifying whether there is a text area in a next frame which is a (t+N)-th frame among the frames forming the video; detecting a character block from the current frame; determining whether the current frame is a text frame, using the detected character block; and upon determining that the current frame is the text frame, post-processing the text frame and determining a text area from the text frame.
-
Specification