Detection of transitions between text and non-text frames in a video stream
Abstract
Detecting the start of a credit roll within a video program may allow for the automatic extension of video recordings, among other functions. The start of the credit roll may be detected by determining the number of text blocks within a sequence of frames and identifying a point in the sequence of frames where a difference between the number of text blocks in frames occurring before the point and the number of text blocks in frames occurring after the point is greatest and exceeds a specified threshold. Text blocks may be identified within each frame by partitioning the frame into one or more segments and recording the segments having a pixel of a sufficiently high contrast. Contiguous segments may be merged or combined into single blocks, which may then be filtered to remove noise and false positives. Additional content may be inserted into the credit roll frames.
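The boundary search described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name `find_credit_roll_start`, the use of averages over all frames before and after each candidate point, and the default `threshold` value are assumptions made only for illustration.

```python
def find_credit_roll_start(block_counts, threshold=2.0):
    """Return the index of the frame where the credit roll may begin.

    block_counts: number of text blocks detected in each frame, in order.
    threshold: minimum difference required to report a boundary
               (hypothetical default, not taken from the patent).
    Returns None when no candidate point exceeds the threshold.
    """
    best_index, best_jump = None, threshold
    total = len(block_counts)
    for i in range(1, total):
        # Assumption: compare the average count before the point
        # with the average count from the point onward.
        before = sum(block_counts[:i]) / i
        after = sum(block_counts[i:]) / (total - i)
        jump = after - before
        if jump > best_jump:
            best_index, best_jump = i, jump
    return best_index


# Few text blocks during the program, many once the credits begin.
counts = [0, 1, 0, 1, 0, 6, 7, 6, 7, 6]
start = find_credit_roll_start(counts)
```

In this example the largest difference occurs at index 5, the first frame with a high text-block count. A sequence with a constant count yields `None`, because no difference exceeds the threshold.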
19 Claims
1. A method comprising:
    determining, by a computing device, a boundary between a first portion of a video and a second portion of the video, the determining comprising:
        determining a text detection window, wherein a height of the text detection window is less than a width of the text detection window if a text language of the video is intended to be read in a horizontal manner,
        determining, using the text detection window, a number of text blocks within a first plurality of video frames of the video and a number of text blocks within a second plurality of video frames following the first plurality of video frames in the video, and
        comparing the number of text blocks within the first plurality of video frames with the number of text blocks within the second plurality of video frames.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
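As a rough illustration of claim 1, the following Python sketch picks a window shape from the reading direction and compares text-block counts between two groups of frames. The function names, the pixel sizes, and the handling of vertical text are assumptions; the claim itself only states the height-less-than-width condition for horizontally read text.

```python
def text_detection_window(reading_direction, short_side=8, long_side=32):
    """Return (height, width) in pixels for the text detection window.

    short_side and long_side are hypothetical sizes chosen for illustration.
    """
    if short_side >= long_side:
        raise ValueError("short_side must be smaller than long_side")
    if reading_direction == "horizontal":
        # Claim 1: height is less than width for horizontally read text.
        return (short_side, long_side)
    if reading_direction == "vertical":
        # Assumption: the window is transposed for vertically read text.
        return (long_side, short_side)
    raise ValueError("unknown reading direction: " + repr(reading_direction))


def compare_block_counts(first_counts, second_counts):
    """Return the average count in the second group minus the first."""
    if not first_counts or not second_counts:
        raise ValueError("both groups of frames must be non-empty")
    first_avg = sum(first_counts) / len(first_counts)
    second_avg = sum(second_counts) / len(second_counts)
    return second_avg - first_avg


height, width = text_detection_window("horizontal")
difference = compare_block_counts([0, 1, 1], [6, 7, 8])
```

A large positive difference suggests that the second group of frames contains far more text than the first, which is the pattern expected at the start of a credit roll.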
9. A method comprising:
    dividing, by a computing device, a video frame, of a plurality of video frames in a video, into sections using a text detection window of a defined size;
    determining, within the sections of the video frame, contiguous sections having at least a portion with a contrast greater than a contrast threshold;
    combining the determined contiguous sections to form one or more text blocks;
    determining a number of formed text blocks in each of the plurality of video frames; and
    applying a wavelet to the determined number of formed text blocks to determine a beginning of a credit roll in the video.
Dependent claims: 10, 11, 12, 13
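The combining and wavelet steps of claim 9 might be sketched in Python as follows. This is a hedged illustration only: the claim does not name a particular wavelet, so the Haar-style step wavelet, the 4-connected merging rule, and the names `count_text_blocks`, `haar_response`, and `half_width` are all assumptions.

```python
def count_text_blocks(flags):
    """Count groups of 4-connected True cells in a 2D grid of section flags.

    flags[r][c] is True when the section at row r, column c exceeded the
    contrast threshold. Each connected group is treated as one text block.
    """
    rows = len(flags)
    cols = len(flags[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    blocks = 0
    for r in range(rows):
        for c in range(cols):
            if not flags[r][c] or seen[r][c]:
                continue
            blocks += 1
            seen[r][c] = True
            stack = [(r, c)]
            while stack:  # flood fill merges contiguous sections
                y, x = stack.pop()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and flags[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        stack.append((ny, nx))
    return blocks


def haar_response(counts, half_width=3):
    """Correlate per-frame counts with a Haar step wavelet (-1 then +1)."""
    responses = {}
    for i in range(half_width, len(counts) - half_width + 1):
        left = sum(counts[i - half_width:i])
        right = sum(counts[i:i + half_width])
        responses[i] = (right - left) / half_width
    return responses


grid = [
    [True, True, False, False],
    [False, False, False, True],
    [True, False, False, True],
]
blocks_in_frame = count_text_blocks(grid)

responses = haar_response([0, 0, 0, 0, 5, 5, 5, 5], half_width=2)
likely_start = max(responses, key=responses.get)
```

The frame index with the strongest positive response is a candidate for the beginning of the credit roll. In practice a minimum response would likely also be required, and the noise filtering mentioned in the abstract is omitted here.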
14. A method comprising:
    determining a number of text blocks within each of a plurality of video frames of a video content item, the determining comprising:
        dividing each of the plurality of video frames into sections based on a size of a text detection window, wherein a relationship between a height and a width of the text detection window is defined based on an intended reading direction of text included in the video content item, and
        determining which of the sections comprises a portion having a determined contrast greater than a contrast threshold; and
    determining a boundary between a first portion and a second portion of the video content item based on the determined number of text blocks within each of the plurality of video frames.
Dependent claims: 15, 16
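The dividing and contrast steps of claim 14 could look like the Python sketch below. The contrast measure is not defined in this excerpt, so the sketch assumes a grayscale frame and uses the difference between the brightest and darkest pixel in each section. The name `flag_sections` and the example sizes are hypothetical.

```python
def flag_sections(frame, window_height, window_width, contrast_threshold):
    """Divide a frame into window-sized sections and flag high-contrast ones.

    frame: 2D list of grayscale pixel values (assumed range 0 to 255).
    Returns a 2D list of booleans, one per section. Sections at the right
    and bottom edges may be smaller than the window.
    """
    rows = len(frame)
    cols = len(frame[0]) if rows else 0
    flags = []
    for top in range(0, rows, window_height):
        row_flags = []
        for left in range(0, cols, window_width):
            pixels = [
                frame[y][x]
                for y in range(top, min(top + window_height, rows))
                for x in range(left, min(left + window_width, cols))
            ]
            # Assumption: contrast is the pixel value range in the section.
            contrast = max(pixels) - min(pixels)
            row_flags.append(contrast > contrast_threshold)
        flags.append(row_flags)
    return flags


# A dark 4x8 frame with one bright pixel, scanned with a 2x4 window
# (height less than width, as for horizontally read text).
frame = [[0] * 8 for _ in range(4)]
frame[0][5] = 255
flags = flag_sections(frame, window_height=2, window_width=4,
                      contrast_threshold=100)
```

Only the upper-right section contains the bright pixel, so it is the only section flagged as a possible text area.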
17. A method comprising:
    determining, by a computing device, text areas within a first plurality of video frames of a content item, the determining comprising
        dividing each of the first plurality of video frames into sections based on a size of a text detection window comprising a height and a width, the height and the width being defined by an intended reading direction of text in the video frames, and
        evaluating a contrast of each of the sections; and
    determining whether a boundary between a first portion and a second portion of the content item exists within the first plurality of video frames based on a number of text areas within each of the first plurality of video frames.
Dependent claims: 18, 19
Specification