Digital video content fingerprinting based on scale invariant interest region detection with an array of anisotropic filters
Abstract
Video sequence processing is described with various filtering rules applied to extract dominant features for content based video sequence identification. Active regions are determined in video frames of a video sequence. Video frames are selected in response to temporal statistical characteristics of the determined active regions. A two pass analysis is used to detect a set of initial interest points and interest regions in the selected video frames to reduce the effective area of images that are refined by complex filters that provide accurate region characterizations resistant to image distortion for identification of the video frames in the video sequence. Extracted features and descriptors are robust with respect to image scaling, aspect ratio change, rotation, camera viewpoint change, illumination and contrast change, video compression/decompression artifacts and noise. Compact, representative signatures are generated for video sequences to provide effective query video matching and retrieval in a large video database.
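The first pass described above can be illustrated with a minimal sketch. It assumes, since this summary does not define it, that a "bi-level filter" is a kernel with only two coefficient values (here a square center-surround kernel), and that reducing the effective area means cropping small patches around each detected point. The names `bilevel_kernel` and `first_pass`, the kernel shape, the threshold, and the patch size are all hypothetical choices for illustration.

```python
def bilevel_kernel(r_in, r_out):
    """Square center-surround kernel that uses only two coefficient values."""
    n_in = (2 * r_in + 1) ** 2
    n_ring = (2 * r_out + 1) ** 2 - n_in
    neg = -n_in / n_ring  # surround weight chosen so the kernel sums to zero
    return [[1.0 if max(abs(dx), abs(dy)) <= r_in else neg
             for dx in range(-r_out, r_out + 1)]
            for dy in range(-r_out, r_out + 1)]


def first_pass(image, r_in=1, r_out=3, threshold=4.5, half=4):
    """Return initial interest points (x, y) and one small patch per point.

    Assumed behavior, not taken from the patent text: a point is kept when
    its filter response exceeds `threshold` and is a strict maximum among
    its eight neighbors.
    """
    h, w = len(image), len(image[0])
    k = bilevel_kernel(r_in, r_out)
    # Positions where the kernel does not fit keep a response of -inf.
    resp = [[float("-inf")] * w for _ in range(h)]
    for y in range(r_out, h - r_out):
        for x in range(r_out, w - r_out):
            resp[y][x] = sum(k[dy + r_out][dx + r_out] * image[y + dy][x + dx]
                             for dy in range(-r_out, r_out + 1)
                             for dx in range(-r_out, r_out + 1))
    neighbors = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1)
                 if (dy, dx) != (0, 0)]
    points = []
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            v = resp[y][x]
            if v > threshold and all(v > resp[y + dy][x + dx]
                                     for dy, dx in neighbors):
                points.append((x, y))
    # Only these patches would be handed to the more expensive second pass.
    patches = [[row[max(0, x - half):x + half + 1]
                for row in image[max(0, y - half):y + half + 1]]
               for x, y in points]
    return points, patches


# Usage: a 21x21 image containing one bright 3x3 block centered at (10, 10).
image = [[1.0 if 9 <= x <= 11 and 9 <= y <= 11 else 0.0 for x in range(21)]
         for y in range(21)]
points, patches = first_pass(image)
```

Because the kernel has only two levels, each response reduces to two region sums, which is what makes a cheap first pass plausible; the brute-force loops here are for clarity only.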
27 Claims
1. A method for content based video sequence identification comprising:
applying a bi-level filter to images in a first pass analysis to detect a set of initial interest points in a plurality of selected video frames, wherein the first pass analysis reduces the effective area of the images in each selected video frame to multiple smaller images; and
applying an array of anisotropic filters to regions of pixels around each initial interest point of the set of initial interest points in a second pass analysis to refine a spatial position for each initial interest point and determine a first scale parameter in the x direction (sx) and a second scale parameter in the y direction (sy), wherein the sx and the sy scale parameters are separately varied to provide accurate region characterizations that are resistant to image distortion for identification of the plurality of selected video frames in a video sequence.
Dependent claims: 2-19
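As an illustration of varying sx and sy separately, the sketch below evaluates a small array of anisotropic filters at one interest point and keeps the scale pair with the strongest response. The claim does not specify the filter kernels; this example assumes sampled second derivatives of an anisotropic Gaussian and a scale-normalized Hessian determinant as the response. The names `hessian_kernels`, `normalized_det`, and `best_scales` are hypothetical.

```python
import math


def hessian_kernels(sx, sy):
    """Sampled second-derivative-of-Gaussian kernels with separate x and y scales."""
    rx, ry = math.ceil(4 * sx), math.ceil(4 * sy)  # truncate at 4 sigma per axis
    norm = 1.0 / (2.0 * math.pi * sx * sy)
    kxx, kyy, kxy = {}, {}, {}
    for dy in range(-ry, ry + 1):
        for dx in range(-rx, rx + 1):
            g = norm * math.exp(-(dx * dx) / (2 * sx * sx)
                                - (dy * dy) / (2 * sy * sy))
            kxx[dx, dy] = (dx * dx / sx ** 4 - 1 / sx ** 2) * g
            kyy[dx, dy] = (dy * dy / sy ** 4 - 1 / sy ** 2) * g
            kxy[dx, dy] = (dx * dy / (sx * sx * sy * sy)) * g
    return kxx, kyy, kxy


def normalized_det(image, x, y, sx, sy):
    """Scale-normalized Hessian determinant at pixel (x, y) for one (sx, sy) pair.

    The caller must make sure the 4-sigma kernel window fits inside the image.
    """
    def apply(kernel):
        return sum(w * image[y + dy][x + dx] for (dx, dy), w in kernel.items())

    dxx, dyy, dxy = (apply(k) for k in hessian_kernels(sx, sy))
    return (sx * sy) ** 2 * (dxx * dyy - dxy * dxy)


def best_scales(image, x, y, scales):
    """Try every (sx, sy) combination and return the one with the largest response."""
    candidates = ((sx, sy) for sx in scales for sy in scales)
    return max(candidates, key=lambda s: normalized_det(image, x, y, *s))


# Usage: an elliptical blob that is twice as wide in x (sigma 4) as in y (sigma 2).
blob = [[math.exp(-((x - 32) ** 2) / (2 * 4.0 ** 2)
                  - ((y - 32) ** 2) / (2 * 2.0 ** 2))
         for x in range(65)] for y in range(65)]
sx, sy = best_scales(blob, 32, 32, [2.0, 4.0])
```

For this blob the selected pair has sx larger than sy, which an isotropic detector with a single scale could not express. The sketch covers scale selection only; refinement of the spatial position is not shown.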
20. A method for content based video sequence identification, the method comprising:
applying a bi-level filter in a first pass analysis to detect a set of initial interest points in selected video frames, wherein the first pass analysis reduces the effective area of images in each selected video frame to multiple smaller images;
applying an array of anisotropic filters to regions of pixels around the set of initial interest points in a second pass analysis to form a 4-dimensional (4D) space of determinant images with coordinate (x, y, sx, sy) values; and
interpolating the determinant images to identify refined interest points with coordinate (x, y, sx, sy) values that provide accurate region characterizations that are resistant to image distortion for identification of the video frames in the video sequence.
Dependent claims: 21-24
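The interpolation step can be sketched as sub-sample peak refinement in the 4D (x, y, sx, sy) grid. The claim does not state which interpolation scheme is used; the example below assumes the simplest option, an independent three-point parabola fit along each axis around the strongest grid sample, rather than a full 4D quadratic fit. The names `parabolic_offset` and `refine_peak` are hypothetical, and the result is expressed in grid-index units that would still need mapping to actual pixel positions and scale values.

```python
from itertools import product


def parabolic_offset(f_minus, f_zero, f_plus):
    """Offset of a parabola's extremum from the middle of three equally spaced samples."""
    denom = f_minus - 2.0 * f_zero + f_plus
    if denom == 0.0:
        return 0.0  # flat neighborhood: keep the grid position
    return 0.5 * (f_minus - f_plus) / denom


def refine_peak(det, shape):
    """Refine the strongest interior sample of a 4D determinant grid.

    det   -- mapping from (ix, iy, isx, isy) index tuples to responses
    shape -- number of samples along each of the four axes
    """
    # Restrict the search to interior samples so both neighbors exist on every axis.
    interior = product(*(range(1, n - 1) for n in shape))
    peak = max(interior, key=lambda p: det[p])
    refined = []
    for axis in range(4):
        lo, hi = list(peak), list(peak)
        lo[axis] -= 1
        hi[axis] += 1
        offset = parabolic_offset(det[tuple(lo)], det[peak], det[tuple(hi)])
        refined.append(peak[axis] + offset)
    return tuple(refined)


# Usage: a synthetic quadratic response whose true peak lies between grid samples.
true_peak = (2.3, 1.6, 2.2, 1.9)
shape = (5, 5, 5, 5)
det = {p: -sum((pi - ci) ** 2 for pi, ci in zip(p, true_peak))
       for p in product(*(range(n) for n in shape))}
refined = refine_peak(det, shape)
```

For an exactly quadratic response the per-axis fit recovers the true peak; for real determinant images it is only an approximation, and a joint fit over all four coordinates may behave better when the axes are correlated.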
25. A computer readable non-transitory medium having embodied thereon a program for content based video sequence identification, the program being executable by a computer to perform the steps of:
applying a bi-level filter in a first pass analysis to detect a set of initial interest points in selected video frames, wherein the first pass analysis reduces the effective area of images in each selected video frame to multiple smaller images;
applying an array of anisotropic filters to regions of pixels around the set of initial interest points in a second pass analysis to form a 4-dimensional (4D) space of determinant images with coordinate (x, y, sx, sy) values; and
interpolating the determinant images to identify refined interest points with coordinate (x, y, sx, sy) values that provide accurate region characterizations that are resistant to image distortion for identification of the video frames in the video sequence.
Dependent claims: 26, 27
Specification