Persistent feature descriptors for video
Abstract
Methods and devices for extracting feature descriptors for a video, the video having a sequence of pictures. The method includes identifying a first key picture and a second key picture later in the sequence than the first key picture; extracting a first set of feature descriptors from the first key picture and a second set of feature descriptors from the second key picture; identifying a set of pairs of feature descriptors, where each pair includes one descriptor from the first set and one descriptor from the second set; generating motion information describing the motion field between the first key picture and the second key picture; and filtering the set of pairs of feature descriptors based on correlation with the motion information to produce and output a subset of persistent descriptors.
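As an illustration of the pairing step described in the abstract, the following is a minimal sketch only. The source does not say how pairs are identified, so nearest-neighbour matching in feature space with a distance threshold is an assumption, and the names `Descriptor`, `pair_descriptors`, and `max_sq_dist` are hypothetical.

```python
from typing import List, NamedTuple, Tuple


class Descriptor(NamedTuple):
    """Hypothetical descriptor: a location in the picture plus a feature vector."""
    x: float
    y: float
    vec: Tuple[float, ...]


def sq_dist(a: Tuple[float, ...], b: Tuple[float, ...]) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def pair_descriptors(
    first: List[Descriptor],
    second: List[Descriptor],
    max_sq_dist: float = 0.5,  # assumed threshold, not taken from the source
) -> List[Tuple[Descriptor, Descriptor]]:
    """Pair each descriptor from the first key picture with its nearest
    neighbour (in feature space) from the second key picture."""
    pairs = []
    if not second:
        return pairs
    for d1 in first:
        best = min(second, key=lambda d2: sq_dist(d1.vec, d2.vec))
        # Keep the pair only if the feature vectors are close enough.
        if sq_dist(d1.vec, best.vec) <= max_sq_dist:
            pairs.append((d1, best))
    return pairs


first_set = [
    Descriptor(10.0, 10.0, (1.0, 0.0)),
    Descriptor(50.0, 50.0, (0.0, 1.0)),
    Descriptor(5.0, 5.0, (5.0, 5.0)),  # has no close match in the second set
]
second_set = [
    Descriptor(12.0, 10.0, (0.9, 0.1)),
    Descriptor(80.0, 5.0, (0.1, 0.9)),
]
candidate_pairs = pair_descriptors(first_set, second_set)
```

Each resulting pair holds one descriptor from each key picture, which is the form the later filtering step operates on. Matching on feature vectors alone can pair descriptors whose locations are implausible given the motion between the pictures, which is why the method goes on to filter the pairs.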
23 Claims
1. A method of extracting feature descriptors for a video, in a video feature descriptor extractor, the video including a sequence of pictures, the method comprising:
identifying a first key picture and a second key picture later in the sequence than the first key picture and having at least one picture between them;
extracting a first set of feature descriptors from the first key picture and a second set of feature descriptors from the second key picture;
identifying a set of pairs of feature descriptors, where each pair includes one descriptor from the first set and one descriptor from the second set;
generating motion field information describing a motion field between the first key picture and the second key picture; and
filtering the set of pairs of feature descriptors based on correlation with the motion information to produce and output a subset of persistent descriptors, wherein filtering the set of pairs of feature descriptors includes discarding, from the set, one or more pairs of feature descriptors based on a determination of whether the pairs are consistent with the motion field, a pair of feature descriptors being consistent with the motion field if relative locations of the descriptors of the pair in their respective key picture conform to the motion field.
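The filtering step of claim 1 can be illustrated with a minimal sketch. It assumes, hypothetically, that the motion field is stored as one motion vector per fixed-size block of the first key picture, and that a pair "conforms" when the second descriptor lies within a tolerance of the location predicted by that vector. The block size, tolerance, and the name `filter_persistent` are illustrative assumptions, not details taken from the claims.

```python
from typing import Dict, List, Tuple

Point = Tuple[float, float]                       # (x, y) location of a descriptor
Pair = Tuple[Point, Point]                        # (location in first, location in second)
MotionField = Dict[Tuple[int, int], Tuple[float, float]]  # block index -> (dx, dy)


def filter_persistent(
    pairs: List[Pair],
    motion_field: MotionField,
    block: int = 16,    # assumed block size in pixels
    tol: float = 4.0,   # assumed tolerance in pixels
) -> List[Pair]:
    """Keep only pairs whose relative locations conform to the motion field."""
    persistent = []
    for (x1, y1), (x2, y2) in pairs:
        # Look up the motion vector of the block containing the first descriptor.
        mv = motion_field.get((int(x1) // block, int(y1) // block))
        if mv is None:
            continue  # assumption: no motion estimate means the pair is discarded
        # Location where the feature is expected in the second key picture.
        px, py = x1 + mv[0], y1 + mv[1]
        if (x2 - px) ** 2 + (y2 - py) ** 2 <= tol ** 2:
            persistent.append(((x1, y1), (x2, y2)))
    return persistent


field = {(0, 0): (2.0, 0.0), (3, 3): (0.0, 0.0)}
pairs = [
    ((10.0, 10.0), (12.0, 10.0)),  # displacement matches the block's motion vector
    ((50.0, 50.0), (80.0, 5.0)),   # displacement contradicts the motion field
]
kept = filter_persistent(pairs, field)
```

In this example the first pair is retained and the second is discarded, since its second location is far from where the motion field predicts it.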
12. A video feature descriptor extractor for extracting feature descriptors for a video, the video including a sequence of pictures, the video feature descriptor extractor comprising:
a processor;
memory; and
an encoding application containing instructions executable by the processor that, when executed, cause the processor to
identify a first key picture and a second key picture later in the sequence than the first key picture and having at least one picture between them;
extract a first set of feature descriptors from the first key picture and a second set of feature descriptors from the second key picture;
identify a set of pairs of feature descriptors, where each pair includes one descriptor from the first set and one descriptor from the second set;
generate motion field information describing a motion field between the first key picture and the second key picture; and
filter the set of pairs of feature descriptors based on correlation with the motion information to produce and output a subset of persistent descriptors, wherein filtering the set of pairs of feature descriptors includes discarding, from the set, one or more pairs of feature descriptors based on a determination of whether the pairs are consistent with the motion field, a pair of feature descriptors being consistent with the motion field if relative locations of the descriptors of the pair in their respective key picture conform to the motion field.
23. A non-transitory processor-readable medium storing processor-executable instructions for extracting feature descriptors for a video, the video including a sequence of pictures, wherein the processor-executable instructions, when executed by a processor in a video feature descriptor extractor, cause the processor to:
identify a first key picture and a second key picture later in the sequence than the first key picture and having at least one picture between them;
extract a first set of feature descriptors from the first key picture and a second set of feature descriptors from the second key picture;
identify a set of pairs of feature descriptors, where each pair includes one descriptor from the first set and one descriptor from the second set;
generate motion field information describing a motion field between the first key picture and the second key picture; and
filter the set of pairs of feature descriptors based on correlation with the motion information to produce and output a subset of persistent descriptors, wherein filtering the set of pairs of feature descriptors includes discarding, from the set, one or more pairs of feature descriptors based on a determination of whether the pairs are consistent with the motion field, a pair of feature descriptors being consistent with the motion field if relative locations of the descriptors of the pair in their respective key picture conform to the motion field.
Specification