Efficient video data representation and content based video retrieval framework
First Claim
Patent Images
1. A method, executed by one or more processors, for processing a series of image frames in a video data to compress, store, search and/or retrieve images of interest in the video data, the method comprising:
- performing a segmenting process on the video data, wherein the segmenting process distinguishes different portions of the video data representing different objects and background images;
performing a wavelet transform process on each of the different portions to compress the video data, wherein the wavelet transform process encodes each of the different objects and background images in a respective wavelet pyramid representation;
identifying one or more features associated with each of the different objects and background images; and
encoding the one or more features in association with the respective encoded wavelet pyramid representation of their respective different objects and background images to generate a feature indexed wavelet pyramid representation of the video data, wherein the feature indexed wavelet period representation comprises the respective wavelet pyramid representations and a separate encoded index of attributes associated with the objects encoded in the respective wavelet pyramid representation, wherein the attributes are in the group consisting of object detection ID, scheme, value, tracking ID and tracking information of the objects.
1 Assignment
0 Petitions
Accused Products
Abstract
An efficient object-detection-driven video data representation along with a unified content based video compression and retrieval framework and wavelet based searching engine is described to encode the disclosed video data representation. The video data representation and unified compression and retrieval and searching engine together facilitate rapid retrieval of desired video information from large stores of video data.
-
Citations
24 Claims
-
1. A method, executed by one or more processors, for processing a series of image frames in a video data to compress, store, search and/or retrieve images of interest in the video data, the method comprising:
-
performing a segmenting process on the video data, wherein the segmenting process distinguishes different portions of the video data representing different objects and background images; performing a wavelet transform process on each of the different portions to compress the video data, wherein the wavelet transform process encodes each of the different objects and background images in a respective wavelet pyramid representation; identifying one or more features associated with each of the different objects and background images; and encoding the one or more features in association with the respective encoded wavelet pyramid representation of their respective different objects and background images to generate a feature indexed wavelet pyramid representation of the video data, wherein the feature indexed wavelet period representation comprises the respective wavelet pyramid representations and a separate encoded index of attributes associated with the objects encoded in the respective wavelet pyramid representation, wherein the attributes are in the group consisting of object detection ID, scheme, value, tracking ID and tracking information of the objects. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system for processing a sequence of images to compress, store, search and/or retrieve images of interest in the video data, the system comprising:
one or more processors in communication with a data storage device, the one or more processors configured to; perform a segmenting process on the video data, wherein the segmenting process distinguishes different portions of the video data representing different objects and background images; perform a wavelet transform process on each of the different portions to compress the video data, wherein the wavelet transform process encodes each of the different objects and background images in a respective wavelet pyramid representation; identify one or more features associated with each of the different objects and background images; and encode and index the one or more features in association with the respective wavelet pyramid representation of their respective different objects and background images to generate a feature indexed wavelet pyramid representation of the video data, wherein the feature indexed wavelet period representation comprises the respective wavelet pyramid representations and a separate encoded index of attributes associated with the objects encoded in the respective wavelet pyramid representation, wherein the attributes are in the group consisting of object detection ID, scheme, value, tracking ID and tracking information of the objects. - View Dependent Claims (13, 14, 15, 16, 17, 18)
-
19. A non-transitory computer-readable medium having computer readable program codes embodied thereon for processing a sequence of images to compress, store, search and/or retrieve images of interest in the video data, the computer-readable codes including instructions that, when executed by a processor, cause the processor to:
-
perform a segmenting process on the video data, wherein the segmenting process distinguishes different portions of the video data representing different objects and background images; perform a wavelet transform process on each of the different portions to compress the video data, wherein the wavelet transform process encodes each of the different objects and background images in a respective wavelet pyramid representation; identify one or more features associated with each of the different objects and background images; and encode the one or more features in association with the respective encoded wavelet pyramid representation of their respective different objects and background images to generate a feature indexed wavelet pyramid representation of the video data, wherein the feature indexed wavelet period representation comprises the respective wavelet pyramid representations and a separate encoded index of attributes associated with the objects encoded in the respective wavelet pyramid representation, wherein the attributes are in the group consisting of object detection ID, scheme, value, tracking ID and tracking information of the objects. - View Dependent Claims (20, 21, 22, 23, 24)
-
Specification