VIDEO SYNTHESIS USING VIDEO VOLUMES
First Claim
1. A computer-implemented method of creating a composite video, the method comprising:
- accessing a database storing volumes from a plurality of digital videos, each volume being a spatio-temporal segment of one of the videos and having a start time and an end time within the video, wherein the spatial extent of each volume is less than an entire frame of the video;
providing, to a client device, indicia for a plurality of the volumes;
receiving, from the client device, selections of a subset of the indicia, thereby selecting a corresponding subset of the plurality of the volumes;
retrieving, from the database, the volumes of the subset;
compositing the retrieved volumes to form a composite video.
2 Assignments
0 Petitions
Accused Products
Abstract
A volume identification system identifies a set of unlabeled spatio-temporal volumes within each of a set of videos, each volume representing a distinct object or action. The volume identification system further determines, for each of the videos, a set of volume-level features characterizing the volume as a whole. In one embodiment, the features are based on a codebook and describe the temporal and spatial relationships of different codebook entries of the volume. The volume identification system uses the volume-level features, in conjunction with existing labels assigned to the videos as a whole, to label with high confidence some subset of the identified volumes, e.g., by employing consistency learning or training and application of weak volume classifiers.
- The labeled volumes may be used for a number of applications, such as training strong volume classifiers, improving video search (including locating individual volumes), and creating composite videos based on identified volumes.
-
Citations
21 Claims
-
1. A computer-implemented method of creating a composite video, the method comprising:
-
accessing a database storing volumes from a plurality of digital videos, each volume being a spatio-temporal segment of one of the videos and having a start time and an end time within the video, wherein the spatial extent of each volume is less than an entire frame of the video; providing, to a client device, indicia for a plurality of the volumes; receiving, from the client device, selections of a subset of the indicia, thereby selecting a corresponding subset of the plurality of the volumes; retrieving, from the database, the volumes of the subset; compositing the retrieved volumes to form a composite video. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer-implemented method, comprising:
-
transmitting a search query over a computer network to a video sharing service; receiving indicia of a plurality of volumes of digital videos, the volumes being spatio-temporal segments of the digital videos and matching the search query, at least one of the indicia comprising a user interface element associated with a corresponding one of the volumes; responsive to a user selection of the user interface element, displaying a user interface for specifying a new composite video, the user interface comprising; a video region representing an area corresponding to a total spatial region of frames of a new composite video, and within the video region, a visual representation of the volume corresponding to the selected user interface element; monitoring user actions that specify properties of the volume within the new composite video; and sending a request to the video sharing service to save the new composite video comprising the specified properties to a video repository of the video sharing service.
-
-
15. A non-transitory computer-readable storage medium storing executable computer program instructions comprising:
-
instructions for accessing a database storing volumes from a plurality of digital videos, each volume being a spatio-temporal segment of one of the videos and having a start time and an end time within the video, wherein the spatial extent of each volume is less than an entire frame of the video; instructions for providing, to a client device, indicia for a plurality of the volumes; instructions for receiving, from the client device, selections of a subset of the indicia, thereby selecting a corresponding subset of the plurality of the volumes; instructions for retrieving, from the database, the volumes of the subset; and instructions for compositing the retrieved volumes to form a composite video. - View Dependent Claims (16, 17, 18)
-
-
19. A computer-implemented method comprising:
-
accessing a database storing volumes from a plurality of digital videos, each volume being a spatio-temporal segment of one of the digital videos and having a start time and an end time within the video, wherein the spatial extent of each volume is less than an entire frame of the video, and wherein each volume is associated with a label descriptive of the volume; receiving, from a client device, a search query; responsive to receiving the search query, identifying a plurality of the volumes having a label matching the search query; and providing indicia of the identified volumes to the client device. - View Dependent Claims (20, 21)
-
Specification