Scene and activity identification in video summary generation
First Claim
1. A system configured to generate a video summary from recorded video footage, the system comprising:
- storage media that stores a video of an activity including multiple frames and audio captured contemporaneously with the frames, the video having an unedited viewing time;
one or more physical processors configured by machine readable instructions to;
obtain a user-selected summary length for a video summary of the video;
obtain metadata for the video, the metadata including biometric information of a user captured contemporaneously with capture of the video and camera motion information characterizing position and/or motion of the camera during capture of the video;
analyze the biometric information of the user captured contemporaneously with capture of the video, the camera motion information characterizing position and/or motion of the camera during capture of the video, and the audio captured contemporaneously with the frames of the video to identify events of interest and types of the events of interest within the video;
select portions of the video for inclusion in the video summary, the portions including the events of interest, wherein numbers of the frames included within the portions are determined based on a type of the activity and the types of the events of interest such that the selected portions of the video include a first portion, the first portion including a first event of interest, and a first number of the frames included within the first portion is;
a first value based on the activity being of a first activity type and the first event of interest being of a first event type;
a second value based on the activity being of a first activity type and the first event of interest being of a second event type;
a third value based on the activity being of a second activity type and the first event of interest being of a first event type;
ora fourth value based on the activity being of a second activity type and the first event of interest being of a second event type;
wherein the first value, the second value, the third value, and the fourth value are different; and
generate an electronic file defining the video summary from the selected portions of the video so that the video summary has the user-selected summary length.
4 Assignments
0 Petitions
Accused Products
Abstract
Video and corresponding metadata is accessed. Events of interest within the video are identified based on the corresponding metadata, and best scenes are identified based on the identified events of interest. A video summary can be generated including one or more of the identified best scenes. The video summary can be generated using a video summary template with slots corresponding to video clips selected from among sets of candidate video clips. Best scenes can also be identified by receiving an indication of an event of interest within video from a user during the capture of the video. Metadata patterns representing activities identified within video clips can be identified within other videos, which can subsequently be associated with the identified activities.
-
Citations
20 Claims
-
1. A system configured to generate a video summary from recorded video footage, the system comprising:
-
storage media that stores a video of an activity including multiple frames and audio captured contemporaneously with the frames, the video having an unedited viewing time; one or more physical processors configured by machine readable instructions to; obtain a user-selected summary length for a video summary of the video; obtain metadata for the video, the metadata including biometric information of a user captured contemporaneously with capture of the video and camera motion information characterizing position and/or motion of the camera during capture of the video; analyze the biometric information of the user captured contemporaneously with capture of the video, the camera motion information characterizing position and/or motion of the camera during capture of the video, and the audio captured contemporaneously with the frames of the video to identify events of interest and types of the events of interest within the video; select portions of the video for inclusion in the video summary, the portions including the events of interest, wherein numbers of the frames included within the portions are determined based on a type of the activity and the types of the events of interest such that the selected portions of the video include a first portion, the first portion including a first event of interest, and a first number of the frames included within the first portion is; a first value based on the activity being of a first activity type and the first event of interest being of a first event type; a second value based on the activity being of a first activity type and the first event of interest being of a second event type; a third value based on the activity being of a second activity type and the first event of interest being of a first event type;
ora fourth value based on the activity being of a second activity type and the first event of interest being of a second event type; wherein the first value, the second value, the third value, and the fourth value are different; and generate an electronic file defining the video summary from the selected portions of the video so that the video summary has the user-selected summary length. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system configured to generate a video summary from recorded video footage, the system comprising:
-
storage media that stores video of an activity including multiple frames and audio captured contemporaneously with the frames, the video having an unedited viewing time; one or more physical processors configured by machine readable instructions to; obtain metadata for the video, the metadata including biometric information of a user captured contemporaneously with capture of the video and camera motion information characterizing position and/or motion of the camera during capture of the video; analyze the biometric information of the user captured contemporaneously with capture of the video, the camera motion information characterizing position and/or motion of the camera during capture of the video, and the audio captured contemporaneously with the frames of the video to identify events of interest and types of the events of interest within the video; identify frames of the video that are frames of interest based on the biometric information of the user captured contemporaneously with capture of the individual frames, the camera motion information characterizing position and/or motion of the camera during capture of the individual frames, and the audio captured contemporaneously with the frames, wherein numbers of the frames that are identified are determined based on a type of the activity and the types of the events of interest such that the identified frames of interest include frames including a first event of interest, and a first number of the frames including the first event of interest is; a first value based on the activity being of a first activity type and the first event of interest being of a first event type; a second value based on the activity being of a first activity type and the first event of interest being of a second event type; a third value based on the activity being of a second activity type and the first event of interest being of a first event type;
ora fourth value based on the activity being of a second activity type and the first event of interest being of a second event type; wherein the first value, the second value, the third value, and the fourth value are different; and effectuate display to the user of visual flags that indicate the identified frames of interest for the user. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification