VIDEO ACCESS SYSTEM AND METHOD BASED ON ACTION TYPE DETECTION
First Claim
1. A video access system for generating a description of the video segment based on a predetermined set of action describing rules, each rule associated with a respective action type from a predetermined set of detectable action types, each of the rules in the set defining at least one role in the action, and based on predetermined conditions defined for the rules, for computing score values dependent on the attribute values of an object or objects assigned to the at least one role, the system comprisingan action type detector coupled to a source of video data;
- an object detector coupled to the source of video data;
a description generator coupled to the action type detector and the object detector, and configured toselect a rule dependent on an action type detected from the video data for a video segment by the action type detector, the rule being selected from a predetermined set of rules;
assign detected objects, which have been detected from the video data for a video segment by the object detector, to the at least role of the selected rule;
compute score values for assignments of different detected objects to the role, each score value being computed dependent on the attribute values of the assigned object according to the predetermined conditions defined for the rule;
select a combination of the detected action type and at least one assigned detected objects on the basis of the score values;
generate a description of the video segment from the selected combination.
1 Assignment
0 Petitions
Accused Products
Abstract
Descriptions of video segments for use to search or select video segments are generated by using a combination of video based action type detection and image based object detection. Action type detection results in detection of action types from a predetermined set of action types, with little or no information about actors involved in the action. Object detection results in detection of individual objects in the images, with little or no information actions. A set of rules is used that define one or more roles associated with action types and conditions on objects that can fill these roles. Detected action types are used to select rules and detected objects are assigned to roles defined the selected rule. Score values are computed for different assignments of detected objects to the role, as a function of the attribute values of the assigned objects. A combination of the detected action type and the detected objects is selected on the basis of the score values. A description of the video segment is generated from the selected combination.
150 Citations
14 Claims
-
1. A video access system for generating a description of the video segment based on a predetermined set of action describing rules, each rule associated with a respective action type from a predetermined set of detectable action types, each of the rules in the set defining at least one role in the action, and based on predetermined conditions defined for the rules, for computing score values dependent on the attribute values of an object or objects assigned to the at least one role, the system comprising
an action type detector coupled to a source of video data; -
an object detector coupled to the source of video data; a description generator coupled to the action type detector and the object detector, and configured to select a rule dependent on an action type detected from the video data for a video segment by the action type detector, the rule being selected from a predetermined set of rules; assign detected objects, which have been detected from the video data for a video segment by the object detector, to the at least role of the selected rule; compute score values for assignments of different detected objects to the role, each score value being computed dependent on the attribute values of the assigned object according to the predetermined conditions defined for the rule; select a combination of the detected action type and at least one assigned detected objects on the basis of the score values; generate a description of the video segment from the selected combination. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A video segment selection method, the method comprising
determining a detected action type for a video segment, and/or a likelihood value of the detected action type, the detected action type being selected from a predetermined set of action types dependent on features derived from the video segment; -
determining detections and/or likelihood values for detected objects in the video segment and attribute values of the detected objects; assigning the detected objects to a role defined in a predetermined rule associated with the detected action type; computing score values for different assignments of detected objects to the role, each as a function of the attribute values of the assigned objects according to predetermined conditions defined for the rule; selecting a combination of the detected action type and the detected objects on the basis of the score values; generating a description of the video segment from the selected combination. - View Dependent Claims (11, 12, 13, 14)
-
Specification