Apparatus and methods for real time estimation of differential motion in live video
First Claim
1. A video processing device of a robotic apparatus, comprising:
a camera configured to generate a compressed video frame;
an input interface coupled to the camera and configured to receive the compressed video frame;
a video encoder configured to:
determine information associated with a portion of the compressed frame, the information corresponding to motion of one or more objects within the compressed frame;
evaluate the information for a pattern, the pattern comprising a temporal sequence of a first group of motion vectors and a subsequent second group of motion vectors; and
identify the one or more objects based at least in part on a salient feature in the pattern within the information and based on a determination that a residual motion vector associated with the portion of the compressed frame breaches a velocity threshold, the residual motion vector being derived relative to a prevailing motion vector within the portion of the compressed frame;
an output interface configured to communicate an indication based on the identification of the one or more objects;
wherein:
the one or more objects identified in the compressed frame correspond to a physical task of the robotic apparatus; and
the indication is configured to modify a trajectory of the robotic apparatus based on the identified one or more objects.
Abstract
A data processing apparatus may use a video encoder to extract motion information from streaming video in real time. Output of the video encoder may be parsed to extract motion information associated with one or more objects within the video stream. The motion information may be utilized by, e.g., an adaptive controller to detect one or more objects salient to a given task. The controller may be configured to determine a control signal associated with the given task. The control signal determination may be configured based on a characteristic of an object detected using the motion information extracted from the encoded output. The control signal may be provided to a robotic device, causing the device to execute the task. The use of dedicated hardware video encoder output may reduce energy consumption associated with execution of the task and/or extend autonomy of the robotic device.
23 Claims
1. A video processing device of a robotic apparatus, comprising:
a camera configured to generate a compressed video frame;
an input interface coupled to the camera and configured to receive the compressed video frame;
a video encoder configured to:
determine information associated with a portion of the compressed frame, the information corresponding to motion of one or more objects within the compressed frame;
evaluate the information for a pattern, the pattern comprising a temporal sequence of a first group of motion vectors and a subsequent second group of motion vectors; and
identify the one or more objects based at least in part on a salient feature in the pattern within the information and based on a determination that a residual motion vector associated with the portion of the compressed frame breaches a velocity threshold, the residual motion vector being derived relative to a prevailing motion vector within the portion of the compressed frame;
an output interface configured to communicate an indication based on the identification of the one or more objects;
wherein:
the one or more objects identified in the compressed frame correspond to a physical task of the robotic apparatus; and
the indication is configured to modify a trajectory of the robotic apparatus based on the identified one or more objects.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A method of operating a video encoder apparatus, the apparatus configured to compress a raw frame of real-time video into a compressed frame, the method comprising:
causing the encoder apparatus to produce the compressed frame;
parsing the compressed frame, the parsing configured to provide information associated with the compressed frame, the information corresponding to motion of an object in the raw frame, the information comprising a plurality of components;
evaluating the information to determine a presence of the object within the raw frame, the determination of the presence of the object being based on a determination, based on at least one statistical parameter, of at least one residual motion vector derived from at least one dominant motion vector within the compressed frame, the determination of the presence of the object being further based on a determination that the derived at least one residual motion vector breaches a velocity threshold; and
providing an indication of the presence of the object based on the evaluation.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21
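The residual-vector test recited in claim 13 can be illustrated with a minimal sketch. It assumes that per-block motion vectors have already been parsed from the compressed frame into (x, y) tuples, and it uses the component-wise median as the "statistical parameter" that defines the dominant vector. The function names, the choice of median, the sample values, and the threshold are all hypothetical and are not taken from the patent.

```python
import math
from statistics import median


def residual_vectors(motion_vectors):
    """Return the dominant vector and each block's residual relative to it.

    The dominant vector is assumed here to be the component-wise median of
    the block motion vectors (one possible statistical parameter).
    """
    dom_x = median(v[0] for v in motion_vectors)
    dom_y = median(v[1] for v in motion_vectors)
    residuals = [(vx - dom_x, vy - dom_y) for vx, vy in motion_vectors]
    return (dom_x, dom_y), residuals


def detect_object(motion_vectors, velocity_threshold):
    """Return indices of blocks whose residual speed exceeds the threshold."""
    _, residuals = residual_vectors(motion_vectors)
    return [
        i
        for i, (rx, ry) in enumerate(residuals)
        if math.hypot(rx, ry) > velocity_threshold
    ]


# Hypothetical block motion vectors (pixels per frame): a camera pan of
# roughly (4, 0) plus one block that moves differently from the background.
mvs = [(4, 0), (4, 0), (5, 0), (4, 1), (10, -6), (4, 0)]
dominant, _ = residual_vectors(mvs)
hits = detect_object(mvs, velocity_threshold=3.0)
```

With these sample values the dominant vector is (4, 0), and only the block at index 4 has a residual large enough to be flagged. The median is used because it is less sensitive than the mean to the very outliers the test is trying to find, but that is a design choice of this sketch.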
22. A method of processing raw video to efficiently detect objects present therein, the method comprising:
compressing the raw video according to a compression algorithm to produce compressed video;
parsing at least a portion of the compressed video to obtain information corresponding to motion of a moving object, the information comprising a map of pixel groups of the at least the portion of the compressed video, the map of pixel groups comprising a distribution of at least (i) a first plurality of the pixel groups having corresponding first motion vectors and (ii) a second plurality of the pixel groups having corresponding second motion vectors, the first motion vectors and the second motion vectors being different in location and direction; and
evaluating the information associated with the first plurality and the second plurality of pixel groups of the at least the portion of the parsed compressed video to identify the moving object therein, the evaluating the information associated with the first and second plurality of pixel groups to identify the moving object further comprising, evaluating a respective rate of change of motion within the first plurality and the second plurality of pixel groups, and determining that at least one residual motion vector within the map of pixel groups breaches a velocity threshold.
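The "rate of change of motion" evaluation in claim 22 can be sketched as follows. The sketch assumes each plurality of pixel groups is represented by a list of (x, y) motion vectors sampled in two consecutive frames, and it measures the rate of change as the magnitude of the shift in the group's mean vector per frame interval. The representation, the function names, and the sample values are hypothetical and are not drawn from the patent.

```python
import math


def mean_vector(vectors):
    """Average motion vector of a plurality of pixel groups."""
    n = len(vectors)
    return (sum(v[0] for v in vectors) / n, sum(v[1] for v in vectors) / n)


def rate_of_change(prev_vectors, curr_vectors, dt=1.0):
    """Magnitude of the change in a group's mean motion vector per unit time.

    dt is the interval between the two samples, in frames by default.
    """
    px, py = mean_vector(prev_vectors)
    cx, cy = mean_vector(curr_vectors)
    return math.hypot(cx - px, cy - py) / dt


# Hypothetical first plurality: steady background motion across two frames.
first_prev = [(2, 0), (2, 0)]
first_curr = [(2, 0), (2, 0)]

# Hypothetical second plurality: vertical motion that speeds up.
second_prev = [(0, 1), (0, 1)]
second_curr = [(0, 4), (0, 4)]

first_rate = rate_of_change(first_prev, first_curr)
second_rate = rate_of_change(second_prev, second_curr)
```

In this example the first plurality shows no change in motion while the second changes by 3 pixels per frame per frame, so the second plurality is the more likely candidate for a moving object under this particular measure.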
23. A non-transitory computer-readable storage medium having computer readable instructions stored thereon, the instructions being executable by at least one processor to cause a video encoder apparatus to:
produce a compressed frame;
parse the compressed frame, the parse being configured to provide information associated with the compressed frame and corresponding to motion of one or more objects in the compressed frame, the information comprising a plurality of components;
evaluate the information to determine a presence of the one or more objects in the compressed frame, the determination of the presence of the one or more objects being based on (i) a determination, based on at least one statistical parameter, of a residual motion vector derived from a dominant vector in the compressed frame, and (ii) a determination that the derived residual motion vector breaches a velocity threshold; and
provide an indication of the presence of the one or more objects based on the evaluation.
Specification