Method and apparatus for detecting movement patterns at a self-service checkout terminal
Abstract
A system and method for detecting hand and item movement patterns comprising a video camera positioned to view a scene which includes therein a scanner for scanning items, wherein the video camera generates a sequence of video frames representing activity in the scene, processing means coupled to the video camera, the processing means performing steps of identifying regions of a video frame representing a hand; and tracking hand movement with respect to the scanner over a plurality of video frames. Event information descriptive of user activity at the self-service checkout terminal is generated based on the tracking information.
21 Claims
-
1. A system for detecting movement patterns with respect to at least one hand of an individual and an item in the individual's hand to be purchased at a checkout terminal, comprising:
-
(a) a video camera positioned to view a scene which includes therein a scanner for scanning items;
(b) frame grabber means coupled to the video camera for generating a stream of video frames representing activity in the scene;
(c) processor means coupled to the frame grabber means, the processor means performing steps of:
(1) detecting whether the hand is in a video frame;
(2) tracking hand movement with respect to the scanner over a plurality of video frames and generating tracking information representative thereof including associated X and Y Cartesian coordinates for the location of the hand and the height and width of the hand;
(3) eliminating tracking information corresponding to the detected hand;
(4) determining the presence of the item to be purchased in the detected hand; and
(5) generating event information descriptive of user activity at the checkout terminal based on the tracking information.
(i) generating region clusters for a video frame which are hypotheses of a hand; and
(ii) evaluating each region cluster with hand model information and generating a confidence value that represents a likelihood that the region cluster represents a hand.
-
-
4. The system of claim 3, wherein the processor means generates the region clusters by periodically generating a background image representing background image information for the scene, comparing image information for a video frame with the background image to generate regions, and forming region clusters by resegmenting the regions into color regions, and combining one or more color regions based on predetermined criteria.
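The comparison of a video frame with a background image can be illustrated with a minimal sketch. It assumes grayscale frames held as nested lists, a fixed difference threshold, and 4-connected grouping of changed pixels; the names `foreground_mask`, `connected_regions` and the threshold value are hypothetical and do not come from the claims. The resegmentation into color regions and the combining step are not shown.

```python
from collections import deque


def foreground_mask(frame, background, threshold=30):
    """Flag pixels whose intensity differs from the background by more than threshold."""
    return [
        [abs(f - b) > threshold for f, b in zip(frame_row, background_row)]
        for frame_row, background_row in zip(frame, background)
    ]


def connected_regions(mask):
    """Group flagged pixels into 4-connected regions, each a set of (row, col)."""
    rows = len(mask)
    cols = len(mask[0]) if mask else 0
    seen = set()
    regions = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or (r, c) in seen:
                continue
            # Breadth-first flood fill starting from an unvisited flagged pixel.
            region = set()
            queue = deque([(r, c)])
            seen.add((r, c))
            while queue:
                y, x = queue.popleft()
                region.add((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < rows and 0 <= nx < cols
                    if inside and mask[ny][nx] and (ny, nx) not in seen:
                        seen.add((ny, nx))
                        queue.append((ny, nx))
            regions.append(region)
    return regions
```

Each returned region would then be a candidate for the resegmentation and combining steps recited in the claim.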
-
5. The system of claim 4, wherein the processor means discards those regions which are highly unlikely to contain a hand, prior to resegmentation of the regions.
-
6. The system of claim 4, wherein the processor means forms a hypothesis list of region clusters in order of highest likelihood of representing a hand.
-
7. The system of claim 6, wherein the processor means evaluates each region cluster in the hypothesis list by comparing it with hand model information and generating a confidence value representing a degree of match with the hand model information, and wherein a region cluster determined to have a confidence value which exceeds a minimum confidence threshold is added to a hand list.
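The evaluation of a hypothesis list against a minimum confidence threshold might look like the following sketch. The hand model here is an assumption made only for illustration: a target aspect ratio plus a hypothetical `skin_fraction` feature, weighted equally. The claims do not say what the hand model information contains or how the confidence value is computed.

```python
from dataclasses import dataclass


@dataclass
class RegionCluster:
    width: int            # bounding-box width in pixels, assumed positive
    height: int           # bounding-box height in pixels, assumed positive
    skin_fraction: float  # hypothetical feature: share of skin-colored pixels, 0..1


def hand_confidence(cluster, model_aspect=1.0):
    """Score a cluster against an assumed hand model; the result lies in 0..1."""
    aspect = cluster.width / cluster.height
    # 1.0 when the aspect ratio equals the model's, falling toward 0 as it diverges.
    aspect_score = min(aspect, model_aspect) / max(aspect, model_aspect)
    return 0.5 * aspect_score + 0.5 * cluster.skin_fraction


def build_hand_list(hypotheses, min_confidence=0.6):
    """Keep clusters whose confidence exceeds the minimum confidence threshold."""
    hand_list = []
    for cluster in hypotheses:
        confidence = hand_confidence(cluster)
        if confidence > min_confidence:
            hand_list.append((cluster, confidence))
    return hand_list
```

A square, mostly skin-colored cluster scores high under these assumptions, while a long thin cluster with little skin color falls below the threshold and is left off the hand list.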
-
8. The system of claim 7, wherein the processor means is programmed to determine whether an item is associated with a region cluster in the hand list.
-
9. The system of claim 8, wherein the processor means tracks hand movement by:
-
(i) determining a measure of correspondence between region clusters in consecutive video frames that are determined to represent a hand;
(ii) constructing hand tracks based on the measure of correspondence of region clusters determined between video frames.
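One way to picture the construction of hand tracks from a measure of correspondence is the sketch below. The measure used (centroid proximity multiplied by area similarity) and the greedy assignment are assumptions chosen for brevity; the claims do not specify either. `HandBox`, `correspondence` and `extend_tracks` are hypothetical names, and box sizes are assumed positive.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class HandBox:
    x: float       # X coordinate of the hand location
    y: float       # Y coordinate of the hand location
    width: float
    height: float


def correspondence(a, b, max_distance=50.0):
    """Return a 0..1 score: high when two boxes are close and similar in size."""
    distance = math.hypot(a.x - b.x, a.y - b.y)
    if distance >= max_distance:
        return 0.0
    proximity = 1.0 - distance / max_distance
    area_a = a.width * a.height
    area_b = b.width * b.height
    return proximity * (min(area_a, area_b) / max(area_a, area_b))


def extend_tracks(tracks, detections, min_score=0.3):
    """Attach each detection to the best unclaimed track, or start a new track."""
    unclaimed = set(range(len(tracks)))
    for detection in detections:
        best_index, best_score = None, min_score
        for index in sorted(unclaimed):
            score = correspondence(tracks[index][-1], detection)
            if score > best_score:
                best_index, best_score = index, score
        if best_index is None:
            tracks.append([detection])
        else:
            tracks[best_index].append(detection)
            unclaimed.discard(best_index)
    return tracks
```

Calling `extend_tracks` once per video frame with that frame's hand detections grows a list of tracks, each a time-ordered list of boxes carrying the X and Y coordinates, height and width recited in claim 1.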
-
-
10. The system of claim 1, wherein the processor generates event information for at least one of the following events:
-
(a) an item has passed over the scanner, including its direction;
(b) an empty hand has passed over the scanner;
(c) an item passed over a scanned product zone in the self-service checkout terminal;
(d) an empty hand passed over the scanned product zone;
(e) an item was placed in the scanned product zone;
(f) an item was removed from the scanned product zone; and
(g) an item was passed over the scanner and placed in the scanned product zone.
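Events (a) and (b) in the list above can be derived from a hand track with a sketch like the one below. It assumes the scanner is modeled as an axis-aligned rectangle, that a track is a list of `(x, y, has_item)` samples, and that a pass means the hand entered the rectangle and later left it. These modeling choices and all names are hypothetical. The sketch reports a direction for both events, although the claim recites direction only for event (a).

```python
from dataclasses import dataclass
from enum import Enum


class Event(Enum):
    ITEM_OVER_SCANNER = "item passed over scanner"
    EMPTY_HAND_OVER_SCANNER = "empty hand passed over scanner"


@dataclass(frozen=True)
class Zone:
    left: float
    top: float
    right: float
    bottom: float

    def contains(self, x, y):
        return self.left <= x <= self.right and self.top <= y <= self.bottom


def scanner_events(track, scanner):
    """Return (Event, direction) for every entry-then-exit of the scanner zone."""
    events = []
    entry = None  # (x at entry, whether the hand held an item at entry)
    for x, y, has_item in track:
        inside = scanner.contains(x, y)
        if inside and entry is None:
            entry = (x, has_item)
        elif not inside and entry is not None:
            entry_x, carried = entry
            direction = "left-to-right" if x > entry_x else "right-to-left"
            kind = Event.ITEM_OVER_SCANNER if carried else Event.EMPTY_HAND_OVER_SCANNER
            events.append((kind, direction))
            entry = None
    return events
```

The scanned product zone events (c) through (g) could be handled the same way with a second `Zone`, plus a check on whether the item flag changes while the hand is inside it.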
-
11. A method for detecting movement patterns with respect to at least one hand of an individual and an item in the individual's hand to be purchased in a scene of activity at a checkout terminal having a scanner, based on a sequence of video frames representing activity in the scene, comprising steps of:
-
(a) detecting whether the hand is in a video frame;
(b) tracking hand movement with respect to the scanner over a plurality of video frames and generating tracking information representative thereof including associated X and Y Cartesian coordinates for the location of the hand and the height and width of the hand;
(c) eliminating tracking information corresponding to the detected hand;
(d) determining the presence of the item to be purchased in the detected hand; and
(e) generating event information descriptive of user activity at the checkout terminal based on the tracking information.
(i) generating region clusters for a video frame which are hypotheses of a hand; and
(ii) evaluating each region cluster with hand model information and generating a confidence value that represents a likelihood that the region cluster represents a hand.
-
-
14. The method of claim 13, wherein the step of generating the region clusters comprises steps of periodically generating a background image representing background image information for the scene, comparing image information for a video frame with the background image to generate regions, and forming region clusters by resegmenting the regions into color regions, and combining one or more color regions based on predetermined criteria.
-
15. The method of claim 14, and further comprising the step of discarding those regions which are highly unlikely to contain a hand, prior to resegmentation of the regions.
-
16. The method of claim 14, and further comprising the step of forming a hypothesis list of region clusters in order of highest likelihood of representing a hand.
-
17. The method of claim 16, wherein the step of evaluating comprises evaluating each region cluster in the hypothesis list by comparing it with hand model information and generating a confidence value representing a degree of match with the hand model information, and wherein a region cluster determined to have a confidence value which exceeds a confidence threshold is added to a hand list.
-
18. The method of claim 17, and further comprising the step of determining whether an item is associated with a region cluster in the hand list.
-
19. The method of claim 18, wherein the step of tracking hand movement comprises steps of:
-
(i) determining a measure of correspondence between region clusters in consecutive video frames that are determined to represent a hand;
(ii) constructing hand tracks based on the measure of correspondence of region clusters determined between video frames.
-
-
20. The method of claim 11, wherein the step of generating event information comprises generating information for at least one of the following events:
-
(a) an item has passed over the scanner, including its direction;
(b) an empty hand has passed over the scanner;
(c) an item passed over a scanned product zone in the self-service checkout terminal;
(d) an empty hand passed over the scanned product zone;
(e) an item was placed in the scanned product zone;
(f) an item was removed from the scanned product zone; and
(g) an item was passed over the scanner and placed in the scanned product zone.
-
21. The method of claim 11, wherein the step of determining the presence of the item to be purchased in the detected hand comprises steps of:
-
(a) calculating an average product size for the item over time;
(b) comparing the calculated average product size with a minimum product size threshold; and
(c) determining from the comparison whether the item to be purchased is in the detected hand.
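The three steps of claim 21 reduce to a short computation, sketched here under stated assumptions: product size is taken to be a pixel area measured once per frame, the average is a plain arithmetic mean over the observed frames, and the item counts as present when the average meets or exceeds the threshold. The claim does not say how size is measured or whether the comparison is strict, and `item_present` is a hypothetical name.

```python
def item_present(product_sizes, min_product_size):
    """Decide whether the hand holds an item from per-frame product sizes."""
    if not product_sizes:
        return False  # no observations, so no evidence of an item
    # Step (a): average product size over time.
    average_size = sum(product_sizes) / len(product_sizes)
    # Steps (b) and (c): compare with the minimum threshold and decide.
    return average_size >= min_product_size
```

For example, sizes of 120, 150 and 90 pixels average to 120, which meets a threshold of 100, so the item is judged to be in the hand. Averaging over several frames makes the decision less sensitive to a single noisy measurement.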
-
Specification