Detector tree of boosted classifiers for real-time object detection and tracking
First Claim
Patent Images
1. A method comprising:
- building a tree classifier, which rejects non-object patterns in input data representing real world objects, including a plurality of parent nodes, wherein the tree classifier is stored on a machine-readable medium and is trained to perform human south detection and tracking in video sequences; and
for a parent node in the tree classifier, selecting between a monolithic classifier as a child node and a plurality of specialized classifiers as child nodes for said parent node;
wherein said selecting comprises;
determining a computational complexity of a monolithic classifiers trained with a plurality of positive and negative samples; and
determining a computational complexity of a plurality of specialized classifiers trained with the plurality of positive and negative samples, each of the specialized classifiers being trained with the plurality of negative samples and a different subset of the plurality of positive samples; and
wherein human mouth detection and tracking in video sequences occurs when other tree classifier is executed.
1 Assignment
0 Petitions
Accused Products
Abstract
A tree classifier may include a number of stages. Some stages may include monolithic classifiers, and other stages may be split into two or more classifiers.
-
Citations
13 Claims
-
1. A method comprising:
-
building a tree classifier, which rejects non-object patterns in input data representing real world objects, including a plurality of parent nodes, wherein the tree classifier is stored on a machine-readable medium and is trained to perform human south detection and tracking in video sequences; and for a parent node in the tree classifier, selecting between a monolithic classifier as a child node and a plurality of specialized classifiers as child nodes for said parent node; wherein said selecting comprises; determining a computational complexity of a monolithic classifiers trained with a plurality of positive and negative samples; and determining a computational complexity of a plurality of specialized classifiers trained with the plurality of positive and negative samples, each of the specialized classifiers being trained with the plurality of negative samples and a different subset of the plurality of positive samples; and wherein human mouth detection and tracking in video sequences occurs when other tree classifier is executed. - View Dependent Claims (2, 3, 4)
-
-
5. A method comprising:
-
building a tree classifier, which rejects non-object patterns in input data representing real world objects, wherein the tree classifier is stored on a machine-readable medium and is trained to perform human mouth detection and tracking in video sequences, the building including; identifying a plurality of positive samples and the plurality of negative samples in a plurality of patterns; passing the plurality of positive samples and the plurality of negative samples to a node in the tree classifier; determining a number of features used by a monolithic classifier trained with said plurality of positive samples and said plurality of negative samples; clustering the plurality of positive samples into a plurality of subsets; training each of a plurality of specialized classifiers with the plurality of negative samples and a different one of said plurality of subsets; determining a number of features used by the plurality of specialized classifiers; and selecting the plurality of specialized classifiers in response to the number of features used by the plurality of specialized classifiers being smaller than the number of features used by the monolithic classifier; and wherein human mouth detection and tracking in video sequences occurs when the tree classifier is executed. - View Dependent Claims (6, 7)
-
-
8. An article, comprising a machine-readable medium including machine-executable instructions operative to cause a machine to perform operations comprising:
-
build a tree classifier, which rejects non-object patterns in input data representing real world objects, including a plurality of parent nodes, wherein the tree classifier is stored on a machine-readable medium and is trained to perform human mouth detection and tracking in video sequences; and for a parent node in the tree classifier, select between a monolithic classifier as a child node and a plurality of specialized classifiers as child nodes for said parent node; wherein the instructions operative to cause the machine to select comprise instructions operative to cause the machine to; determine a computational complexity of a monolithic classifier trained with a plurality of positive and negative samples; and determine a computational complexity of a plurality of specialized classifiers trained with the plurality of positive and negative samples, each of the specialized classifiers being trained with the plurality of negative samples and a different subset of the plurality of positive samples; and perform human mouth selection and tracking in video sequences using the trained tree classifier. - View Dependent Claims (9, 10, 11)
-
-
12. An article comprising a machine-readable medium including machine-executable instructions operative to cause a machine to perform operations comprising:
-
build a tree classifier, which rejects non-object patterns in input data representing real world objects, wherein the tree classifier is stored on a machine-readable medium and is trained to perform human mouth detection and tracking in video sequences, the building including; identify a plurality of positive samples and a plurality of negative samples in a plurality of patterns; pass the plurality of positive samples and the plurality of negative samples to a node in the tree classifier; determine a number of features used by a monolithic classifier trained with said plurality of positive samples and said plurality of negative samples; cluster the plurality of positive samples into a plurality of subsets; train each of a plurality of specialized classifiers with the plurality of negative samples and a different one of said plurality of subsets; determine a number of features used by the plurality of specialized classifiers; and select the plurality of specialized classifiers in response to the number of features used by the plurality of specialized classifiers being smaller than the number of features used by the monolithic classifier; and perform human mouth detection and tracking in video sequences using the trained tree classifier. - View Dependent Claims (13)
-
Specification