Identification of people using multiple types of input
Abstract
Systems and methods for detecting people or speakers in an automated fashion are disclosed. A pool of features including more than one type of input (like audio input and video input) may be identified and used with a learning algorithm to generate a classifier that identifies people or speakers. The resulting classifier may be evaluated to detect people or speakers.
18 Claims
1. A method comprising:
identifying a pool of features comprising at least one feature from a first type of input and at least one feature from a second type of input where the second type of input is different from the first type of input; and
generating a classifier for speaker detection using a learning algorithm wherein nodes of the classifier are selected using the pool of features and a preferable feature is weighted higher than a less preferable feature such that the preferable feature is located in the classifier before the less preferable feature.
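The generation step recited in claim 1 can be illustrated with a minimal sketch. The claim does not name a particular learning algorithm, so the greedy, preference-weighted selection below is only a stand-in. The `Feature` record, its `preference` field, the threshold ("stump") test, and the sample data are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Sample = Dict[str, float]  # hypothetical: one measurement per named signal


@dataclass(frozen=True)
class Feature:
    name: str
    input_type: str                      # e.g. "audio" or "video"
    extract: Callable[[Sample], float]   # computes the feature value
    preference: float                    # assumption: higher means more preferable


def stump_accuracy(feature: Feature, samples: List[Sample],
                   labels: List[bool], threshold: float) -> float:
    """Fraction of samples that a simple 'value > threshold' test labels correctly."""
    correct = sum((feature.extract(s) > threshold) == y
                  for s, y in zip(samples, labels))
    return correct / len(samples)


def best_threshold(feature: Feature, samples: List[Sample],
                   labels: List[bool]) -> Tuple[float, float]:
    """Try midpoints between sorted feature values; return (threshold, accuracy)."""
    values = sorted({feature.extract(s) for s in samples})
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])] or values
    scored = [(t, stump_accuracy(feature, samples, labels, t)) for t in candidates]
    return max(scored, key=lambda pair: pair[1])


def generate_classifier(pool: List[Feature], samples: List[Sample],
                        labels: List[bool], num_nodes: int) -> List[Tuple[Feature, float]]:
    """Select nodes from the pool, ordered by accuracy scaled by preference,
    so that a more preferable feature lands earlier in the classifier."""
    scored = []
    for feature in pool:
        threshold, accuracy = best_threshold(feature, samples, labels)
        scored.append((accuracy * feature.preference, feature, threshold))
    scored.sort(key=lambda item: item[0], reverse=True)
    return [(feature, threshold) for _, feature, threshold in scored[:num_nodes]]


# Hypothetical pool mixing two input types.
pool = [
    Feature("video_motion", "video", lambda s: s["motion"], preference=1.0),
    Feature("audio_ssl_energy", "audio", lambda s: s["ssl"], preference=2.0),
]
samples = [
    {"motion": 0.9, "ssl": 0.8},
    {"motion": 0.7, "ssl": 0.9},
    {"motion": 0.2, "ssl": 0.1},
    {"motion": 0.6, "ssl": 0.2},
]
labels = [True, True, False, False]  # True = a speaker is present

classifier = generate_classifier(pool, samples, labels, num_nodes=2)
node_names = [feature.name for feature, _ in classifier]
```

In this toy data both features separate the labels equally well, so the ordering of `node_names` is decided by the assumed `preference` value alone. A real learning algorithm would typically also reweight training samples between rounds.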
9. A method comprising:
accepting input data comprising a first type of input data and a second type of input data that is different from the first type of input data; and
evaluating a person detection classifier to detect a person wherein the classifier has been created by:
  identifying a pool of features comprising at least one feature associated with the first type of input data and at least one feature associated with the second type of input data; and
  generating the classifier using a learning algorithm by selecting nodes of the classifier using the pool of features and weighting a preferable feature higher than a less preferable feature such that the preferable feature is located in the classifier before the less preferable feature.
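The evaluation step recited in claim 9 might look like the following sketch. It assumes, purely for illustration, that the classifier is an ordered list of threshold nodes evaluated in sequence with early rejection. The claim only says that preferable features sit earlier in the classifier. The node names, thresholds, and input layout are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical input layout: input type -> named measurements.
InputData = Dict[str, Dict[str, float]]
# Hypothetical node layout: (name, feature function, threshold).
Node = Tuple[str, Callable[[InputData], float], float]


def detect_person(nodes: List[Node], data: InputData) -> Tuple[bool, List[str]]:
    """Evaluate nodes in order and stop at the first one that fails.

    Returns (detected, names of the nodes that were actually evaluated).
    """
    evaluated: List[str] = []
    for name, feature, threshold in nodes:
        evaluated.append(name)
        if feature(data) <= threshold:
            return False, evaluated   # early rejection: later nodes are skipped
    return True, evaluated


# Nodes listed in classifier order; the first is assumed to be the preferable one.
nodes: List[Node] = [
    ("audio_level", lambda d: d["audio"]["level"], 0.5),
    ("video_motion", lambda d: d["video"]["motion"], 0.4),
]

speaking = {"audio": {"level": 0.9}, "video": {"motion": 0.7}}
silent = {"audio": {"level": 0.1}, "video": {"motion": 0.7}}

speaking_result = detect_person(nodes, speaking)
silent_result = detect_person(nodes, silent)
```

Under this assumption, placing a preferable feature first means that inputs it rejects never reach the later nodes, which is one plausible reason for ordering the classifier by preference.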
14. A system comprising:
a video input device that produces video data;
an audio input device that produces audio data; and
a detector device including a detector configured to accept the video data and the audio data and evaluate a person detection classifier to detect a person where the classifier has been created by:
  identifying a pool of features comprising at least one feature associated with the video data and at least one feature associated with the audio data; and
  generating the classifier using a learning algorithm by selecting nodes of the classifier using the pool of features and weighting a preferable feature higher than a less preferable feature such that the preferable feature is located in the classifier before the less preferable feature.
16. A method comprising:
identifying a pool of features comprising at least one feature from a first type of input and at least one feature from a second type of input where the second type of input is different from the first type of input;
generating a classifier for speaker detection using a learning algorithm wherein nodes of the classifier are selected using the pool of features; and
evaluating the classifier to detect a person, wherein at least one of the at least one feature from the first type of input or the at least one feature from the second type of input operates so that a false positive result is associated with a second person that is different from the person.
17. A method comprising:
identifying a pool of features comprising at least one feature from a first type of input and at least one feature from a second type of input where the second type of input is different from the first type of input, wherein the first type of input or the second type of input includes an audio input, the pool of features includes an audio feature associated with a sound source localization input, and the audio feature is associated with a function selected from the following functions;
18. A system comprising:
a video input device that produces video data;
an audio input device that produces audio data, the audio data including sound source localization data; and
a detector device including a detector configured to accept the video data and the audio data and evaluate a person detection classifier to detect a person where the classifier has been created by:
  identifying a pool of features comprising at least one feature associated with the video data and at least one feature associated with the audio data, the pool of features including an audio feature associated with a function selected from the following functions;
Specification