Robust detection and classification of objects in audio using limited training data
First Claim
1. A method of classifying a homogeneous audio segment into one of a plurality of classes, said method comprising the steps of:
- dividing said homogeneous audio segment into a plurality of sub-segments;
extracting for each sub-segment a feature vector; and
classifying said homogeneous audio segment by comparing said feature vectors of said plurality of sub-segments with a plurality of continuous distribution functions, wherein each continuous distribution function defines one of said plurality of classes.
1 Assignment
0 Petitions
Accused Products
Abstract
A method (200) and apparatus (100) for classifying a homogeneous audio segment are disclosed. The homogeneous audio comprises a sequence of audio samples (x(n)). The method (200) starts by forming a sequence of frames (701-704) along the sequence of audio samples (x(n)), each frame (701-704) comprising a plurality of the audio samples (x(n)). The homogeneous audio segment is next divided (206) into a plurality of audio clips (711-714), with each audio clip being associated with a plurality of the frames (701-704). The method (200) then extracts (208) at least one frame feature for each clip (711-714). A clip feature vector (f) is next extracted from frame features of frames associated with the audio clip (711-714). Finally the segment is classified based on a continuous function during the distribution of the clip feature vectors (f).
26 Citations
16 Claims
-
1. A method of classifying a homogeneous audio segment into one of a plurality of classes, said method comprising the steps of:
-
dividing said homogeneous audio segment into a plurality of sub-segments; extracting for each sub-segment a feature vector; and classifying said homogeneous audio segment by comparing said feature vectors of said plurality of sub-segments with a plurality of continuous distribution functions, wherein each continuous distribution function defines one of said plurality of classes. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An apparatus for classifying a homogeneous audio segment into one of a plurality of classes, said apparatus comprising:
-
means for dividing said homogeneous audio segment into a plurality of sub-segments; means for extracting for each sub-segment a feature vector; and means for classifying said homogeneous audio segment by comparing said feature vectors of said plurality of sub-segments with a plurality of continuous distribution functions, wherein each continuous distribution function defines one of said plurality of classes. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A program stored in a memory medium for classifying a homogeneous audio segment into one of a plurality of classes, said program comprising;
-
code for dividing said homogeneous audio segment into a plurality of sub-segments; code for extracting for each sub-segment a feature vector; and code for classifying said homogeneous audio segment by comparing said feature vectors of said plurality of sub-segments with a plurality of continuous distribution functions, wherein each continuous distribution function defines one of said plurality of classes. - View Dependent Claims (15, 16)
-
Specification