Selective sampling for sound signal classification
First Claim
1. A method for sound signal classification, comprising:
- receiving a sound signal;
specifying meta-data to be extracted from the sound signal;
dividing the sound signal into a set of frames;
applying a fitness function to the frames to create a set of fitness data;
selecting a frame from the set of frames, if the frame'"'"'s corresponding fitness datum within the set of fitness data exceeds a predetermined threshold value;
extracting the meta-data from the selected frames; and
classifying the sound signal based on the meta-data extracted from the selected frames.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method of selective sampling for sound signal classification is disclosed. The method of the present invention discloses the elements of: receiving a sound signal; specifying meta-data to be extracted from the sound signal; dividing the sound signal into a set of frames; applying a fitness function to the frames to create a set of fitness data; selecting a frame from the set of frames, if the frame'"'"'s corresponding fitness datum within the set of fitness data exceeds a predetermined threshold value; extracting the meta-data from the selected frames; and classifying the sound signal based on the meta-data extracted from the selected frames. The system of the present invention discloses means for implementing the method.
-
Citations
19 Claims
-
1. A method for sound signal classification, comprising:
-
receiving a sound signal;
specifying meta-data to be extracted from the sound signal;
dividing the sound signal into a set of frames;
applying a fitness function to the frames to create a set of fitness data;
selecting a frame from the set of frames, if the frame'"'"'s corresponding fitness datum within the set of fitness data exceeds a predetermined threshold value;
extracting the meta-data from the selected frames; and
classifying the sound signal based on the meta-data extracted from the selected frames. - View Dependent Claims (2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13, 14, 15, 17)
-
-
7. The method of claim I wherein specifying includes:
specifying dialect meta-data.
-
16. The method of claim I further wherein classifying includes:
-
adding together each of the selected frame'"'"'s confidence scores for each meta-data class; and
assigning the sound signal to that meta-data class having a highest total confidence score.
-
-
18. A method for sound signal classification, comprising:
-
receiving a speech signal;
specifying meta-data to be extracted from the sound signal;
dividing the sound signal into a set of equal length time frames;
applying a fitness function to the frames to create a set of fitness data;
selecting a frame for meta-data extraction, if the frame'"'"'s fitness datum exceeds a greatest fitness datum within the set of fitness data by a predetermined margin;
extracting the meta-data from the selected frames using a Multi-Layer Perceptron (MLP) neural network;
adding together each of the selected frame'"'"'s confidence scores for each meta-data class; and
assigning the sound signal to that meta-data class having a highest total confidence score.
-
-
19. A system for sound signal classification comprising a:
-
means for receiving a sound signal;
means for specifying meta-data to be extracted from the sound signal;
means for dividing the sound signal into a set of frames;
means for applying a fitness function to the frames to create a set of fitness data;
means for selecting a frame from the set of frames, if the frame'"'"'s corresponding fitness datum within the set of fitness data exceeds a predetermined threshold value;
means for extracting the meta-data from the selected frames; and
means for classifying the sound signal based on the meta-data extracted from the selected frames.
-
Specification