PRODUCING TIME UNIFORM FEATURE VECTORS
Abstract
Methods, systems, and machine-readable media are disclosed for processing a signal representing speech. According to one embodiment, processing a signal representing speech can comprise receiving a frame of the signal representing speech, the frame comprising a voiced frame. One or more cords can be extracted from the voiced frame based on occurrence of one or more events within the frame. For example, the one or more events can comprise one or more glottal pulses. The one or more cords can collectively comprise less than all of the frame. The one or more cords can be normalized on a time basis. For example, each of the one or more cords can begin with the onset of a glottal pulse and extend to a point prior to the onset of a neighboring glottal pulse, but may exclude a portion of the frame prior to the onset of the neighboring glottal pulse.
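The extraction and normalization steps summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the glottal pulse onsets are already known as sample indices, caps each cord at a hypothetical `cord_len` samples so that part of the frame before the next onset is left out, and takes "normalizing on a time basis" to mean resampling every cord to a common length by linear interpolation. The names `extract_cords`, `normalize_time`, `cord_len`, and `n_out` are illustrative only.

```python
from typing import List, Sequence


def extract_cords(frame: Sequence[float], onsets: Sequence[int],
                  cord_len: int) -> List[List[float]]:
    """Cut one cord per glottal pulse onset (onsets are assumed to be given).

    Each cord starts at an onset and ends before the next onset (or the end
    of the frame), capped at cord_len samples, so the cords together may
    cover less than the whole frame.
    """
    cords = []
    bounds = list(onsets) + [len(frame)]
    for start, nxt in zip(bounds[:-1], bounds[1:]):
        end = min(start + cord_len, nxt)
        if end > start:
            cords.append(list(frame[start:end]))
    return cords


def normalize_time(cord: Sequence[float], n_out: int) -> List[float]:
    """Resample a cord to n_out samples by linear interpolation (an assumption)."""
    if len(cord) == 1 or n_out == 1:
        return [float(cord[0])] * n_out
    step = (len(cord) - 1) / (n_out - 1)
    out = []
    for i in range(n_out):
        pos = i * step
        lo = min(int(pos), len(cord) - 2)  # left neighbor, clamped at the end
        frac = pos - lo
        out.append(cord[lo] * (1 - frac) + cord[lo + 1] * frac)
    return out


# Toy voiced frame of 20 samples with three assumed pulse onsets.
frame = list(range(20))
cords = extract_cords(frame, onsets=[2, 9, 15], cord_len=5)
uniform = [normalize_time(c, n_out=3) for c in cords]
```

In this toy frame the three cords cover 15 of the 20 samples, and after normalization every cord has the same number of samples regardless of the original pulse spacing.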
21 Claims
1. A method of processing a signal representing speech, the method comprising:
- receiving a frame of the signal representing speech, the frame comprising a voiced frame;
- extracting one or more cords from the voiced frame based on occurrence of one or more events within the frame and wherein the one or more cords collectively comprise less than all of the frame; and
- normalizing the one or more cords on a time basis.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A system comprising:
- a classification module adapted to receive a frame of a signal representing speech and classify the frame as a voiced frame;
- a cord finder module communicatively coupled with the classification module and adapted to receive the frame from the classification module and extract one or more cords from the frame based on occurrence of one or more events within the frame and wherein the one or more cords collectively comprise less than all of the frame; and
- a time normalization module communicatively coupled with the cord finder module and adapted to receive the one or more extracted cords from the cord finder module and normalize the one or more cords on a time basis.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
21. A machine-readable medium having stored thereon a series of instructions which, when executed by a processor, cause the processor to process a signal representing speech by:
- receiving a frame of the signal representing speech, the frame comprising a voiced frame;
- extracting one or more cords from the voiced frame based on occurrence of one or more events within the frame and wherein the one or more cords collectively comprise less than all of the frame; and
- normalizing the one or more cords on a time basis.
Specification