Statistical acoustic processing method and apparatus for speech recognition using a toned phoneme system
First Claim
1. A method for recognizing words of speech comprising at least one syllable having tonal content, the method comprising the steps of:
decomposing said at least one syllable into a preme and a toneme, the toneme having a tone value; and
recognizing the words of speech based on the preme and toneme of said at least one syllable including the steps of:
continuously detecting a pitch value for the toneme of said at least one syllable;
creating at least one pitch contour based on the detected pitch value;
determining whether a discontinuity representing an un-toned portion of said at least one syllable exists between adjacent pitch contours and if so producing at least one simulated tone value to mask the discontinuity;
obtaining parameters from the pitch value for the toneme and from a derivative of the at least one pitch contour; and
determining the tone value of the toneme of said at least one syllable using the parameters.
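The pitch-handling steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that pitch arrives as one value per frame with `None` marking un-toned frames, and that a gap between adjacent contours is masked by linear interpolation; the claim does not say how the simulated values are produced. The function names `find_contours`, `mask_gaps`, and `pitch_features` are hypothetical, and the final tone decision is left out.

```python
from typing import List, Optional, Tuple

Pitch = List[Optional[float]]  # one value per frame, None = no pitch detected


def find_contours(pitch: Pitch) -> List[Tuple[int, int]]:
    """Return (start, end) pairs, end exclusive, for each run of voiced frames."""
    contours: List[Tuple[int, int]] = []
    start: Optional[int] = None
    for i, value in enumerate(pitch):
        if value is not None and start is None:
            start = i
        elif value is None and start is not None:
            contours.append((start, i))
            start = None
    if start is not None:
        contours.append((start, len(pitch)))
    return contours


def mask_gaps(pitch: Pitch) -> Pitch:
    """Fill each gap between adjacent contours with simulated values.

    Assumption: linear interpolation between the last voiced frame before
    the gap and the first voiced frame after it.
    """
    filled = list(pitch)
    contours = find_contours(pitch)
    for (_, end_a), (start_b, _) in zip(contours, contours[1:]):
        left, right = pitch[end_a - 1], pitch[start_b]
        span = start_b - (end_a - 1)
        for k in range(end_a, start_b):
            frac = (k - (end_a - 1)) / span
            filled[k] = left + frac * (right - left)
    return filled


def pitch_features(pitch: Pitch) -> List[Tuple[float, float]]:
    """Pair each available pitch value with its first difference (a discrete derivative)."""
    features: List[Tuple[float, float]] = []
    prev: Optional[float] = None
    for value in pitch:
        if value is None:
            prev = None
            continue
        delta = 0.0 if prev is None else value - prev
        features.append((value, delta))
        prev = value
    return features
```

Calling `pitch_features(mask_gaps(frames))` yields per-frame (pitch, derivative) pairs that a downstream tone classifier could consume. Leading and trailing un-toned frames are left unfilled here because the claim only addresses discontinuities between adjacent contours.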
Abstract
A method and apparatus for acoustic signal processing in speech recognition, the method comprising the following components: 1) each syllable is decomposed into two phonemes of comparable length and complexity, the first being a preme and the second a toneme; 2) each toneme is assigned a tone value such as high, rising, low, falling, or untoned; 3) no tone value is assigned to premes; 4) pitch is detected continuously and treated the same way as energy and cepstrals in a Hidden Markov Model to predict the tone of a toneme; 5) the tone of a syllable is defined as the tone of its component toneme.
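Components 1) to 3) and 5) can be illustrated with a short Python sketch. It rests on several assumptions that the abstract does not state: syllables are written in a Pinyin-like romanization with a trailing tone digit, the preme is everything before the first vowel, and the digits 1 to 5 map to high, rising, low, falling, and untoned. The names `Syllable`, `decompose`, `TONE_NAMES`, and `VOWELS` are hypothetical, and the split rule is a simplification, not the decomposition defined in the specification.

```python
from dataclasses import dataclass

# Assumed digit-to-tone mapping; the abstract only lists the tone values.
TONE_NAMES = {"1": "high", "2": "rising", "3": "low", "4": "falling", "5": "untoned"}
VOWELS = set("aeiouv")  # "v" stands in for "ü", a common romanization shortcut


@dataclass(frozen=True)
class Syllable:
    preme: str   # carries no tone value
    toneme: str  # carries the tone value
    tone: str    # tone of the syllable, defined as the tone of its toneme


def decompose(syllable: str) -> Syllable:
    """Split a romanized syllable into a preme and a toned toneme."""
    if not syllable:
        raise ValueError("empty syllable")
    body, digit = syllable[:-1], syllable[-1]
    if digit not in TONE_NAMES:
        # No tone digit given: treat the whole string as the body, untoned.
        body, digit = syllable, "5"
    # Simplified rule (assumption): the toneme starts at the first vowel.
    split = next((i for i, ch in enumerate(body) if ch in VOWELS), len(body))
    return Syllable(preme=body[:split], toneme=body[split:], tone=TONE_NAMES[digit])
```

For example, `decompose("zhang1")` gives a preme `zh` and a toneme `ang` carrying the tone `high`; the preme itself has no tone attribute.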
12 Claims
1. A method for recognizing words of speech comprising at least one syllable having tonal content, the method comprising the steps of:
decomposing said at least one syllable into a preme and a toneme, the toneme having a tone value; and
recognizing the words of speech based on the preme and toneme of said at least one syllable including the steps of:
continuously detecting a pitch value for the toneme of said at least one syllable;
creating at least one pitch contour based on the detected pitch value;
determining whether a discontinuity representing an un-toned portion of said at least one syllable exists between adjacent pitch contours and if so producing at least one simulated tone value to mask the discontinuity;
obtaining parameters from the pitch value for the toneme and from a derivative of the at least one pitch contour; and
determining the tone value of the toneme of said at least one syllable using the parameters.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system for recognizing words of speech comprising at least one syllable having tonal content, comprising:
means for decomposing said at least one syllable into a preme and a toneme, the toneme having a tone value; and
means for recognizing the words of speech based on the preme and toneme of said at least one syllable comprising:
means for converting the words of speech into an electrical signal;
pitch extraction means for extracting a pitch value for the toneme of said at least one syllable if the signal energy is above a threshold;
means for extrapolating the signal wherever the signal energy is below the threshold or the extracted pitch value is not within a pre-determined range to generate an extended pitch signal;
storage means for storing data including the extended pitch signal and at least one derivative of the extended pitch signal; and
means for determining the tone value of the toneme of said at least one syllable using the stored data.
Dependent claims: 10, 11, 12
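The extraction and extrapolation elements of this claim can be sketched in Python as follows. This is an illustration under stated assumptions, not the claimed apparatus: the signal is assumed to be already framed into per-frame energy and raw pitch estimates, extrapolation is done by holding the last valid pitch value (with leading frames back-filled from the first valid one), and the 50 to 500 Hz range is a placeholder for the "pre-determined range". The names `extend_pitch` and `first_derivative` are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple


def extend_pitch(
    energy: Sequence[float],
    raw_pitch: Sequence[float],
    energy_threshold: float,
    pitch_range: Tuple[float, float] = (50.0, 500.0),  # assumed range in Hz
) -> List[float]:
    """Build an extended pitch signal with a value for every frame.

    A frame's raw pitch is kept only if its energy is above the threshold
    and the pitch lies inside pitch_range. Other frames are extrapolated
    by holding the most recent valid value (an assumed strategy).
    """
    low, high = pitch_range
    held: List[Optional[float]] = []
    last_valid: Optional[float] = None
    for e, p in zip(energy, raw_pitch):
        if e > energy_threshold and low <= p <= high:
            last_valid = p
        held.append(last_valid)

    first_valid = next((v for v in held if v is not None), None)
    if first_valid is None:
        raise ValueError("no frame has a usable pitch value")
    # Back-fill frames that precede the first valid pitch value.
    return [first_valid if v is None else v for v in held]


def first_derivative(signal: Sequence[float]) -> List[float]:
    """Frame-to-frame difference, with 0.0 for the first frame."""
    if not signal:
        return []
    return [0.0] + [b - a for a, b in zip(signal, signal[1:])]
```

The extended pitch signal and its derivative returned by these two functions correspond to the data the claim stores for the tone decision. A hold-last rule keeps the derivative at zero across extrapolated frames; a different extrapolation rule would change that.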
Specification