Hidden conditional random field models for phonetic classification and speech recognition
First Claim
Patent Images
1. A method of training a hidden conditional random field model, the method comprising:
- defining a set of hidden states for each of a plurality of labels;
identifying a constrained sequence of sets of hidden states, at least one set in the sequence containing fewer than all of the hidden states defined for all of the labels; and
adjusting parameters of the hidden conditional random field model to make state sequences in the constrained sequence of sets of hidden states more likely than a state sequence in an unconstrained sequence of sets of hidden states, each set in the unconstrained sequence containing all of the hidden states defined for all of the labels.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus are provided for training and using a hidden conditional random field model for speech recognition and phonetic classification. The hidden conditional random field model uses features, at least one of which is based on a hidden state in a phonetic unit. Values for the features are determined from a segment of speech, and these values are used to identify a phonetic unit for the segment of speech.
15 Citations
24 Claims
-
1. A method of training a hidden conditional random field model, the method comprising:
-
defining a set of hidden states for each of a plurality of labels;
identifying a constrained sequence of sets of hidden states, at least one set in the sequence containing fewer than all of the hidden states defined for all of the labels; and
adjusting parameters of the hidden conditional random field model to make state sequences in the constrained sequence of sets of hidden states more likely than a state sequence in an unconstrained sequence of sets of hidden states, each set in the unconstrained sequence containing all of the hidden states defined for all of the labels. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-readable medium having computer-executable instructions for performing steps comprising:
-
receiving a speech signal;
determining values from the speech signal for features that are defined for a hidden conditional random field model, at least one of the features based on a hidden state in a phonetic unit; and
using the values of the features in the hidden conditional random field model to identify at least one phonetic unit. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A method of decoding a speech signal to identify at least one phonetic unit, the method comprising:
-
identifying a first value for a feature for a first hidden state at a time point using a segment of the speech signal;
identifying a second value for the feature for a second hidden state at the time point using the segment of the speech signal;
using both the first value for the feature and the second value for the feature in a model to identify a phonetic unit for the segment of speech. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification