Method of phonetic modeling using acoustic decision tree
Abstract
Phonetic modeling includes the steps of forming triphone grammars (11) from phonetic data, training triphone models (13), clustering triphones (14) that are acoustically close together and mapping unclustered triphone grammars into a clustered model (16). The clustering process includes using a decision tree based on the acoustic likelihood and allows sub-model clusters in user-definable units.
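As a rough illustration of the first and last steps named in the abstract (forming triphones from phonetic data, then mapping unclustered triphones into a clustered model), the sketch below may help. It is not the patented implementation. The `left-center+right` label notation, the `sil` boundary padding, the cluster names, and the fallback to a per-phone default cluster are all assumptions made for this example.

```python
def form_triphones(phones, boundary="sil"):
    """Turn a phone sequence into left-center+right triphone labels.

    The label notation and the 'sil' boundary symbol are assumptions
    for illustration; the source does not specify a notation.
    """
    padded = [boundary] + list(phones) + [boundary]
    return [
        f"{padded[i - 1]}-{padded[i]}+{padded[i + 1]}"
        for i in range(1, len(padded) - 1)
    ]


def map_to_clusters(triphones, cluster_of, default_of):
    """Map each unclustered triphone label to a clustered model name.

    cluster_of: hypothetical table from triphone label to cluster name.
    default_of: hypothetical fallback table from center phone to cluster
    name, used for triphones missing from cluster_of.
    """
    mapped = []
    for label in triphones:
        center = label.split("-")[1].split("+")[0]
        mapped.append(cluster_of.get(label, default_of[center]))
    return mapped


# Hypothetical clustered model: two triphones have their own entries,
# and every center phone has a fallback cluster.
cluster_of = {"sil-k+ae": "k_c1", "k-ae+t": "ae_c3"}
default_of = {"k": "k_c0", "ae": "ae_c0", "t": "t_c0"}

triphones = form_triphones(["k", "ae", "t"])
clusters = map_to_clusters(triphones, cluster_of, default_of)
print(triphones)  # ['sil-k+ae', 'k-ae+t', 'ae-t+sil']
print(clusters)   # ['k_c1', 'ae_c3', 't_c0']
```

In this sketch the third triphone has no entry of its own, so it falls back to the default cluster for its center phone; how unseen triphones are actually handled is not stated in the abstract.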
83 Citations
11 Claims
1. A method of performing speech recognition comprising the steps of:
- generating phonetic models comprising the steps of:
  - forming triphone grammars from phonetic data;
  - training triphone models;
  - clustering triphones that are acoustically close together to form clustered triphone model by an acoustic decision tree analysis; and
  - mapping unclustered triphone grammars into a clustered model; and
- recognizing input speech by comparing said input speech to said clustered triphone model.
Dependent claims: 2, 3, 4, 5, 6, 7, 8.
9. A speech recognition system comprising:
- a microphone for receiving speech;
- a clustered model from clustering triphones that are acoustically close together; and
- a processor including a comparison means coupled to said microphone and said clustered model and responsive to said speech received for comparing incoming speech to said clustered model to provide a given output when there is a compare.
Dependent claims: 10.
11. A speech recognition system comprising:
- a clustered model from clustering triphones according to the steps of: collecting speech data, forming triphone grammars, clustering triphones that are acoustically close together and clustering triphones by decision tree analysis wherein the decision criteria is on likelihood improvement based on acoustic vectors; and
- a speech recognizer for comparing said incoming speech to said clustered model for recognizing speech.
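The decision criterion named in claim 11, likelihood improvement based on acoustic vectors, can be sketched for a single tree node as follows. This is a minimal illustration under stated assumptions, not the claimed method: each cluster is modeled by one maximum-likelihood Gaussian with diagonal covariance, the candidate questions ask only about the left context, and the names `cluster_log_likelihood`, `best_question`, the variance floor, and the toy data are hypothetical.

```python
import math


def cluster_log_likelihood(vectors, var_floor=1e-4):
    """Log-likelihood of vectors under their own ML diagonal Gaussian.

    Assumption: one Gaussian per cluster. For N vectors this equals
    -N/2 * sum over dimensions of (log(2*pi*variance) + 1).
    """
    n = len(vectors)
    if n == 0:
        return 0.0
    total = 0.0
    for d in range(len(vectors[0])):
        column = [v[d] for v in vectors]
        mean = sum(column) / n
        var = max(sum((x - mean) ** 2 for x in column) / n, var_floor)
        total += math.log(2 * math.pi * var) + 1.0
    return -0.5 * n * total


def best_question(data, questions):
    """Pick the yes/no question with the largest likelihood gain.

    data: triphone label -> list of acoustic vectors (hypothetical).
    questions: question name -> predicate on the triphone label.
    Returns (name, gain), or None if no question splits the data.
    """
    parent = cluster_log_likelihood([v for vs in data.values() for v in vs])
    best = None
    for name, pred in questions.items():
        yes = [v for t, vs in data.items() if pred(t) for v in vs]
        no = [v for t, vs in data.items() if not pred(t) for v in vs]
        if not yes or not no:
            continue  # the question does not split this node
        gain = cluster_log_likelihood(yes) + cluster_log_likelihood(no) - parent
        if best is None or gain > best[1]:
            best = (name, gain)
    return best


def left_context(label):
    # Assumes the hypothetical left-center+right label notation.
    return label.split("-")[0]


# Toy one-dimensional acoustic vectors for triphones of the phone "ae".
data = {
    "k-ae+t": [[1.0], [1.2]],
    "g-ae+t": [[1.1], [0.9]],
    "m-ae+t": [[3.0], [3.2]],
    "n-ae+t": [[2.9], [3.1]],
}
questions = {
    "left_is_nasal": lambda t: left_context(t) in {"m", "n"},
    "left_is_k": lambda t: left_context(t) == "k",
}

best = best_question(data, questions)
print(best[0])  # left_is_nasal
```

In the toy data the nasal question separates the two groups of vectors cleanly, so it yields the larger gain. A full tree would apply the same selection recursively and stop when the gain or the amount of data in a node falls below a threshold; those stopping rules are not given in the claims and are left out here.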
Specification