Speech recognition models combining gender-dependent and gender-independent phone states and using phonetic-context-dependence
First Claim
1. A method of gender dependent speech recognition comprising the steps of:
- identifying phone state models common to both genders;
identifying gender specific phone state models;
identifying a gender of a speaker; and
recognizing acoustic data from the speaker based on the phone state models.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of gender dependent speech recognition includes the steps of identifying phone state models common to both genders, identifying gender specific phone state models, identifying a gender of a speaker and recognizing acoustic data from the speaker. A method of constructing a gender-dependent speech recognition model includes the steps of providing training data of a known gender, aligning the training data, tagging the training data with a gender to create gender-tagged data, determining a gender question at a node to determine gender dependence of the gender-tagged data, determining a phonetic context question at the node to determine phonetic context dependence of the gender-tagged data, determining a highest value of an evaluation function between the gender dependence and the phonetic context dependence to determine which dependence is a dominant dependence, splitting the data of the dominant dependence into child nodes according to likelihood criteria, comparing the highest value with a threshold value to determine if additional splitting is necessary, repeating theses steps for each child node until the highest value is below the threshold value and counting the nodes having gender dependence to determine an overall gender dependence level. A gender-dependent speech recognition system includes an input device for inputting speech to a preprocessor. The preprocessor converts the speech into acoustic data, and a processor for identifies gender-dependent phone state models and phone state modes common to both genders. The phone state models are stored in a memory device wherein the processor recognizes the speech in accordance with the phone state models.
103 Citations
18 Claims
-
1. A method of gender dependent speech recognition comprising the steps of:
-
identifying phone state models common to both genders; identifying gender specific phone state models; identifying a gender of a speaker; and recognizing acoustic data from the speaker based on the phone state models. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method of constructing a gender-dependent speech recognition model comprising the steps of:
-
a) aligning acoustic data with a gender independent system b) asking a gender question at a node to determine gender dependence of the acoustic data; c) asking a phonetic context question at the node to determine phonetic context dependence of the acoustic data; d) determining a highest value of an evaluation function between the gender dependence and the phonetic context dependence to determine which dependence is a dominant dependence; e) splitting the data of the dominant dependence into child nodes according to the question of dominant dependence; and f) repeating steps b-e for each child node until a threshold criterion is met. - View Dependent Claims (10, 11, 12)
-
-
13. A method of constructing a gender-dependent speech recognition model comprising the steps of:
-
a) providing training data of a known gender; b) aligning the training data; c) tagging the training data with a gender to create gender-tagged data; d) asking a gender question at a node to determine gender dependence of the gender-tagged data; e) asking a phonetic context question at the node to determine phonetic context dependence of the gender-tagged data; f) determining a highest value of an evaluation function between the gender dependence and the phonetic context dependence to determine which dependence is a dominant dependence; g) splitting the data of the dominant dependence into child nodes according to a likelihood criterion; h) comparing the highest value with a threshold value to determine if additional splitting is necessary; and i) repeating steps d-h for each child node until the highest value is below the threshold value. - View Dependent Claims (14, 15)
-
-
16. A gender-dependent speech recognition system comprising:
-
an input device for inputting speech to a preprocessor, the preprocessor converting speech into acoustic data; and a processor for identifying gender-dependent phone state models and phone state modes common to both genders, the phone state models being stored in a memory device wherein the processor recognizes the speech in accordance with the phone state models. - View Dependent Claims (17, 18)
-
Specification