Method for recognizing speech
First Claim
1. Method for recognizing speech, wherein for the process of recognition a current acoustic model (CAM) based on a set of model function mixtures (MFMl, . . . , MFMn) is used and wherein said current acoustic model (CAM) is adapted during the recognition process by changing at least in part the contributions of model function mixture components (MFMjk) of model function mixtures (MFMj) based on at least one recognition result already obtained, characterized in that the process of recognition is started using a starting acoustic model (SAM) as said current acoustic model (CAM), after given numbers of performed recognition steps and/or obtained recognition results a modified acoustic model (MAM) is generated based on said current acoustic model (CAM) by cancelling model function mixture components (MFMjk) having negligible contributions with respect to at least given numbers of recognition results already obtained, and the process of recognition is continued using said modified acoustic model (MAM) as said current acoustic model (CAM) in each case.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for recognizing speech is proposed wherein the process of recognition is started using the starting acoustic model (SAM) and wherein the current acoustic model (CAM) is modified by removing or cancelling model function mixture components (MFMjk) which are negligible for the description of the speaking behaviour and quality of the current speaker. Therefore, the size of the acoustic model (SAM, CAM) is reduced by adaptation to the current speaker enabling fast performance and increased recognition efficiency.
13 Citations
10 Claims
-
1. Method for recognizing speech,
wherein for the process of recognition a current acoustic model (CAM) based on a set of model function mixtures (MFMl, . . . , MFMn) is used and wherein said current acoustic model (CAM) is adapted during the recognition process by changing at least in part the contributions of model function mixture components (MFMjk) of model function mixtures (MFMj) based on at least one recognition result already obtained, characterized in that the process of recognition is started using a starting acoustic model (SAM) as said current acoustic model (CAM), after given numbers of performed recognition steps and/or obtained recognition results a modified acoustic model (MAM) is generated based on said current acoustic model (CAM) by cancelling model function mixture components (MFMjk) having negligible contributions with respect to at least given numbers of recognition results already obtained, and the process of recognition is continued using said modified acoustic model (MAM) as said current acoustic model (CAM) in each case.
Specification