Voice authentication and speech recognition system and method
First Claim
1. A method for configuring a speech recognition system, the method comprising:
obtaining a speech sample from a user utilised to authenticate the user as part of an authentication process;
processing the speech sample to train one or more generic acoustic model(s) for units of speech associated with the speech sample;
storing the trained acoustic model(s) in a personalised acoustic model set for the user;
selectively re-training the acoustic model(s) in the personalised model set based on additional speech samples provided by the user containing corresponding units of speech;
responsive to determining that the user has accessed a speech recognition function, directing a speech recognition process to access the personalised model set for recognising subsequent user utterances; and
further comprising determining a measure of quality for each of the stored acoustic models, wherein the acoustic models are re-trained based on additional speech samples until the corresponding quality measure meets a predefined threshold.
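The train, store, and selectively re-train steps of claim 1 can be illustrated with a minimal Python sketch. Everything named here is a hypothetical illustration and not taken from the patent: `UnitModel` is a toy stand-in for an acoustic model (a running mean of feature vectors seeded from a generic model), the quality measure is simply the fraction of a target sample count reached, and `TARGET_SAMPLES` and `QUALITY_THRESHOLD` are assumed values.

```python
from dataclasses import dataclass, field

TARGET_SAMPLES = 5        # assumed: user samples needed for full quality
QUALITY_THRESHOLD = 1.0   # assumed: the "predefined threshold" of the claim


@dataclass
class UnitModel:
    """Toy acoustic model for one unit of speech: a running mean vector."""
    mean: list[float]
    n_samples: int = 0    # number of user samples absorbed so far

    def update(self, features: list[float]) -> None:
        # The generic seed counts as one prior observation, so after n user
        # samples the mean is (generic + sum of samples) / (n + 1).
        self.n_samples += 1
        weight = self.n_samples + 1
        self.mean = [m + (x - m) / weight for m, x in zip(self.mean, features)]

    def quality(self) -> float:
        # Hypothetical quality measure: share of the target sample count.
        return min(1.0, self.n_samples / TARGET_SAMPLES)


@dataclass
class PersonalisedModelSet:
    """Per-user store of unit models, seeded from generic models."""
    generic: dict[str, list[float]]
    models: dict[str, UnitModel] = field(default_factory=dict)

    def add_sample(self, unit: str, features: list[float]) -> bool:
        """Train or re-train the model for `unit`; return True if it was updated."""
        model = self.models.get(unit)
        if model is None:
            # First sample for this unit: start from the generic model.
            model = UnitModel(mean=list(self.generic[unit]))
            self.models[unit] = model
        if model.quality() >= QUALITY_THRESHOLD:
            return False  # selective re-training: model already meets threshold
        model.update(features)
        return True


# Usage: samples captured during authentication feed the user's model set.
user_models = PersonalisedModelSet(generic={"ah": [0.0, 0.0]})
results = [user_models.add_sample("ah", [6.0, 6.0]) for _ in range(6)]
```

In this sketch the first five samples update the model and the sixth is skipped, because the toy quality measure has already reached the threshold. A real system would substitute a proper acoustic model and quality measure.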
Abstract
A method for configuring a speech recognition system comprises obtaining a speech sample utilized by a voice authentication system in a voice authentication process. The speech sample is processed to generate acoustic models for units of speech associated with the speech sample. The acoustic models are stored for subsequent use by the speech recognition system as part of a speech recognition process.
11 Citations
16 Claims
6. A combined speech recognition and voice authentication method, comprising:
responsive to a user being successfully authenticated by a voice authentication function, accessing a personalised set of acoustic language models for use by a speech recognition function in recognising one or more utterances by the user, the acoustic model set containing acoustic language models which have been trained using voice data derived from utterances provided by the user either during enrolment with the authentication function or during one or more subsequent authentications.
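The gating described in claim 6, where the personalised model set is accessed only once the user has been authenticated, might look like the following Python sketch. The function name, the dictionary-based store, and the fallback to generic models are all assumptions made for illustration; the claim itself does not say what happens when authentication fails or no personalised set exists.

```python
def select_model_set(user_id, authenticated, personalised_store, generic_models):
    """Return the model set the recogniser should use for this user.

    personalised_store: hypothetical mapping of user id -> personalised model set.
    generic_models: assumed fallback when no personalised set may be used.
    """
    if authenticated and user_id in personalised_store:
        return personalised_store[user_id]
    return generic_models


# Usage with placeholder model sets.
store = {"alice": {"ah": "alice-model"}}
generic = {"ah": "generic-model"}

for_alice = select_model_set("alice", True, store, generic)
for_failed_auth = select_model_set("alice", False, store, generic)
for_unknown = select_model_set("bob", True, store, generic)
```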
9. A speech recognition system comprising:
a processing module programmed to:
obtain a speech sample utilised to authenticate a user as part of an authentication process;
process the speech sample to train one or more generic acoustic models for speech units associated with the speech sample and to subsequently store the trained acoustic model(s) in a personalised acoustic model set;
selectively re-train the acoustic model(s) based on additional speech samples provided by the user containing corresponding units of speech;
responsive to determining that the user has accessed a speech recognition function, the processing module is further arranged to access the personalised acoustic model set for recognising subsequent user utterances; and
the processing module being further programmed to determine a measure of quality for each of the stored acoustic models and continuing to regenerate the acoustic models until the quality measure reaches a predefined threshold.
-
15. A combined speech recognition and voice authentication method, comprising:
responsive to a user being successfully authenticated by a voice authentication function, accessing a personalised set of acoustic grammar models for use by a speech recognition function in recognising one or more utterances by the user, the acoustic model set containing acoustic grammar models which have been trained using voice data derived from utterances provided by the user either during enrolment with the authentication function or during one or more subsequent authentications; and
further comprising determining a measure of quality for each of the stored acoustic models, wherein the acoustic models are re-trained based on additional speech samples until the corresponding quality measure meets a predefined threshold.
Specification