Method and apparatus for automated speaker parameters adaptation in a deployed speaker verification system
Abstract
Typical speaker verification systems usually employ speakers' audio data collected during an enrollment phase when users enroll with the system and provide respective voice samples. Due to technical, business, or other constraints, the enrollment data may not be large enough or rich enough to encompass different inter-speaker and intra-speaker variations. According to at least one embodiment, a method and apparatus employing classifier adaptation based on field data in a deployed voice-based interactive system comprise: collecting representations of voice characteristics, in association with corresponding speakers, the representations being generated by the deployed voice-based interactive system; updating parameters of the classifier, used in speaker recognition, based on the representations collected; and employing the classifier, with the corresponding parameters updated, in performing speaker recognition.
18 Claims
1. A method of performing speaker recognition, the method comprising:
collecting representations of voice characteristics, in association with corresponding speakers, the representations being generated by a deployed voice-based interactive system;
updating system-level parameters of a classifier used in speaker recognition, based on the representations collected, the representations collected being indicative of inter-speaker and intra-speaker variations in voice characteristics, the system-level parameters updated to improve distinguishability between different speakers by maximizing inter-speaker variations while accommodating intra-speaker variability, wherein updating the system-level parameters of the classifier includes updating the system-level parameters each time a given number of successful identifications of at least one speaker is achieved; and
employing the classifier, with the corresponding system-level parameters updated, in performing speaker recognition.
10. An apparatus of performing speaker recognition, the apparatus comprising:
at least one processor; and
at least one memory with computer code instructions stored thereon, the at least one processor and the at least one memory, with the computer code instructions, being configured to cause the apparatus to:
collect representations of voice characteristics, in association with corresponding speakers, the representations being generated by a deployed voice-based interactive system;
update system-level parameters of a classifier, used in speaker recognition, based on the representations collected, the representations collected being indicative of inter-speaker and intra-speaker variations in voice characteristics, the system-level parameters updated to improve distinguishability between different speakers by maximizing inter-speaker variations while accommodating intra-speaker variability, wherein in updating the system-level parameters of the classifier, the at least one processor and the at least one memory, with the computer code instructions, being configured to cause the apparatus to update the system-level parameters each time a given number of successful identifications of at least one speaker is achieved; and
employ the classifier, with the corresponding system-level parameters updated, in performing speaker recognition.
18. A non-transitory computer-readable medium with computer software instructions stored thereon, the computer software instructions when executed by a processor cause an apparatus to perform:
collecting representations of voice characteristics, in association with corresponding speakers;
updating system-level parameters of a classifier, used in speaker recognition, based on the representations collected, the representations collected being indicative of inter-speaker and intra-speaker variations in voice characteristics, the system-level parameters updated to improve distinguishability between different speakers by maximizing inter-speaker variations while accommodating intra-speaker variability, wherein updating the system-level parameters of the classifier includes updating the system-level parameters each time a given number of successful identifications of at least one speaker is achieved; and
employing the classifier, with the corresponding system-level parameters updated, in performing speaker recognition.
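For illustration only, the collect, update, and employ steps recited in the independent claims can be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the claims do not fix a classifier type, so the system-level parameters are assumed here to be per-dimension weights set by a diagonal Fisher ratio (variance between speakers divided by average variance within a speaker), and the class name `AdaptiveSpeakerClassifier`, the `update_every` threshold, and the weighted-distance score are all hypothetical.

```python
from collections import defaultdict
from statistics import fmean, pvariance


class AdaptiveSpeakerClassifier:
    """Toy classifier; its system-level parameters are per-dimension weights."""

    def __init__(self, dim, update_every=4):
        self.dim = dim
        self.update_every = update_every     # hypothetical "given number" of successes
        self.weights = [1.0] * dim           # system-level parameters (assumed form)
        self.field_data = defaultdict(list)  # speaker id -> collected representations
        self.successes = 0
        self.updates = 0

    def record_success(self, speaker_id, vector):
        """Collect a field representation after a successful identification."""
        self.field_data[speaker_id].append(list(vector))
        self.successes += 1
        # Update the system-level parameters each time the threshold is reached.
        if self.successes % self.update_every == 0:
            self.update_parameters()

    def update_parameters(self):
        """Reweight dimensions by between-speaker over within-speaker variance."""
        usable = [vecs for vecs in self.field_data.values() if len(vecs) >= 2]
        if len(usable) < 2:
            return  # need at least two speakers with two samples each
        ratios = []
        for d in range(self.dim):
            speaker_means = [fmean([v[d] for v in vecs]) for vecs in usable]
            between = pvariance(speaker_means)  # inter-speaker variation
            within = fmean([pvariance([v[d] for v in vecs]) for vecs in usable])
            ratios.append(between / (within + 1e-9))
        total = sum(ratios)
        if total == 0:
            return  # no dimension separates the speakers; keep old weights
        # Normalize so the weights sum to the number of dimensions.
        self.weights = [r * self.dim / total for r in ratios]
        self.updates += 1

    def score(self, a, b):
        """Negative weighted squared distance; a higher score means more similar."""
        return -sum(w * (x - y) ** 2 for w, x, y in zip(self.weights, a, b))
```

In this sketch, a dimension that varies widely within one speaker but barely between speakers receives a small weight, so it contributes little to `score`. A deployed system would more likely adapt a full projection or a probabilistic back-end rather than diagonal weights; that choice is outside what the claims specify.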
Specification