Subword-based speaker verification with multiple-classifier score fusion weight and threshold adaptation

US 6,539,352 B1
Filed: 11/21/1997
Issued: 03/25/2003
Est. Priority Date: 11/22/1996
Status: Expired due to Term

First Claim

Patent Images

1. An automatic speaker verification system comprising:

a receiver, the receiver obtaining enrollment speech over an enrollment channel;

a means, connected to the receiver, for developing an estimate of the enrollment channel;

a first storage device, connected to the receiver, for storing the enrollment channel estimate;

a means for extracting predetermined features of the enrollment speech;

a means, operably connected to the extracting means, for segmenting the predetermined features of the enrollment speech, wherein the features are segmented into a plurality of subwords;

a plurality of classifiers, connected to the segmenting means, wherein the classifiers model the plurality of subwords and output classifier scores and a means, connected to the classifier, for fusing the classifier scores, wherein the fusing means weighs the scores from the classifier models with a fusion constant and combines the weighted scores resulting in a final score for the combined system, and wherein the weighted scores are variable and are dynamically adapted.

View all claims

4 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The voice print system of the present invention is a subword-based, text-dependent automatic speaker verification system that embodies the capability of user-selectable passwords with no constraints on the choice of vocabulary words or the language. Automatic blind speech segmentation allows speech to be segmented into subword units without any linguistic knowledge of the password. Subword modeling is performed using a multiple classifiers. The system also takes advantage of such concepts as multiple classifier fusion and data resampling to successfully boost the performance. Key word/key phrase spotting is used to optimally locate the password phrase. Numerous adaptation techniques increase the flexibility of the base system, and include: channel adaptation, fusion adaptation, model adaptation and threshold adaptation.

76 Citations

View as Search Results

5 Claims

1. An automatic speaker verification system comprising:
- a receiver, the receiver obtaining enrollment speech over an enrollment channel;
  
  a means, connected to the receiver, for developing an estimate of the enrollment channel;
  
  a first storage device, connected to the receiver, for storing the enrollment channel estimate;
  
  a means for extracting predetermined features of the enrollment speech;
  
  a means, operably connected to the extracting means, for segmenting the predetermined features of the enrollment speech, wherein the features are segmented into a plurality of subwords;
  
  a plurality of classifiers, connected to the segmenting means, wherein the classifiers model the plurality of subwords and output classifier scores and a means, connected to the classifier, for fusing the classifier scores, wherein the fusing means weighs the scores from the classifier models with a fusion constant and combines the weighted scores resulting in a final score for the combined system, and wherein the weighted scores are variable and are dynamically adapted.

2. An automatic speaker verification method, comprising the steps of:
- obtaining enrollment speech over an enrollment channel;
  
  storing an estimate of the enrollment channel;
  
  extracting predetermined features of the enrollment speech;
  
  segmenting the enrollment speech, wherein the enrollment speech is segmented into a plurality of subwords;
  
  modeling the plurality of subwords using a plurality of classifier models resulting in an output of classifier scores;
  
  weighing the scores from the classifier models with a fusion constant, wherein the fusion constant is variable and is dynamically adapted; and
  
  combining the weighted scores resulting in a final score for the combined system.

3. An automatic speaker verification method, wherein the results of prior verifications are stored, including the steps of:
- obtaining test speech from a user seeking authorization or identification;
  
  generating subwords of the test speech;
  
  scoring the subwords against subwords of a known individual using a plurality of modeling classifiers;
  
  storing the results of each model classifiers as a classifier score;
  
  fusing the results of each classifier score using a fusion constant and weighing function to generate a final score;
  
  comparing the final score to a threshold value to determine whether the test speech and enrollment speech are from the known individual;
  
  determining that fusion adaptation inclusion criteria are met; and
  
  changing the fusion constant to provide more weight to the classifier score which more accurately corresponds to the threshold value.

4. An automatic speaker verification method, wherein the results of prior verifications are stored, including the steps of:
- obtaining test speech from a user seeking authorization or identification;
  
  generating subwords of the test speech;
  
  scoring the subwords against subwords of a known individual using a plurality of modeling classifiers;
  
  storing the results of each model classifiers as a classifier score;
  
  fusing the results of each classifier score using a fusion constant and weighing function to generate a final score;
  
  comparing final score to a threshold value to determine whether the test speech and enrollment speech are from the known individual;
  
  determining that model adaptation inclusion criteria are met, including that one or more verifications have been successful; and
  
  training the model classifiers with previously stored enrollment speech and with speech corresponding to the successful verifications, including the steps of generating a new threshold value; and
  
  storing the new threshold value.

5. An automatic speaker verification method, wherein the results of prior verifications are stored, including the steps of:
- obtaining test speech from a user seeking authorization or identification;
  
  generating subwords of the test speech;
  
  scoring the subwords against subwords of a known individual using a plurality of modeling classifiers;
  
  storing the results of each model classifiers as a classifier score;
  
  fusing the results of each classifier score using a fusion constant and weighing function to generate a final score;
  
  comparing the final score to a threshold value to determine whether the test speech and enrollment speech are from the known individual;
  
  determining that threshold adaptation inclusion criteria are met;
  
  analyzing the stored final scores;
  
  calculating a new threshold value in response to the analyzation; and
  
  storing the new threshold value.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
SpeechWorks International, Inc. (Microsoft Corporation)
Original Assignee
SpeechWorks International, Inc. (Microsoft Corporation)
Inventors
Sharma, Manish, Zhang, Xiaoyu, Mammone, Richard J.
Primary Examiner(s)
SMITS, TALIVALDIS IVARS

Application Number

US08/976,280
Time in Patent Office

1,950 Days
Field of Search

704/249, 704/250, 704/273
US Class Current

704/249
CPC Class Codes

G10L 15/04   Segmentation; Word boundary...

G10L 15/07   to the speaker

G10L 15/10   using distance or distortio...

G10L 15/16   using artificial neural net...

G10L 15/1815   Semantic context, e.g. disa...

G10L 17/04   Training, enrolment or mode...

G10L 17/10   Multimodal systems, i.e. ba...

Subword-based speaker verification with multiple-classifier score fusion weight and threshold adaptation

First Claim

4 Assignments

0 Petitions

Accused Products

Abstract

76 Citations

5 Claims

Specification

Solutions

Use Cases

Quick Links

Subword-based speaker verification with multiple-classifier score fusion weight and threshold adaptation

First Claim

4 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

76 Citations

5 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links