Maximum likelihood method for finding an adapted speaker model in eigenvoice space

US 6,263,309 B1
Filed: 04/30/1998
Issued: 07/17/2001
Est. Priority Date: 04/30/1998
Status: Expired due to Term

First Claim

Patent Images

1. A method for performing speaker adaptation comprising the steps of:

constructing an eigenspace to represent a plurality of training speakers by providing a set of models for said training speakers, expressing said set of models as supervectors of a first predetermined dimension, and performing principal component analysis upon said supervectors to generate a set of principal component vectors of a second predetermined dimension substantially lower than said first predetermined dimension that define said eigenspace;

generating an adapted model, using input speech from a new speaker to generate a maximum likelihood vector and to train said adapted model, while using said set of principal component vectors and said maximum likelihood vector to constrain said adapted model such that said adapted model lies within said eigenspace.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A set of speaker dependent models is trained upon a comparatively large number of training speakers, one model per speaker, and model parameters are extracted in a predefined order to construct a set of supervectors, one per speaker. Principle component analysis is then performed on the set of supervectors to generate a set of eigenvectors that define an eigenvoice space. If desired, the number of vectors may be reduced to achieve data compression. Thereafter, a new speaker provides adaptation data from which a supervector is constructed by constraining this supervector to be in the eigenvoice space based on a maximum likelihood estimation. The resulting coefficients in the eigenspace of this new speaker may then be used to construct a new set of model parameters from which an adapted model is constructed for that speaker. Environmental adaptation may be performed by including environmental variations in the training data.

70 Citations

View as Search Results

4 Claims

1. A method for performing speaker adaptation comprising the steps of:
- constructing an eigenspace to represent a plurality of training speakers by providing a set of models for said training speakers, expressing said set of models as supervectors of a first predetermined dimension, and performing principal component analysis upon said supervectors to generate a set of principal component vectors of a second predetermined dimension substantially lower than said first predetermined dimension that define said eigenspace;
  
  generating an adapted model, using input speech from a new speaker to generate a maximum likelihood vector and to train said adapted model, while using said set of principal component vectors and said maximum likelihood vector to constrain said adapted model such that said adapted model lies within said eigenspace.
- View Dependent Claims (2, 3, 4)
- - 2. The method of claim 1 wherein said step of generating a maximum likelihood vector comprises:
3. The method of claim 1 wherein said adapted model is constrained by multiplying said maximum likelihood vector with said principal component vectors.
4. The method of claim 2 wherein said maximizing step is performed by:
- representing said maximum likelihood vector as a set of eigenvalue variables;
  
  taking a first derivative of said auxiliary function with respect to said eigenvalue variables; and
  
  solving for the corresponding values of said eigenvalue variables when said first derivative is equated to zero.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Panasonic Intellectual Property Corporation of America (Panasonic Holdings Corporation)
Original Assignee
Matsushita Electric Industrial Company Limited (Panasonic Holdings Corporation)
Inventors
Junqua, Jean-Claude, Nguyen, Patrick, Kuhn, Roland
Primary Examiner(s)
Tsang, Fan
Assistant Examiner(s)
Opsasnick, Michael N.

Application Number

US09/070,054
Time in Patent Office

1,174 Days
Field of Search

704/231, 704/255, 704/256
US Class Current

704/239
CPC Class Codes

G10L 15/07 to the speaker

Maximum likelihood method for finding an adapted speaker model in eigenvoice space

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

70 Citations

4 Claims

Specification

Use Cases

Quick Links

Others

Maximum likelihood method for finding an adapted speaker model in eigenvoice space

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

70 Citations

4 Claims

Specification

Subscription Required

Use Cases

Quick Links

Others