Speech recognition and teaching apparatus able to rapidly adapt to difficult speech of children and foreign speakers

US 6,253,181 B1
Filed: 01/22/1999
Issued: 06/26/2001
Est. Priority Date: 01/22/1999
Status: Expired due to Term

First Claim

Patent Images

1. A speech recognition apparatus that adapts an initial speech model based on input speech from the user, comprising:

a speech model that represents speech as a plurality of speech unit models associated with a plurality of speech units;

a speech recognizer that processes input speech from a user using said speech model to recognize uttered speech units within said input speech;

a confidence measurement system associated with said speech recognizer for associating a confidence measure with each of said uttered speech units;

an adaptation system having data store containing information reflecting a priori knowledge about a speaker space, said adaptation system being operative to select uttered speech units that exceed a predetermined confidence measure and to use said selected uttered speech units and said information reflecting a priori knowledge to adapt said speech model; and

wherein said adaptation system includes a data store containing a set of eigenspace basis vectors representing a plurality of training speakers and wherein said adaptation system uses said selected uttered speech units to train an adapted speech model while using said basis vectors to constrain said adapted speech model such that said adapted speech model lies within said eigenspace.

View all claims

4 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The recognizer tests input utterances using a confidence measure to select words of high recognition confidence for use in the adaptation process. Adaptation is performed rapidly using a priori knowledge of about the class of speakers who will be using the system. This a priori knowledge can be expressed using eigenvoice basis vectors that capture information about the entire targeted user population. The dialogue system may also use the confidence measure to output a pronunciation example to the user, based on the confidence that the system has in the results of recognition, given the different possibilities that can be recognized. The dialogue system may also provide voiced prompts that teach the user how to correctly pronounce words.

Citations

6 Claims

1. A speech recognition apparatus that adapts an initial speech model based on input speech from the user, comprising:
- a speech model that represents speech as a plurality of speech unit models associated with a plurality of speech units;
  
  a speech recognizer that processes input speech from a user using said speech model to recognize uttered speech units within said input speech;
  
  a confidence measurement system associated with said speech recognizer for associating a confidence measure with each of said uttered speech units;
  
  an adaptation system having data store containing information reflecting a priori knowledge about a speaker space, said adaptation system being operative to select uttered speech units that exceed a predetermined confidence measure and to use said selected uttered speech units and said information reflecting a priori knowledge to adapt said speech model; and
  
  wherein said adaptation system includes a data store containing a set of eigenspace basis vectors representing a plurality of training speakers and wherein said adaptation system uses said selected uttered speech units to train an adapted speech model while using said basis vectors to constrain said adapted speech model such that said adapted speech model lies within said eigenspace.
- View Dependent Claims (2, 3)
- - 2. The speech recognition apparatus of claim 1 further comprising a dialogue system coupled to said confidence measurement system for selecting at least a portion of said uttered speech units and for prompting said user based on said selected portion of said uttered speech units.
  - 3. The speech recognition apparatus of claim 2 further comprising speech playback system containing speech data representing prerecorded speech, said playback system coupled with said dialogue system for confirming said portion of said uttered speech units to said user by using said speech data to provide an audible playback corresponding to said portion of said uttered speech units.

4. A speech recognition apparatus that adapts an initial speech model based on input speech from the user, comprising:
- a speech model that represents speech as a plurality of speech unit models associated with a plurality of speech units;
  
  a speech recognizer that processes input speech from a user using said speech model to recognize uttered speech units within said input speech;
  
  a confidence measurement system associated with said speech recognizer for associating a confidence measure with each of said uttered speech units;
  
  an adaptation system having data store containing information reflecting a priori knowledge about a speaker space, said adaptation system being operative to select uttered speech units that exceed a predetermined confidence measure and to use said selected uttered speech units and said information reflecting a priori knowledge to adapt said speech model;
  
  wherein said adaptation system includes a data store containing an eigenspace data structure that represents a plurality of training speakers as a set of models for said training speakers that has been dimensionally reduced to generate a set of basis vectors that define said eigenspace; and
  
  wherein said adaptation system uses said selected uttered speech units to train an adapted speech model while using said basis vectors to constrain said adapted speech model such that said adapted speech model lies within said eigenspace.
- View Dependent Claims (5, 6)
- - 5. The speech recognition apparatus of claim 4 further comprising a dialogue system coupled to said confidence measurement system for selecting at least a portion of said uttered speech units and for prompting said user based on said selected portion of said uttered speech units.
  - 6. The speech recognition apparatus of claim 5 further comprising speech playback system containing speech data representing prerecorded speech, said playback system coupled with said dialogue system for confirming said portion of said uttered speech units to said user by using said speech data to provide an audible playback corresponding to said portion of said uttered speech units.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Sovereign Peak Ventures, LLC (Dominion Harbor Enterprises, LLC)
Original Assignee
Matsushita Electric Industrial Company Limited (Panasonic Holdings Corporation)
Inventors
Junqua, Jean-Claude
Primary Examiner(s)
Korzuch, William R.
Assistant Examiner(s)
Storm, Donald L.

Application Number

US09/235,181
Time in Patent Office

886 Days
Field of Search

704/244, 704/255, 704/236, 704/245, 704/243
US Class Current

704/255
CPC Class Codes

G10L 15/063 Training

G10L 15/065 Adaptation

Speech recognition and teaching apparatus able to rapidly adapt to difficult speech of children and foreign speakers

First Claim

4 Assignments

0 Petitions

Accused Products

Abstract

Citations

6 Claims

Specification

Solutions

Use Cases

Quick Links

Speech recognition and teaching apparatus able to rapidly adapt to difficult speech of children and foreign speakers

First Claim

4 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

6 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links