Mandarin speech input method for Chinese computers and a mandarin speech recognition machine

US 5,220,639 A
Filed: 12/01/1989
Issued: 06/15/1993
Est. Priority Date: 12/01/1989
Status: Expired due to Term

First Claim

Patent Images

1. A speech recognition method comprising steps of:

segmented a training speech syllable into an initial part and a final part;

training a Continuous Hidden Markov Model (CHMM) on the initial part to create an initial part model having trained initial part model parameters;

training a CHMM on the final part to create a final part model having trained final part model parameters;

training a CHMM on the training speech syllable to create a syllable model using the trained initial part parameter values and the trained final part parameter values as starting parameters for the syllable model;

operating on an object speech sample with the syllable model;

recognizing the object speech sample as an object speech syllable based on a degree of match of the object speech sample to the syllable model;

representing the object speech sample as a Chinese character in accordance with the object speech syllable.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method of inputting Chinese characters into a computer directly from Mandarin speech which recognizes a series of monosyllables by separately recognizing syllables and Mandarin tones and assembling the recognized parts to recognize the mono-syllable using Hidden Markov Models. The recognized mono-syllable is used by a Markov Chinese Language Model in a Linguistic decoder section to determine the corresponding Chinese character A Mandarin dictation machine which uses the above method, using a speech input device to receive the Mandarin speech and digitizing it so a personal computer can further process that information. A pitch frequency detector, a Voice signal pre-processing unit, a Hidden Markov Model processor, and a training facility are all attached to the personal computer to perform their associated functions of the method above.

270 Citations

10 Claims

1. A speech recognition method comprising steps of:
- segmented a training speech syllable into an initial part and a final part;
  
  training a Continuous Hidden Markov Model (CHMM) on the initial part to create an initial part model having trained initial part model parameters;
  
  training a CHMM on the final part to create a final part model having trained final part model parameters;
  
  training a CHMM on the training speech syllable to create a syllable model using the trained initial part parameter values and the trained final part parameter values as starting parameters for the syllable model;
  
  operating on an object speech sample with the syllable model;
  
  recognizing the object speech sample as an object speech syllable based on a degree of match of the object speech sample to the syllable model;
  
  representing the object speech sample as a Chinese character in accordance with the object speech syllable.
- View Dependent Claims (2, 3, 4, 5)
- - 2. A method as in claim 1 further comprising a step of training a CHMM on a Mandarin tone in an input speech syllable to create a tone model.
  - 3. A method as in claim 2, wherein the step of training a CHMM to create a tone model includes a step of training on pitch frequency, short time energy, and duration of an input speech syllable.
  - 4. A method as in claim 1 further comprising steps of:
    - training a Markov Model (MM) on a sequence of chinese characters as used in context to create a Chinese language model;
      
      operating on a sequence of object speech syllables in the object speech sample with the Chinese language model; and
      
      representing the object speech sample as a Chinese character sequence in accordance with a match of the sequence of object speech syllables to the Chinese language model, thereby representing the object speech sample as a Chinese character sequence in accordance with a sequence of matches to the object speed syllables.
  - 5. A speech recognition apparatus as in claim 4, further comprising:
    - a display connected to the computer for illustrating a recognized Chinese syllable; and
      
      an input device connected to the computer for receiving corrections from an operator.

6. A speech recognition apparatus for Mandarin speech including high level, high rising, low dipping and high falling lexical tones, comprising:
- a speech signal filter for receiving a speech signal and creating a filtered analog signal;
  
  an analog-to-digital (A/D) converter connected to the speech signal filter for converting the filtered analog signal to a digital speech signal;
  
  a computer connected to the A/D converter for receiving and processing the digital signal;
  
  a pitch frequency detector connected to the computer for detecting characteristics of the pitch frequency of the speech signal thereby recognizing tones in the speech signal;
  
  a speech signal pre-processor connected to the computer for detecting the endpoints of syllables of speech signals thereby defining a beginning and ending of a syllable;
  
  a Hidden Markov Model processor connected to the computer for determining degrees of match between the speech signal and a syllable model, a tone model and a language model and recognizing speech signal syllables based on the degrees of match;
  
  a training apparatus connected to the computer for training an initial part Hidden Markov model and a final part Hidden Markov model and for training a syllable model based on trained parameters of the initial part Hidden Markov model and the final part Hidden Markov model.
- View Dependent Claims (7, 8)
- - 7. A speech recognition apparatus as in claim 6 further comprising:
    - a Chinese Language Model training apparatus connected to the computer for training a Markov Chinese Language Model.
  - 8. A speech recognition apparatus as in claim 6 further comprising:
    - a tone training apparatus connected to the computer for training the tone model.

9. A Mandarin tone recognition method comprising steps of:
- dividing training syllable utterances into five groups according to tones of the syllable utterances;
  
  training a Hidden Markov Model (HMM) on the five groups of training syllable utterances to create a Mandarin tone model;
  
  operating on an object speech sample with the Mandarin tone model;
  
  recognizing the object speech sample as an object Mandarin tone based on a degree of match of the object speech sample to the Mandarin tone model;
  
  representing the object speech sample as a Mandarin tone in accordance with the object Mandarin tone.
- View Dependent Claims (10)
- - 10. A method as in claim 9, wherein the step of training a HMM to create the Mandarin tone models includes a step of training on pitch frequency, short time energy, and duration of an input speech syllable.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
National Science Council
Original Assignee
National Science Council
Inventors
Lee, Lin S.
Primary Examiner(s)
Fleming, Michael R.
Assistant Examiner(s)
Doerrler, Michelle

Application Number

US07/444,405
Time in Patent Office

1,292 Days
Field of Search

381/41-45, 364/513.5
US Class Current

704/200
CPC Class Codes

G10L 15/144 Training of HMMs

G10L 25/15 the extracted parameters be...

Mandarin speech input method for Chinese computers and a mandarin speech recognition machine

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

270 Citations

10 Claims

Specification

Use Cases

Quick Links

Others

Mandarin speech input method for Chinese computers and a mandarin speech recognition machine

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

270 Citations

10 Claims

Specification

Subscription Required

Use Cases

Quick Links

Others