Speech recognition device and speech recognition method

US 20050119883A1
Filed: 07/13/2001
Published: 06/02/2005
Est. Priority Date: 07/13/2000
Status: Active Grant

First Claim

Patent Images

1. A speech recognition device for recognizing speech of unspecified speakers using hidden Markov models, the device comprising:

detection means for detecting feature parameters of input speech;

data storage means for storing transition probabilities and output probability functions, which use, as arguments, said feature parameters for multiple predetermined types of hidden Markov models, the models each representing a plurality of predetermined words; and

recognition means for determining the occurrence probability that a sequence of said feature parameters will occur using said hidden Markov models, said recognition means assigning each of said words a state sequence of one hidden Markov model common to said multiple types of hidden Markov models, the occurrence probability determined by selecting a largest product of a transition probability with an output probability function associated with each state of said common hidden Markov model, wherein the input speech is recognized based on the occurrence probability thus determined.

View all claims

2 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Each word to be recognized is represented by gender-specific hidden Markov models that are stored in a ROM 6 along with output probability functions and preset transition probabilities. A speech recognizer 4 determines an occurrence probability of a feature parameter sequence detected by a feature value detector 3 using the hidden Markov models. The speech recognizer 4 determines the occurrence probability by giving each word a state sequence of one hidden Markov model common to the gender-specific hidden Markov models, multiplying each preset pair of an output probability function value and a transition probability together among the output probability functions and transition probabilities stored in the ROM 6, selecting the largest product as the probability of each state of the common hidden Markov model, determining the occurrence probability based on the selected product, and recognizing the input speech based on the occurrence probability thus determined.

25 Citations

View as Search Results

10 Claims

1. A speech recognition device for recognizing speech of unspecified speakers using hidden Markov models, the device comprising:
- detection means for detecting feature parameters of input speech;
  
  data storage means for storing transition probabilities and output probability functions, which use, as arguments, said feature parameters for multiple predetermined types of hidden Markov models, the models each representing a plurality of predetermined words; and
  
  recognition means for determining the occurrence probability that a sequence of said feature parameters will occur using said hidden Markov models, said recognition means assigning each of said words a state sequence of one hidden Markov model common to said multiple types of hidden Markov models, the occurrence probability determined by selecting a largest product of a transition probability with an output probability function associated with each state of said common hidden Markov model, wherein the input speech is recognized based on the occurrence probability thus determined.
- View Dependent Claims (2, 3, 9)
- - 2. The speech recognition device according to claim 1, wherein recognition means shares the transition probability of each state of said hidden Markov model among said multiple types of hidden Markov models in order to determine said occurrence probability.
  - 3. The speech recognition device according to claim 1, wherein the multiple predetermined types of hidden Markov models is selected from the group comprising gender-specific hidden Markov models, age-specific multiple hidden Markov models, and multiple hidden Markov models based on voice data which contain different types of noise.
  - 9. The speech recognition device according to claim 2, wherein the multiple predetermined types of hidden Markov models is selected from the group comprising gender-specific hidden Markov models, age-specific multiple hidden Markov models, and multiple hidden Markov models based on voice data which contain different types of noise.

4. A speech recognition device for recognizing speech of unspecified speakers using hidden Markov models, said device comprising:
- detection means for detecting feature parameters of input speech;
  
  data storage means for storing transition probabilities and output probability functions, which use as arguments, said feature parameters for hidden Markov models (HMMs), each of said HMM representing a plurality of predetermined words and for a plurality of hidden Markov models which partially express differences in pronunciations of each of words which are allowed multiple pronunciations out of said predetermined words; and
  
  recognition means for determining the occurrence probability that a sequence of said feature parameters will occur using said hidden Markov models, said recognition means sharing a state sequence of one hidden Markov model among said plurality of hidden Markov models for partial expression, and the occurrence probability determined by selecting a largest product of a transition probability with an output probability function associated with each state of said plurality of hidden Markov models for partial expression, wherein the input speech is recognized based on the occurrence probability thus determined.

5. A method for recognizing input speech:
- detecting feature parameters of said input speech;
  
  retrieving transition probabilities and output probability functions, said transition probabilities and said output probability functions associated with multiple predetermined types of hidden Markov models which represent each of a plurality of predetermined words;
  
  determining the occurrence probability that a sequence of said feature parameters will occur using said hidden Markov models, wherein each of said words is represented by a hidden Markov model common to said multiple types of hidden Markov models;
  
  multiplying each preset pair of an output probability function value and a transition probability together among the output probability functions and transition probabilities and selecting the largest product as the probability of each state of said common hidden Markov model; and
  
  recognizing the input speech by selecting the hidden Markov model having the largest occurrence probability.
- View Dependent Claims (6, 7, 10)
- - 6. The speech recognition method according to claim 5, wherein the transition probability of each state of said hidden Markov model is shared among said multiple types of hidden Markov models in order to determine said occurrence probability.
  - 7. The speech recognition method according to claim 5, wherein the multiple predetermined types of hidden Markov models is selected from the group comprising gender-specific hidden Markov models, age-specific hidden Markov models, and multiple hidden Markov models based on voice data which contain different types of noise.
  - 10. The speech recognition method according to claim 6, wherein the multiple predetermined types of hidden Markov models is selected from the group comprising gender-specific hidden Markov models, age-specific hidden Markov models, and multiple hidden Markov models based on voice data which contain different types of noise.

8. A speech recognition method comprising the steps of:
- detecting feature parameters of said input speech;
  
  retrieving transition probabilities and output probability functions, said transition probabilities and said output probability functions associated with a plurality of hidden Markov models which represent each of a plurality of predetermined words and with a plurality of hidden Markov models that partially express differences in pronunciations of each of said words which are allowed multiple pronunciations out of said predetermined words;
  
  determining the occurrence probability that a sequence of said feature parameters will occur using said hidden Markov models, wherein for each of said words allowing multiple pronunciations a common hidden Markov model shares a state sequence among said plurality of hidden Markov models for partial expression;
  
  multiplying each pair of an output probability function value and a transition probability together among the output probability functions and transition probabilities characterizing said plurality of hidden Markov models for partial expression and selecting the largest product as the probability of each state of said common hidden Markov model; and
  
  recognizing the input speech by selecting the hidden Markov model having the largest occurrence probability.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Asahi Kasei Kabushiki Kaisha
Original Assignee
Asahi Kasei Kabushiki Kaisha
Inventors
Miyazaki, Toshiyuki, Ishikawa, Yoji

Granted Patent

US 7,272,561 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/231
CPC Class Codes

G10L 15/142   Hidden Markov Models [HMMs]

G10L 15/28   Constructional details of s...

G10L 15/32   Multiple recognisers used i...

Speech recognition device and speech recognition method

First Claim

2 Assignments

0 Petitions

Accused Products

Abstract

25 Citations

10 Claims

Specification

Solutions

Use Cases

Quick Links

Speech recognition device and speech recognition method

First Claim

2 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

25 Citations

10 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links