SPEECH RECOGNITION SYSTEM AND PROGRAM THEREOF

US 20080183472A1
Filed: 01/09/2008
Published: 07/31/2008
Est. Priority Date: 03/15/2002
Status: Active Grant

Abstract

Speech recognition is performed by matching a characteristic quantity of an inputted speech against a composite HMM, obtained by synthesizing a speech HMM (hidden Markov model) and a noise HMM, for each speech frame of the inputted speech.
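
The abstract does not say how the speech HMM and the noise HMM are synthesized. As a rough illustration only, the sketch below assumes a product-state composition: every composite state pairs one speech state with one noise state, the two chains are taken to evolve independently, and log-spectral means are combined with a log-add rule. The function names, the toy matrices, and the log-add rule are all assumptions for illustration and are not taken from the patent.

```python
import math
from itertools import product

def compose_transitions(a_speech, a_noise):
    """Transition matrix over product states (i, j) -> (k, l).

    Assumes the speech and noise chains evolve independently, so each
    composite probability is the product of the two individual ones.
    """
    n_s, n_n = len(a_speech), len(a_noise)
    states = list(product(range(n_s), range(n_n)))
    matrix = [
        [a_speech[i][k] * a_noise[j][l] for (k, l) in states]
        for (i, j) in states
    ]
    return states, matrix

def compose_means(mu_speech, mu_noise):
    """Log-spectral mean of each composite state via log-add.

    Assumes speech and noise add in the linear spectral domain, so each
    feature dimension becomes log(exp(mu_s) + exp(mu_n)).
    """
    return {
        (i, j): [math.log(math.exp(s) + math.exp(n)) for s, n in zip(ms, mn)]
        for i, ms in enumerate(mu_speech)
        for j, mn in enumerate(mu_noise)
    }

# Toy values (hypothetical): 2-state speech HMM, 2-state noise HMM,
# one-dimensional log-spectral features.
a_speech = [[0.7, 0.3], [0.0, 1.0]]
a_noise = [[0.9, 0.1], [0.2, 0.8]]
states, a_comp = compose_transitions(a_speech, a_noise)
means = compose_means([[2.0], [3.0]], [[0.0], [1.0]])
```

With two speech states and two noise states the composite model has four states, and each row of `a_comp` still sums to one because both input matrices are row-stochastic.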

99 Citations

17 Claims

1. (canceled)

2. A speech recognition apparatus comprising:
- a characteristic quantity extraction unit for extracting a characteristic quantity of an inputted speech to be recognized, wherein said apparatus performs speech recognition by matching between a predetermined speech and a phoneme hidden Markov model of speech data previously recorded;
- a composite model generation unit for generating a composite model by synthesizing the phoneme hidden Markov model of speech data and a hidden Markov model of noise data previously recorded; and
- a speech recognition unit for recognizing the inputted speech by matching the characteristic quantity being extracted in the characteristic quantity extraction unit from the inputted speech to the composite model generated in the composite model generation unit, wherein the speech recognition unit executes matching between the characteristic quantity of the inputted speech and the composite model for each of adequate segments defined by punctuating a speech sequence in the inputted speech, and wherein the speech recognition unit selects the composite model to be matched to the characteristic quantity of the inputted speech independently of each speech frame thereof and executes matching between the characteristic quantity of the inputted speech and the composite model.
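
One possible reading of the frame-independent selection in claim 2 is that the recognizer may switch composite models from one frame to the next instead of committing to a single model for the whole utterance. The sketch below illustrates that reading under strong simplifying assumptions: each composite model is reduced to one 1-D Gaussian, and the labels `speech+car` and `speech+music` are hypothetical. It is not the patent's matching procedure.

```python
import math

def log_gauss(x, mean, var):
    """Log density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)

def match_per_frame(frames, composite_models):
    """Pick the best-scoring composite model separately for every frame.

    composite_models maps a (hypothetical) label to (mean, variance).
    Returns the per-frame choices and the accumulated log-likelihood.
    """
    choices, total = [], 0.0
    for x in frames:
        label, score = max(
            ((name, log_gauss(x, m, v)) for name, (m, v) in composite_models.items()),
            key=lambda pair: pair[1],
        )
        choices.append(label)
        total += score
    return choices, total

# Toy feature values (hypothetical): the noise type changes mid-utterance.
models = {"speech+car": (1.0, 1.0), "speech+music": (4.0, 1.0)}
choices, total = match_per_frame([0.8, 1.2, 4.1, 3.9, 1.0], models)
```

Because the choice is made per frame, the accumulated score is never lower than the score obtained by holding any single composite model fixed over all frames.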

3. (canceled)

4. A speech recognition apparatus comprising:
- a speech database storing speech data as models for speech recognition;
- a noise database storing noise data assumed to generate under a predetermined noise environment;
- a composite model generation unit for generating a composite model by synthesizing a speech model generated based on the speech data read out from the speech database and a noise model generated based on the noise data read out from the noise database; and
- a speech recognition unit for performing speech recognition by matching between a characteristic quantity of an inputted speech to be recognized and the composite model generated in the composite model generation unit independently of each speech frame of the inputted speech.

5. (canceled)

6. (canceled)

7. (canceled)

8. (canceled)

9. (canceled)

10. A computer program product comprising a tangible storage medium readable by a processing circuit and storing computer-readable instructions for execution by the processing circuit for performing a method of speech recognition, the method comprising steps of:
- extracting a characteristic quantity of an inputted speech to be recognized;
- generating a composite model including synthesizing a phoneme hidden Markov model of speech data previously recorded and a hidden Markov model of noise data previously recorded;
- recognizing the inputted speech including matching between the characteristic quantity of the inputted speech and the composite model for each of adequate segments defined by punctuating a speech sequence in the inputted speech; and
- selecting the composite model to be matched to the characteristic quantity of the inputted speech independently of each speech frame thereof and executing matching between the characteristic quantity of the inputted speech and the composite model.
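
The claims do not state how the speech sequence is punctuated into segments. Purely as an illustration, the sketch below assumes the segments are delimited by pauses, where a pause is a run of low-energy frames. The function name `punctuate`, the energy threshold, and the minimum pause length are hypothetical choices, not details from the patent.

```python
def punctuate(energies, threshold, min_pause=2):
    """Split frame indices into segments separated by pauses.

    A pause is a run of at least `min_pause` consecutive frames whose
    energy is below `threshold` (both are hypothetical tuning values).
    Returns a list of (start, end) index pairs, end exclusive.
    """
    segments, start, quiet = [], None, 0
    for t, e in enumerate(energies):
        if e >= threshold:
            if start is None:
                start = t          # first active frame opens a segment
            quiet = 0
        elif start is not None:
            quiet += 1
            if quiet >= min_pause:  # pause is long enough: close the segment
                segments.append((start, t - quiet + 1))
                start, quiet = None, 0
    if start is not None:           # close a segment still open at the end
        segments.append((start, len(energies) - quiet))
    return segments

# Toy per-frame energies (hypothetical values).
energies = [0.1, 0.9, 0.8, 0.1, 0.7, 0.1, 0.1, 0.1, 0.6, 0.9]
segments = punctuate(energies, threshold=0.5)
```

Here the single quiet frame at index 3 is too short to count as a pause, so it stays inside the first segment, while the three quiet frames at indices 5 to 7 separate the two segments. Matching would then be run once per returned segment.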

11. (canceled)

12. (canceled)

13. (canceled)

14. (canceled)

15. (canceled)

16. (canceled)

17. (canceled)

Current Assignee: Nuance Communications, Inc. (Microsoft Corporation)
Original Assignee: International Business Machines Corporation
Inventors: Takiguchi, Tetsuya; Nishimura, Masafumi

Granted Patent: US 7,660,717 B2
US Class Current: 704/256
CPC Class Codes:
G10L 15/02   Feature extraction for spee...
G10L 15/20   Speech recognition techniqu...
G10L 15/30   Distributed recognition, e....
