Combined engine system and method for voice recognition

US 6,671,669 B1
Filed: 07/18/2000
Issued: 12/30/2003
Est. Priority Date: 07/18/2000
Status: Expired due to Term

1 Assignment

0 Petitions

Abstract

A method and system that combines voice recognition engines and resolves any differences between the results of individual voice recognition engines. A speaker independent (SI) Hidden Markov Model (HMM) engine, a speaker independent Dynamic Time Warping (DTW-SI) engine and a speaker dependent Dynamic Time Warping (DTW-SD) engine are combined. Combining and resolving the results of these engines results in a system with better recognition accuracy and lower rejection rates than using the results of only one engine.
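
The combination described in the abstract can be pictured as one set of extracted features being handed to several engines, each of which returns its own ranked hypotheses. The sketch below is only an illustration under stated assumptions: the engine labels, the `(word, score)` tuple layout, the "higher score means more likely" convention, and the names `run_engines` and `stub_engine` are all hypothetical and do not come from the patent. The engines are stubs that return canned hypotheses rather than real HMM or DTW recognizers.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Features = Sequence[float]              # speech parameters from the acoustic processor (assumed shape)
Hypothesis = Tuple[str, float]          # (word, score); higher score = more likely (assumed)
Engine = Callable[[Features], List[Hypothesis]]


def stub_engine(canned: List[Hypothesis]) -> Engine:
    """Build a placeholder engine that ignores its input and returns fixed hypotheses."""
    def engine(features: Features) -> List[Hypothesis]:
        return list(canned)
    return engine


def run_engines(features: Features, engines: Dict[str, Engine]) -> Dict[str, List[Hypothesis]]:
    """Feed the same features to every engine and rank each engine's hypotheses, best first."""
    return {
        name: sorted(engine(features), key=lambda h: h[1], reverse=True)
        for name, engine in engines.items()
    }


if __name__ == "__main__":
    engines = {
        "HMM-SI": stub_engine([("call", 2.0), ("dial", 5.0)]),
        "DTW-SI": stub_engine([("dial", 4.0), ("stop", 1.0)]),
        "DTW-SD": stub_engine([("dial", 6.0), ("call", 3.0)]),
    }
    for name, hyps in run_engines([0.1, 0.2, 0.3], engines).items():
        print(name, hyps)
```

Keeping every engine behind the same callable signature means the later decision step only has to deal with ranked hypothesis lists, whichever recognizer produced them.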

93 Citations

15 Claims

1. A voice recognition system, comprising:
- an acoustic processor configured to extract speech parameters from digitized speech samples of an utterance;
  
  a plurality of voice recognition engines coupled to the acoustic processor, each voice recognition engine configured to produce a plurality of hypotheses; and
  
  decision logic configured to compare a most likely hypothesis of a first voice recognition engine to a second most likely hypothesis of the first voice recognition engine to form a first difference, delta 1;
  
  compare a most likely hypothesis of the second voice recognition engine to a second most likely hypothesis of the second voice recognition engine to form a second difference, delta 2;
  
  add delta 1 and delta 2 to form a delta sum; and
  
  accept the most likely hypothesis of the first voice recognition engine if the most likely hypothesis of the first voice recognition engine is equal in likeliness to the most likely hypothesis of the first voice recognition engine and the delta sum is greater than a first predetermined threshold.
- Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
- - 2. The voice recognition system of claim 1, wherein the plurality of voice recognition engines includes a speaker-independent voice recognition engine.
  - 3. The voice recognition system of claim 1, wherein the plurality of voice recognition engines includes a speaker-dependent voice recognition engine.
  - 4. The voice recognition system of claim 2, wherein the plurality of voice recognition engines includes a speaker-dependent voice recognition engine.
  - 5. The voice recognition system of claim 4, wherein the plurality of voice recognition engines includes a speaker-independent Dynamic Time Warping voice recognition engine.
  - 6. The voice recognition system of claim 4, wherein the plurality of voice recognition engines includes a speaker-independent Hidden Markov Model voice recognition engine.
  - 7. The voice recognition system of claim 4, wherein the plurality of voice recognition engines includes a speaker-dependent Dynamic Time Warping voice recognition engine.
  - 8. The voice recognition system of claim 4, wherein the plurality of voice recognition engines includes a speaker-dependent Hidden Markov Model recognition engine.
  - 9. The voice recognition system of claim 4, wherein the plurality of voice recognition engines includes a speaker-dependent Dynamic Time Warping voice recognition engine and a speaker-independent Dynamic Time Warping engine.
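
The decision logic recited in claim 1 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation. Each hypothesis is assumed to carry a numeric score where higher means more likely, and each delta is taken as the best score minus the second-best score of one engine. The claim as printed compares the first engine's most likely hypothesis with itself; the sketch assumes the intended test is that both engines rank the same word first, which is an interpretation and not something the text above confirms. The names `Hypothesis`, `top_two_delta`, and `accept_first` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Hypothesis:
    word: str     # candidate vocabulary entry (hypothetical field name)
    score: float  # higher = more likely (assumed convention)


def top_two_delta(hyps: List[Hypothesis]) -> float:
    """Gap between the most likely and second most likely hypothesis of one engine."""
    ranked = sorted(hyps, key=lambda h: h.score, reverse=True)
    return ranked[0].score - ranked[1].score


def accept_first(engine1: List[Hypothesis],
                 engine2: List[Hypothesis],
                 threshold1: float) -> Optional[str]:
    """Return engine 1's top word if the engines agree and the delta sum exceeds threshold1."""
    best1 = max(engine1, key=lambda h: h.score)
    best2 = max(engine2, key=lambda h: h.score)
    delta_sum = top_two_delta(engine1) + top_two_delta(engine2)  # delta 1 + delta 2
    # Assumed reading of "equal in likeliness": both engines pick the same word.
    if best1.word == best2.word and delta_sum > threshold1:
        return best1.word
    return None  # no decision under this rule


if __name__ == "__main__":
    e1 = [Hypothesis("call", 9.0), Hypothesis("cancel", 4.0)]
    e2 = [Hypothesis("call", 8.0), Hypothesis("dial", 5.0)]
    print(accept_first(e1, e2, threshold1=6.0))
```

Summing the two gaps means a confident engine can compensate for a less confident one, provided both still agree on the top candidate. The threshold value used here is arbitrary.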

10. A method for voice recognition, comprising:
- extracting speech parameters with an acoustic processor from digitized speech samples of an utterance;
  
  coupling a plurality of voice recognition engines to the acoustic processor; and
  
  producing a plurality of hypotheses from each voice recognition engine;
  
  comparing the most likely hypothesis of the first voice recognition engine to the second most likely hypothesis of the first voice recognition engine to form a first difference, delta 1;
  
  comparing the most likely hypothesis of the second voice recognition engine to the second most likely hypothesis of the second voice recognition engine to form a second difference, delta 2;
  
  adding delta 1 and delta 2 to form a delta sum; and
  
  accepting the most likely hypothesis of the first voice recognition engine if the most likely hypothesis of the first voice recognition engine is equal in likeliness to the most likely hypothesis of the first voice recognition engine and the delta sum is greater than a first predetermined threshold.
- Dependent Claims (11, 12, 13, 14, 15)
- - 11. A method as in claim 10 wherein the most likely hypothesis of the first voice recognition engine is not equal in likeliness to the most likely hypothesis of the first voice recognition engine and/or the delta sum is not greater than a predetermined threshold, the method further comprising:
  - 12. A method as in claim 11 wherein the most likely hypothesis of the first voice recognition engine is not equal in likeliness to the most likely hypothesis of the first voice recognition engine and/or the delta sum is not greater than a predetermined threshold, the method further comprising:
    - comparing the most likely hypothesis of the second voice recognition engine to the second most likely hypothesis of the first voice recognition engine and, if the likeliness of the most likely hypothesis of the second voice recognition engine is equal to the likeliness of the second most likely hypothesis of the first voice recognition engine and delta 2 is greater than a third predetermined threshold, accepting the most likely hypothesis of the second voice recognition engine.
  - 13. The method of claim 10 wherein the voice recognition engines are selected from the group consisting of speaker independent Dynamic Time Warping, speaker independent Hidden Markov Model, speaker dependent Dynamic Time Warping, speaker dependent Hidden Markov Model.
  - 14. The method of claim 11 wherein the voice recognition engines are selected from the group consisting of speaker independent Dynamic Time Warping, speaker independent Hidden Markov Model, speaker dependent Dynamic Time Warping, speaker dependent Hidden Markov Model.
  - 15. The method of claim 12 wherein the voice recognition engines are selected from the group consisting of speaker independent Dynamic Time Warping, speaker independent Hidden Markov Model, speaker dependent Dynamic Time Warping, speaker dependent Hidden Markov Model.
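
The fallback step recited in claim 12 can be sketched as follows. This is an illustration under assumptions rather than the claimed method itself: hypotheses are modeled as `(word, score)` tuples with higher scores meaning more likely, "equal in likeliness" is read as "the same word", and delta 2 is taken as the gap between the second engine's two best scores. The function names `ranked` and `fallback_accept_second` are hypothetical. The further step of claim 11 is not reproduced in the text above, so it is not modeled here.

```python
from typing import List, Optional, Tuple

Hyp = Tuple[str, float]  # (word, score); higher score = more likely (assumed)


def ranked(hyps: List[Hyp]) -> List[Hyp]:
    """Order one engine's hypotheses from most to least likely."""
    return sorted(hyps, key=lambda h: h[1], reverse=True)


def fallback_accept_second(engine1: List[Hyp],
                           engine2: List[Hyp],
                           threshold3: float) -> Optional[str]:
    """Accept engine 2's top word if it matches engine 1's runner-up and delta 2 is large enough."""
    r1, r2 = ranked(engine1), ranked(engine2)
    delta2 = r2[0][1] - r2[1][1]  # gap between engine 2's best and second best
    # Assumed reading of "equal in likeliness": the two hypotheses name the same word.
    if r2[0][0] == r1[1][0] and delta2 > threshold3:
        return r2[0][0]
    return None  # rule does not fire; a caller might reject or try another rule


if __name__ == "__main__":
    e1 = [("call", 9.0), ("dial", 7.0)]
    e2 = [("dial", 8.0), ("call", 3.0)]
    print(fallback_accept_second(e1, e2, threshold3=4.0))
```

In this reading the rule only applies after the primary agreement test has failed, so a caller would typically try the claim 10 rule first and fall through to this one.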

Current Assignee: Qualcomm, Inc.
Original Assignee: Qualcomm, Inc.
Inventors: Qi, Yingyong; Garudadri, Harinath; Oses, David Puig; Bi, Ning
Primary Examiner(s): To, Doris H.
Assistant Examiner(s): Opsasnick, Michael N.
Application Number: US09/618,177
Time in Patent Office: 1,260 Days
Field of Search: 704/255
US Class Current: 704/255
CPC Class Codes: G10L 15/32 Multiple recognisers used i...