Adaptive multi-pass speech recognition system

US 20060178879A1
Filed: 04/04/2006
Published: 08/10/2006
Est. Priority Date: 04/20/1999
Status: Active Grant

First Claim

Patent Images

1-15. -15. (canceled)

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Method and apparatus for multi-pass speech recognition. An input device receives spoken input. A processor performs a first pass speech recognition technique on the spoken input and forms first pass results. The first pass results include a number of alternative speech expressions, each having an assigned score related to the certainty that the corresponding expression correctly matches the spoken input. The processor selectively performs a second pass speech recognition technique on the spoken input according to the first pass results. Preferably, the second pass attempts to correctly match the spoken input to only those expressions which were identified during the first pass. Otherwise, if one of the expressions identified by the first pass is assigned a score higher than a predetermined threshold (e.g., 95%), the second pass is not performed. Because the second pass is performed only when necessary, the invention recognizes speech with a faster average speed for a given accuracy in comparison to prior systems. Alternately, the first pass results identify a characteristic of the spoken input. The characteristic can be the gender of the speaker or a type of telephone the speaker is calling from. In which case, the second pass speech recognition technique is selected from a plurality of speech recognition techniques according to the characteristic identified by the first pass. Because the selected second pass technique is specific to the characteristic of the spoken input, the second pass technique can perform speech recognition faster for a given accuracy than a technique which is not specific.

60 Citations

View as Search Results

58 Claims

1-15. -15. (canceled)

16. A speech recognition system for recognizing spoken input received from a source of the spoken input coupled to the speech recognition system wherein the speech recognition system comprises:
- a. input means for receiving the spoken input from the source of the spoken input; and
  
  b. processing means coupled to the input means for performing a first pass speech recognition technique on the spoken input and for forming first pass results;
  
  wherein a characteristic of the spoken input is identified based upon the first pass results, the processing means selectively performs a second pass speech recognition technique on the spoken input according to the first pass results, the second pass speech recognition technique is selected from a plurality of speech recognition techniques according to the characteristic of the spoken input identified by the first pass, and further the characteristic of the spoken input is gender of a speaker of the spoken input.

17-18. -18. (canceled)

19. A speech recognition system for recognizing spoken input received from a source of the spoken input coupled to the speech recognition system wherein the speech recognition system comprises:
- a. input means for receiving the spoken input from the source of the spoken input, and b. processing means coupled to the input means for performing a first pass speech recognition technique on the spoken input and for forming first pass results;
  
  wherein the processing means selectively performs a second pass speech recognition technique on the spoken input according to the first pass results, and further the first pass results identify the spoken input as being in one of the following three categories;
  
  (1) originating from a female speaker;
  
  (2) originating from a male speaker; and
  
  (3) originating from a hands-free telephone where a speaker of the spoken input is female or male.

20-40. -40. (canceled)

41. A method of recognizing spoken input received from a source of the spoken input wherein the method comprises steps of:
- a. receiving the spoken input from the source of the spoken input;
  
  b. performing a first pass speech recognition technique on the spoken input;
  
  c. forming first pass results; and
  
  d. selectively performing a second pass speech recognition technique on the spoken input according to the first pass results;
  
  wherein the first pass results identify a characteristic of the spoken input, the second pass speech recognition technique is selected from a plurality of speech recognition techniques according to the characteristic of the spoken input identified by the first pass, and further the characteristic of the spoken input is gender of a speaker of the spoken input.

42-43. -43. (canceled)

44. A method of recognizing spoken input received from a source of the spoken input wherein the method comprises steps of:
- a. receiving the spoken input from the source of the spoken input;
  
  b. performing a first pass speech recognition technique on the spoken input;
  
  c. forming first pass results; and
  
  d. selectively performing a second pass speech recognition technique on the spoken input according to the first pass results;
  
  wherein the first pass results identify the spoken input as being in one of the following three categories;
  
  (1) originating from a female speaker;
  
  (2) originating from a male speaker; and
  
  (3) originating from a hands-free telephone where a speaker of the spoken input is female or male.

45-52. -52. (canceled)

53. A method of recognizing spoken input received from a source of the spoken input wherein the method comprises steps of:
- a. receiving the spoken input from the source of the spoken input;
  
  b. performing a first pass speech recognition technique on the spoken input;
  
  c. forming first pass results wherein the first pass results identify a speech expression as corresponding to the spoken input with a corresponding score and also identify a characteristic of the spoken input; and
  
  d. performing a second pass speech recognition technique on the spoken input when the corresponding score is below a predetermined threshold and wherein the second pass speech recognition technique is selected from a plurality of speech recognition techniques according to the identified characteristic;
  
  wherein the characteristic of the spoken input is gender of a speaker of the spoken input.

54. A method of recognizing spoken input received from a source of the spoken input wherein the method comprises steps of:
- a. receiving the spoken input from the source of the spoken input;
  
  b. performing a first pass speech recognition technique on the spoken input;
  
  c. forming first pass results wherein the first pass results identify a speech expression as corresponding to the spoken input with a corresponding score and also identify a characteristic of the spoken input; and
  
  d. performing a second pass speech recognition technique on the spoken input when the corresponding score is below a predetermined threshold and wherein the second pass speech recognition technique is selected from a plurality of speech recognition techniques according to the identified characteristic;
  
  wherein the characteristic of the spoken input is a type of telephone channel a speaker of the spoken input is calling from.

55. (canceled)

56. A method of recognizing spoken input received from a source of the spoken input wherein the method comprises steps of:
- a. receiving the spoken input from the source of the spoken input;
  
  b. performing a first pass speech recognition technique on the spoken input;
  
  c. forming first pass results wherein the first pass results identify a plurality of speech expressions as corresponding to the spoken input each speech expression having a corresponding score and also identify a characteristic of the spoken input; and
  
  d. performing a second pass speech recognition technique on the spoken input when a difference between two highest of the scores does not exceed a predetermined threshold and wherein the second pass speech recognition technique is selected from a plurality of speech recognition techniques according to the identified characteristic;
  
  wherein the characteristic of the spoken input is gender of a speaker of the spoken input.

57. A method of recognizing spoken input received from a source of the spoken input wherein the method comprises steps of:
- a. receiving the spoken input from the source of the spoken input;
  
  b. performing a first pass speech recognition technique on the spoken input;
  
  c. forming first pass results wherein the first pass results identify a plurality of speech expressions as corresponding to the spoken input each speech expression having a corresponding score and also identify a characteristic of the spoken input; and
  
  d. performing a second pass speech recognition technique on the spoken input when a difference between two highest of the scores does not exceed a predetermined threshold and wherein the second pass speech recognition technique is selected from a plurality of speech recognition techniques according to the identified characteristic;
  
  wherein the characteristic of the spoken input is a type of telephone channel a speaker of the spoken input is calling from.

58-60. -60. (canceled)

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Nuance Communications, Inc. (Microsoft Corporation)
Original Assignee
Nuance Communications, Inc. (Microsoft Corporation)
Inventors
Kannan, Ashvin, Murveit, Hy, Leggetter, Chris, Shahshahani, Ben, Knill, Katherine

Granted Patent

US 7,401,017 B2
Time in Patent Office

Days
Field of Search
US Class Current

704/231
CPC Class Codes

G10L 15/08 Speech classification or se...

G10L 2015/085 Methods for reducing search...

Adaptive multi-pass speech recognition system

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

60 Citations

58 Claims

Specification

Solutions

Use Cases

Quick Links

Adaptive multi-pass speech recognition system

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

60 Citations

58 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links