LEVERAGING INTERACTION CONTEXT TO IMPROVE RECOGNITION CONFIDENCE SCORES

US 20150046163A1
Filed: 10/23/2014
Published: 02/12/2015
Est. Priority Date: 10/27/2010
Status: Active Grant

Abstract

On a computing device, a speech utterance is received from a user. The speech utterance is a section of a speech dialog that includes a plurality of speech utterances. One or more features from the speech utterance are identified. Each identified feature from the speech utterance is a specific characteristic of the speech utterance. One or more features from the speech dialog are identified. Each identified feature from the speech dialog is associated with one or more events in the speech dialog. The one or more events occur prior to the speech utterance. One or more identified features from the speech utterance and one or more identified features from the speech dialog are used to calculate a confidence score for the speech utterance.

30 Claims

1-10. (canceled)

11. A method for improving speech recognition on a computing device, the method comprising:
- on the computing device, obtaining a log file for a speech dialog, the speech dialog including one or more speech utterances;
  
  on the computing device, automatically extracting from the log file one or more dialog-level features from the speech dialog, each dialog-level feature being a specific characteristic of the speech dialog;
  
  on the computing device, automatically extracting from the log file a value associated with each of the one or more dialog-level features from the speech dialog;
  
  on the computing device, obtaining from the log file one or more features associated with the one or more speech utterances;
  
  on the computing device, obtaining from the log file a first confidence score associated with the one or more features associated with the one or more speech utterances; and
  
  on the computing device, using values associated with the one or more dialog-level features to adjust the first confidence score associated with one or more of the speech utterances.
- Dependent Claims (12, 15, 16, 17, 18)
  - 12. The method of claim 11, wherein the dialog-level features include a position of each utterance in the speech dialog.
  - 15. The method of claim 11, wherein the adjusting of the first confidence score is associated with recalibrating a confidence classifier module.
  - 16. The method of claim 11, wherein the one or more features associated with the one or more speech utterances includes a degree to which an acoustic match is determined for each speech utterance.
  - 17. The method of claim 11, wherein the one or more features associated with the one or more speech utterances includes a noise of an acoustic signal associated with each speech utterance.
  - 18. The method of claim 11, wherein the one or more features associated with the one or more speech utterances includes a degree to which a first recognition for a speech utterance is similar to a second recognition for the speech utterance.
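
As an illustration only (not taken from the patent text), the steps of claim 11 can be sketched in Python: read a dialog log, derive the utterance position named in claim 12 as a dialog-level feature, and use its value to adjust the logged first confidence score. The log layout, the field names (`utterance_id`, `acoustic_match`, `first_confidence`), the function names, and the weight `position_weight` are all assumptions made for this sketch. The choice to lower confidence for later utterances is arbitrary; the claims do not state a direction or a formula.

```python
import json
import math

# Hypothetical log: one JSON record per utterance, listed in dialog order.
SAMPLE_LOG = """\
{"utterance_id": "u1", "acoustic_match": 0.82, "first_confidence": 0.70}
{"utterance_id": "u2", "acoustic_match": 0.64, "first_confidence": 0.70}
{"utterance_id": "u3", "acoustic_match": 0.91, "first_confidence": 0.70}
"""


def logit(p, eps=1e-6):
    """Map a probability to log-odds, clamping to avoid division by zero."""
    p = min(max(p, eps), 1.0 - eps)
    return math.log(p / (1.0 - p))


def sigmoid(x):
    """Map log-odds back to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def extract_positions(records):
    """Dialog-level feature: 0-based position of each utterance in the dialog."""
    return {rec["utterance_id"]: i for i, rec in enumerate(records)}


def adjust_confidences(log_text, position_weight=-0.3):
    """Shift each logged first confidence score by a position-dependent term.

    position_weight is an illustrative, hand-picked value (an assumption),
    not a parameter described in the source.
    """
    records = [json.loads(line) for line in log_text.splitlines() if line.strip()]
    positions = extract_positions(records)
    adjusted = {}
    for rec in records:
        shift = position_weight * positions[rec["utterance_id"]]
        adjusted[rec["utterance_id"]] = sigmoid(logit(rec["first_confidence"]) + shift)
    return adjusted


if __name__ == "__main__":
    for utterance_id, score in adjust_confidences(SAMPLE_LOG).items():
        print(utterance_id, round(score, 3))
```

Working in log-odds keeps the adjusted score inside the interval (0, 1) regardless of how large the shift is.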

19. A computer-readable storage medium comprising instructions that, when executed by a computing device, cause the computing device to:
- receive a speech utterance from a user, the speech utterance being a section of a speech dialog, the speech dialog including a plurality of speech utterances;
  
  identify one or more features from the speech utterance, each identified feature from the speech utterance being a specific characteristic of the speech utterance;
  
  identify one or more features from the speech dialog, each identified feature from the speech dialog being associated with one or more events in the speech dialog, the one or more events occurring prior to the speech utterance;
  
  input the one or more features of the speech utterance to a first confidence classifier module;
  
  use the first confidence classifier module to calculate a first confidence score for the speech utterance;
  
  input the first confidence score for the speech utterance to a second confidence classifier module;
  
  input the one or more features from the speech dialog to the second confidence classifier module; and
  
  use the first confidence score and the one or more features from the speech dialog to calculate a second confidence score for the speech utterance.
- Dependent Claims (20)
  - 20. The method of claim 19, wherein the first confidence classifier module and the second confidence classifier module are the same confidence classifier module.
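
The two-stage arrangement in claim 19 can be pictured with a minimal Python sketch, offered as one possible reading rather than the patented implementation. A first classifier scores the utterance from its own features; a second classifier takes that first score together with features of earlier dialog events and produces a second score. The `LogisticClassifier` class, the feature names (`acoustic_match`, `noise_level`, `prior_reprompts`), and every weight below are hypothetical values chosen for illustration.

```python
import math
from dataclasses import dataclass
from typing import Dict, Tuple


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


@dataclass
class LogisticClassifier:
    """Tiny logistic scorer with fixed weights (stand-in for a trained model)."""
    weights: Dict[str, float]
    bias: float = 0.0

    def score(self, features: Dict[str, float]) -> float:
        z = self.bias + sum(
            w * features.get(name, 0.0) for name, w in self.weights.items()
        )
        return sigmoid(z)


# All weights are made up for this sketch; a real system would learn them.
first_stage = LogisticClassifier({"acoustic_match": 3.0, "noise_level": -2.0}, bias=-1.0)
second_stage = LogisticClassifier({"first_confidence": 4.0, "prior_reprompts": -0.8}, bias=-2.0)


def second_confidence(
    utterance_features: Dict[str, float],
    dialog_features: Dict[str, float],
) -> Tuple[float, float]:
    """Return (first score, second score) for one utterance."""
    first = first_stage.score(utterance_features)
    # The second stage sees the first score plus dialog-level features
    # derived from events that occurred before this utterance.
    second_inputs = {"first_confidence": first, **dialog_features}
    return first, second_stage.score(second_inputs)


if __name__ == "__main__":
    utterance = {"acoustic_match": 0.9, "noise_level": 0.1}
    print(second_confidence(utterance, {"prior_reprompts": 0}))
    print(second_confidence(utterance, {"prior_reprompts": 3}))
```

With these illustrative weights, the same utterance receives the same first score in both calls, while the second score drops when more re-prompts preceded it.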

21. A system for improving speech recognition on a computing device, the system comprising:
- a confidence classifier module for:
  
  obtaining a log file for a speech dialog, the speech dialog including one or more speech utterances;
  
  automatically extracting from the log file one or more dialog-level features from the speech dialog, each dialog-level feature being a specific characteristic of the speech dialog, and wherein the one or more dialog-level features include at least a position of each utterance in the speech dialog;
  
  automatically extracting from the log file a value associated with each of the one or more dialog-level features from the speech dialog;
  
  obtaining from the log file one or more features associated with the one or more speech utterances;
  
  obtaining from the log file a first confidence score associated with the one or more features associated with the one or more speech utterances; and
  
  using values associated with the one or more dialog-level features to adjust the first confidence score associated with one or more of the speech utterances.
- Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
  - 22. The system of claim 21, wherein the dialog-level features further include at least a degree in which re-prompting occurred for one or more utterances in the speech dialog.
  - 23. The system of claim 21, wherein the adjusting of the first confidence score is associated with recalibrating one or more speech recognition models on the computing device.
  - 24. The system of claim 21, wherein the adjusting of the first confidence score is associated with recalibrating a confidence classifier module.
  - 25. The system of claim 21, wherein the one or more features associated with the one or more speech utterances includes a degree to which an acoustic match is determined for each speech utterance.
  - 26. The system of claim 21, wherein the one or more features associated with the one or more speech utterances includes a noise of an acoustic signal associated with each speech utterance.
  - 27. The system of claim 21, wherein the one or more features associated with the one or more speech utterances includes a degree to which a first recognition for a speech utterance is similar to a second recognition for the speech utterance.
  - 28. The system of claim 21, wherein the log file includes contextual information for the one or more speech utterances.
  - 29. The system of claim 28, wherein the contextual information includes information from previous and future speech utterances in the speech dialog.
  - 30. The system of claim 21, wherein the speech dialog includes a plurality of dialog events, and wherein the one or more dialog-level features are derived from the log file for a first dialog event of the plurality of dialog events occurring previous to a current speech utterance and for a second dialog event of the plurality of dialog events occurring after the current speech utterance.
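
Claims 21, 22, 29, and 30 refer to dialog-level features such as utterance position, degree of re-prompting, and events both before and after the current utterance. A hedged Python sketch of how such features might be pulled from a logged event sequence follows. The event schema (`type`, `id`), the event type names, and the `context_features` function are assumptions for illustration; features from later events are only available offline, once the whole dialog has been logged.

```python
from typing import Dict, List

# Hypothetical event log for one dialog, in chronological order.
EVENTS: List[Dict[str, str]] = [
    {"type": "prompt"},
    {"type": "utterance", "id": "u1"},
    {"type": "reprompt"},
    {"type": "utterance", "id": "u2"},
    {"type": "prompt"},
    {"type": "utterance", "id": "u3"},
    {"type": "reprompt"},
    {"type": "reprompt"},
    {"type": "utterance", "id": "u4"},
]


def context_features(events: List[Dict[str, str]], utterance_id: str) -> Dict[str, int]:
    """Count dialog events before and after the named utterance."""
    idx = next(i for i, e in enumerate(events) if e.get("id") == utterance_id)
    before = events[:idx]
    after = events[idx + 1:]
    return {
        # Position of the utterance among all utterances (0-based).
        "position": sum(1 for e in before if e["type"] == "utterance"),
        # Re-prompts that preceded the utterance.
        "reprompts_before": sum(1 for e in before if e["type"] == "reprompt"),
        # Re-prompts that followed it (known only from the complete log).
        "reprompts_after": sum(1 for e in after if e["type"] == "reprompt"),
    }


if __name__ == "__main__":
    print(context_features(EVENTS, "u3"))
    # {'position': 2, 'reprompts_before': 1, 'reprompts_after': 2}
```

The resulting dictionary could serve as the dialog-level feature values that a confidence classifier uses when adjusting a logged first confidence score.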

Current Assignee: Microsoft Technology Licensing LLC (Microsoft Corporation)
Original Assignee: Microsoft Corporation
Inventors: Levit, Michael; Buntschuh, Bruce Melvin
Granted Patent: US 9,542,931 B2
US Class Current: 704/254
CPC Class Codes:
- G10L 15/01   Assessment or evaluation of...
- G10L 15/08   Speech classification or se...
- G10L 15/22   Procedures used during a sp...
