Automated learning for speech-based applications

US 10,102,847 B2
Filed: 08/12/2016
Issued: 10/16/2018
Est. Priority Date: 09/11/2008
Status: Active Grant

First Claim

Patent Images

1. A method comprising:

receiving, by a computer-based speech recognition system that is communicatively coupled to a communication network, a speech input including one or more words or phrases, the computer-based speech recognition system comprising at least one processor and at least one memory device, wherein the speech input is from a call to a call center via the communication network;

determining, by the computer-based speech recognition system, to provide the speech input to a human;

receiving a response from the human that identifies a first task for the speech input, the first task including a first transcription of the speech input;

performing the first task that is identified by the human for the speech input;

processing the speech input to identify a second task for the speech input, the processing using a set of internal representations of the computer-based speech recognition system, the set of internal representations comprising one or more machine-readable parameters for recognizing speech in a speech utterance, the second task including a second transcription of the speech input;

comparing the first transcription of the speech input included in the first task identified by the human with the second transcription of the speech input included in the second task identified by the computer-based recognition system to determine one or more differences between the first transcription and the second transcription;

modifying the set of internal representations of the computer-based speech recognition system based at least in part on the one or more differences between the first transcription and the second transcription to create a modified set of internal representations, wherein the modifying includes adjusting at least a portion of the set of internal representations;

checking the performance of the modified set of internal representations to prevent the modification from degrading the set internal representations, wherein the checking comprises determining that a performance difference between the set of internal representations before and after modification is within a margin of error;

receiving, by the computer-based speech recognition system, another input; and

processing, by the at least one processor and based at least in part on the modified set of internal representations, the other input to identify a third task for the other input.

View all claims

3 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Systems and methods for modifying a computer-based speech recognition system. A speech utterance is processed with the computer-based speech recognition system using a set of internal representations, which may comprise parameters for recognizing speech in a speech utterance, such as parameters of an acoustic model and/or a language model. The computer-based speech recognition system may perform a first task in response to the processed speech utterance. The utterance may also be provided to a human who performs a second task based on the utterance. Data indicative of the first task, performed by the computer system, is compared to data indicative of a second task, performed by the human in response to the speech utterance. Based on the comparison, the set of internal representations may be updated or modified to improve the speech recognition performance and capabilities of the speech recognition system.

26 Citations

19 Claims

1. A method comprising:
- receiving, by a computer-based speech recognition system that is communicatively coupled to a communication network, a speech input including one or more words or phrases, the computer-based speech recognition system comprising at least one processor and at least one memory device, wherein the speech input is from a call to a call center via the communication network;
  
  determining, by the computer-based speech recognition system, to provide the speech input to a human;
  
  receiving a response from the human that identifies a first task for the speech input, the first task including a first transcription of the speech input;
  
  performing the first task that is identified by the human for the speech input;
  
  processing the speech input to identify a second task for the speech input, the processing using a set of internal representations of the computer-based speech recognition system, the set of internal representations comprising one or more machine-readable parameters for recognizing speech in a speech utterance, the second task including a second transcription of the speech input;
  
  comparing the first transcription of the speech input included in the first task identified by the human with the second transcription of the speech input included in the second task identified by the computer-based recognition system to determine one or more differences between the first transcription and the second transcription;
  
  modifying the set of internal representations of the computer-based speech recognition system based at least in part on the one or more differences between the first transcription and the second transcription to create a modified set of internal representations, wherein the modifying includes adjusting at least a portion of the set of internal representations;
  
  checking the performance of the modified set of internal representations to prevent the modification from degrading the set internal representations, wherein the checking comprises determining that a performance difference between the set of internal representations before and after modification is within a margin of error;
  
  receiving, by the computer-based speech recognition system, another input; and
  
  processing, by the at least one processor and based at least in part on the modified set of internal representations, the other input to identify a third task for the other input.
- View Dependent Claims (2, 3, 4, 5)
- - 2. The method of claim 1, wherein:
    - the input comprises a speech utterance.
  - 3. The method of claim 1, further comprising, in response to determining to provide the speech input to the human, providing the speech input to the human in real-time as the speech input is delivered to the computer-based speech recognition system.
  - 4. The method of claim 1, wherein comparing the first transcription identified by the human with the second transcription identified by the computer-based recognition system to determine one or more differences between the first transcription and the second transcription comprises determining that the second transcription is not within an acceptable margin of error to the first transcription.
  - 5. The method of claim 1, further comprising:
    - receiving, by the computer-based speech recognition system, another speech input; and
      
      processing, by the at least one processor and based at least partly on the modified set of internal representations, the other speech input to identify a particular transcription for the other speech input.

6. A computer-based speech recognition system, comprising:
- one or more processors communicatively coupled to a communication network; and
  
  memory storing instructions that, when executed by the one or more processors, cause the computer-based speech recognition system to perform acts comprising;
  
  obtaining a speech input including one or more words or phrases, wherein the speech input is from a call to a call center via the communication network;
  
  determining to provide the speech input to a human;
  
  receiving a response from the human that identifies a first task for the input, the first task including a first transcription of the speech input;
  
  processing the speech input to identify a second task for the speech input, the processing using a set of internal representations of the computer-based speech recognition system, the set of internal representations comprising one or more machine-readable parameters for recognizing speech in a speech utterance, the second task including a second transcription of the speech input;
  
  comparing the speech input and the first transcription of the speech input included in the first task with the second transcription of the speech input included in the second task, the set of internal representations comprising one or more machine-readable parameters for recognizing speech in a speech utterance;
  
  modifying the set of internal representations of the computer-based speech recognition system based at least in part on the comparing the first transcription with the second transcription to create a modified set of internal representations, wherein the modifying includes adjusting at least a portion of the set of internal representations;
  
  checking the performance of the modified set of internal representations to prevent the modification from degrading the set internal representations, wherein the checking comprises determining that a performance difference between the set of internal representations before and after modification is within a margin of error;
  
  receiving, by the computer-based speech recognition system, another input; and
  
  processing, by the one or more processors and based at least in part on the modified set of internal representations, the other input to identify a third task for the other input.
- View Dependent Claims (7, 8, 9, 10, 11, 12)
- - 7. The computer-based speech recognition system of claim 6, the acts further comprising processing the speech input to identify a second task for the input, the processing using the set of internal representations of the computer-based speech recognition system.
  - 8. The computer-based speech recognition system of claim 7, wherein comparing the speech input and the first task with the set of internal representations of the computer-based speech recognition system comprises comparing the first transcription of the speech input included in the first task and the second transcription of the speech input included in the second task to determine one or more differences between the transcription and the second transcription.
  - 9. The computer-based speech recognition system of claim 8, wherein the modifying the set of internal representations is based at least in part one the one or more differences between the first transcription and the second transcription.
  - 10. The computer-based speech recognition system of claim 6, wherein:
    - the speech input comprises a speech utterance.
  - 11. The computer-based speech recognition system of claim 6, the acts further comprising, in response to determining to provide the speech input to the human, providing the speech input to the human in real-time as the speech input is delivered to the computer-based speech recognition system.
  - 12. The computer-based speech recognition system of claim 6, the acts further comprising performing the first task that is identified by the human for the speech input, wherein at least a portion of the first task that is identified by the human is performed by the human.

13. One or more non-transitory computer-readable storage media storing instructions that, when executed by one or more processors, communicatively coupled to a communication network, configure the one or more processors to perform acts comprising:
- receiving, by a computer-based speech recognition system, a speech input including one or more words or phrases, wherein the speech input is from a call to a call center via the communication network;
  
  determining, by the computer-based speech recognition system, to provide the speech input to a human;
  
  receiving a response from the human that identifies a first task for the speech input, the first task including a first transcription of the speech input;
  
  processing the speech input to identify a second task for the speech input, the processing using a set of internal representations of the computer-based speech recognition system, the set of internal representations comprising one or more machine-readable parameters for recognizing speech in a speech utterance, the second task including a second transcription of the speech input;
  
  comparing the speech input and the first transcription of the speech input included in the first task with the second transcription of the speech input included in the second task, the set of internal representations comprising one or more machine-readable parameters for recognizing speech in a speech utterance;
  
  modifying the set of internal representations of the computer-based speech recognition system based at least in part on the comparing the first transcription with the second transcription to create a modified set of internal representations, wherein the modifying includes adjusting at least a portion of the set of internal representations;
  
  checking the performance of the modified set of internal representations to prevent the modification from degrading the set internal representations, wherein the checking comprises determining that a performance difference between the set of internal representations before and after modification is within a margin of error;
  
  receiving, by the computer-based speech recognition system, another input; and
  
  processing, by the one or more processors and based at least in part on the modified set of internal representations, the other input to identify a third task for the other input.
- View Dependent Claims (14, 15, 16, 17, 18, 19)
- - 14. The one or more non-transitory computer-readable storage media of claim 13, the acts further comprising processing the speech input to identify a second task for the speech input, the processing using the set of internal representations of the computer-based speech recognition system.
  - 15. The one or more non-transitory computer-readable storage media of claim 14, wherein comparing the speech input and the first transcription of the speech input included in the first task with the set of internal representations of the computer-based speech recognition system comprises comparing the first transcription and the second transcription of the speech input included in the second task to determine one or more differences between the first task and the second task.
  - 16. The computer-based speech recognition system of claim 15, wherein the modifying the set of internal representations is based at least in part one the one or more differences between the first transcription and the second transcription.
  - 17. The computer-based speech recognition system of claim 13, wherein:
    - the speech input comprises a speech utterance.
  - 18. The computer-based speech recognition system of claim 13, the acts further comprising, in response to determining to provide the input to the human, providing the speech input to the human in real-time as the speech input is delivered to the computer-based speech recognition system.
  - 19. The computer-based speech recognition system of claim 13, the acts further comprising performing the first task that is identified by the human for the speech input, wherein at least a portion of the first task that is identified by the human is performed by the human.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Verint Americas Incorporated (Verint Systems Incorporated)
Original Assignee
Verint Americas Incorporated (Verint Systems Incorporated)
Inventors
Wooters, Charles C
Primary Examiner(s)
Jackson, Jakieda

Application Number

US15/235,961
Publication Number

US 20160351186A1
Time in Patent Office

795 Days
Field of Search

704235, 704 9
US Class Current
CPC Class Codes

G10L 15/065   Adaptation

G10L 15/08   Speech classification or se...

G10L 15/18   using natural language mode...

G10L 15/22   Procedures used during a sp...

G10L 15/26   Speech to text systems G10L...

G10L 2015/0638   Interactive procedures

G10L 2015/223   Execution procedure of a sp...

Automated learning for speech-based applications

First Claim

3 Assignments

0 Petitions

Accused Products

Abstract

26 Citations

19 Claims

Specification

Solutions

Use Cases

Quick Links

Automated learning for speech-based applications

First Claim

3 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

26 Citations

19 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links