Language model customization in speech recognition for speech analytics

US 10,643,604 B2
Filed: 12/13/2018
Issued: 05/05/2020
Est. Priority Date: 01/16/2016
Status: Active Grant

First Claim

Patent Images

1. A method for performing voice analytics on interactions with an organization, comprising:

training a customized language model for the organization by;

receiving, by a speech recognition engine, organization-specific training data and generic training data;

computing, by the speech recognition engine, a plurality of similarities between the generic training data and the organization-specific training data;

assigning, by the speech recognition engine, a plurality of weights to the generic training data through partitioning the generic training data into a plurality of partitions in accordance with the computed similarities wherein the computed similarities comprise a fixed set of one or more threshold similarities, associating a partition similarity with each of the partitions, the partition similarity corresponding to the average similarity of the data in the partition, and assigning a desired weight to each partition, the desired weight corresponding to the partition similarity of the partition;

combining, by the speech recognition engine, the generic training data with the organization-specific training data in accordance with the weights to generate customized training data;

training, by the speech recognition engine, the customized language model using the customized training data; and

outputting, by the speech recognition engine, the customized language model, the customized language model being configured to compute a likelihood of phrases in a medium;

receiving, by the speech recognition engine, an input speech from an interaction between a customer and an agent of the organization; and

performing voice analytics on the received input speech.

View all claims

3 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method for generating a language model for an organization includes: receiving, by a processor, organization-specific training data; receiving, by the processor, generic training data; computing, by the processor, a plurality of similarities between the generic training data and the organization-specific training data; assigning, by the processor, a plurality of weights to the generic training data in accordance with the computed similarities; combining, by the processor, the generic training data with the organization-specific training data in accordance with the weights to generate customized training data; training, by the processor, a customized language model using the customized training data; and outputting, by the processor, the customized language model, the customized language model being configured to compute the likelihood of phrases in a medium.

Citations

16 Claims

1. A method for performing voice analytics on interactions with an organization, comprising:
- training a customized language model for the organization by;
  
  receiving, by a speech recognition engine, organization-specific training data and generic training data;
  
  computing, by the speech recognition engine, a plurality of similarities between the generic training data and the organization-specific training data;
  
  assigning, by the speech recognition engine, a plurality of weights to the generic training data through partitioning the generic training data into a plurality of partitions in accordance with the computed similarities wherein the computed similarities comprise a fixed set of one or more threshold similarities, associating a partition similarity with each of the partitions, the partition similarity corresponding to the average similarity of the data in the partition, and assigning a desired weight to each partition, the desired weight corresponding to the partition similarity of the partition;
  
  combining, by the speech recognition engine, the generic training data with the organization-specific training data in accordance with the weights to generate customized training data;
  
  training, by the speech recognition engine, the customized language model using the customized training data; and
  
  outputting, by the speech recognition engine, the customized language model, the customized language model being configured to compute a likelihood of phrases in a medium;
  
  receiving, by the speech recognition engine, an input speech from an interaction between a customer and an agent of the organization; and
  
  performing voice analytics on the received input speech.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
- - 2. The method of claim 1, wherein a silhouette score is used to determine a number of the plurality of partitions.
  - 3. The method of claim 1, wherein a test set of the generic training data and the organization-specific training data empirically determine a number of the plurality of partitions.
  - 4. The method of claim 1, wherein k-means clustering is used to determine a number of the plurality of partitions.
  - 5. The method of claim 1, wherein the desired weight of a partition is exponentially decreasing with decreasing partition similarity.
  - 6. The method of claim 1, wherein the training a customized language model for the organization further comprise:
    - receiving organization-specific in-medium data;
      
      combining the organization-specific in-medium data with the generic training data and the organization-specific training data to generate the customized training data; and
      
      retraining the language model in accordance with the customized training data.
  - 7. The method of claim 1, wherein the organization-specific training data comprise at least one of:
    - in-medium data and out-of-medium data.
  - 8. The method of claim 7, wherein the in-medium data comprise speech recognition transcript text and the out-of-medium data comprise non-speech text.

9. A voice analytics system comprising:
- a speech model training system comprising;
  
  a processor; and
  
  memory coupled to the processor and storing instructions that, when executed by the processor, cause the processor to;
  
  receive organization-specific training data and generic training data;
  
  compute a plurality of similarities between the generic training data and the organization-specific training data;
  
  assign a plurality of weights to the generic training data through partitioning the generic training data into a plurality of partitions in accordance with the computed similarities wherein the computed similarities comprise a fixed set of one or more threshold similarities, associating a partition similarity with each of the partitions, the partition similarity corresponding to the average similarity of the data in the partition, and assigning a desired weight to each partition, the desired weight corresponding to the partition similarity of the partition;
  
  combine the generic training data with the organization-specific training data in accordance with the weights to generate customized training data;
  
  train a customized language model using the customized training data; and
  
  output the customized language model, the customized language model being configured to compute the likelihood of phrases in a medium; and
  
  a speech analytics system configured to;
  
  receive an input speech from an interaction between a customer and an agent of the organization; and
  
  perform voice analytics on the received input speech.
- View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
- - 10. The speech recognition system of claim 9, wherein a silhouette score is used to determine a number of the plurality of partitions.
  - 11. The speech recognition system of claim 9, wherein a test set of the generic training data and the organization-specific training data empirically determine a number of the plurality of partitions.
  - 12. The speech recognition system of claim 9, wherein k-means clustering is used to determine a number of the plurality of partitions.
  - 13. The speech recognition system of claim 9, wherein the desired weight of a partition is exponentially decreasing with decreasing partition similarity.
  - 14. The speech recognition system of claim 9, wherein the memory of the speech training model system further stores instructions that, when executed by the processor, cause the processor to:
    - receive organization-specific in-medium data;
      
      combine the organization-specific in-medium data with the generic training data and the organization-specific training data to generate the customized training data; and
      
      retrain the language model in accordance with the customized training data.
  - 15. The speech recognition system of claim 9, wherein the organization-specific training data comprise at least one of:
    - in-medium data and out-of-medium data.
  - 16. The speech recognition system of claim 15, wherein the in-medium data comprise speech recognition transcript text and the out-of-medium data comprise non-speech text.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Genesys Cloud Services Incorporated
Original Assignee
Genesys Telecommunications Laboratories Incorporated (Genesys Cloud Services Incorporated)
Inventors
Tapuhi, Tamir, Lev-Tov, Amir, Faizakof, Avraham, Konig, Yochai
Primary Examiner(s)
Bost, Dwayne D
Assistant Examiner(s)
Brinich, Stephen M

Application Number

US16/219,537
Publication Number

US 20190122653A1
Time in Patent Office

509 Days
Field of Search

704 1- 10, 704244, 704256-2568, 704250-252, 704255, 704266
US Class Current
CPC Class Codes

G06F 40/232   Orthographic correction, e....

G06N 20/00   Machine learning

G06N 3/006   based on simulated virtual ...

G10L 15/063   Training

G10L 15/183   using context dependencies,...

G10L 15/26   Speech to text systems G10L...

G10L 2015/0635   updating or merging of old ...

G10L 2015/0636   Threshold criteria for the ...

G10L 2015/088   Word spotting

Language model customization in speech recognition for speech analytics

First Claim

3 Assignments

0 Petitions

Accused Products

Abstract

Citations

16 Claims

Specification

Solutions

Use Cases

Quick Links

Language model customization in speech recognition for speech analytics

First Claim

3 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

16 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links