×

Data shredding for speech recognition language model training under data retention restrictions

  • US 9,514,740 B2
  • Filed: 03/13/2013
  • Issued: 12/06/2016
  • Est. Priority Date: 03/13/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method for training a language model of an automatic speech recognition system, the method comprising:

  • producing segments of text in a text corpus and counts corresponding to the segments of text, the text corpus being in a depersonalized state, the producing including dynamically shredding the text corpus into the segments of text in the depersonalized state;

    further depersonalizing the segments of text based on the corresponding counts, each count representing a number of occurrences of a respective segment of text in the text corpus; and

    enabling an automatic speech recognition system to train a language model using the segments of text in the depersonalized state and the counts.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×