Training an automatic speech recognition system using compressed word frequencies

  • US 8,543,398 B1
  • Filed: 11/01/2012
  • Issued: 09/24/2013
  • Est. Priority Date: 02/29/2012
  • Status: Active Grant
First Claim

1. A method comprising:

  • obtaining, at a computing system, respective word frequencies f_i from a corpus of utterance-to-text-string mappings, wherein the corpus of utterance-to-text-string mappings contains associations between audio utterances and respective text string transcriptions of the audio utterances, and wherein the respective word frequencies f_i are based on occurrences of words in the text string transcriptions;

    determining respective compressed word frequencies c_i by raising each of the respective word frequencies f_i to a power m, wherein m < 1 and c_i = f_i^m;

    selecting sample utterance-to-text-string mappings from the corpus of utterance-to-text-string mappings based on the respective compressed word frequencies c_i; and

    training an automatic speech recognition (ASR) system with the sample utterance-to-text-string mappings.
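
The steps of claim 1 can be illustrated with a short Python sketch. It is only one possible reading, not the patented implementation: the claim says samples are selected "based on" the compressed frequencies but does not say how, so the weighting rule in select_samples (the mean of c_i / f_i over the words in a transcription) is an assumption made for illustration. The function names, the value m = 0.5, the audio file names, and the toy corpus are likewise hypothetical. With m = 0.5, a word seen 100 times gets c_i = 10 while a word seen once keeps c_i = 1, so a 100:1 ratio compresses to 10:1.

```python
import random
from collections import Counter


def word_frequencies(transcripts):
    """Count how often each word occurs across the transcriptions (f_i)."""
    counts = Counter()
    for text in transcripts:
        counts.update(text.lower().split())
    return counts


def compress(frequencies, m=0.5):
    """Return c_i = f_i ** m for every word; the claim requires m < 1."""
    if not m < 1:
        raise ValueError("the claim requires m < 1")
    return {word: f ** m for word, f in frequencies.items()}


def select_samples(corpus, m=0.5, k=2, seed=0):
    """Draw k mappings (with replacement) from the corpus.

    Assumed rule, not taken from the claim: each mapping is weighted by
    the mean of c_i / f_i over its words, so transcriptions dominated by
    very frequent words are drawn less often.
    """
    freqs = word_frequencies(text for _, text in corpus)
    comp = compress(freqs, m)
    weights = []
    for _, text in corpus:
        ratios = [comp[w] / freqs[w] for w in text.lower().split()]
        weights.append(sum(ratios) / len(ratios) if ratios else 0.0)
    rng = random.Random(seed)
    return rng.choices(corpus, weights=weights, k=k)


if __name__ == "__main__":
    # Hypothetical corpus: (audio reference, text string transcription).
    corpus = [
        ("utt1.wav", "call mom"),
        ("utt2.wav", "call dad"),
        ("utt3.wav", "call the office"),
        ("utt4.wav", "navigate to the zoo"),
    ]
    freqs = word_frequencies(text for _, text in corpus)
    print(compress(freqs, m=0.5))
    print(select_samples(corpus, m=0.5, k=2))
```

The selected mappings would then be passed to whatever ASR training procedure is in use; that step is outside the scope of this sketch.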
