×

System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies

  • US 7,801,727 B2
  • Filed: 02/24/2005
  • Issued: 09/21/2010
  • Est. Priority Date: 03/17/1999
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method of analyzing a language for providing speech recognition, the method comprising steps of:

  • determining a threshold frequency of occurrence, within a corpus, of word forms in a vocabulary V for the language, by using at least one processor;

    in response to determining that a subset of the word forms has a frequency of occurrence in the corpus less than the threshold frequency, splitting at least some of the word forms in the subset to generate word form components, at least some of the word form components not being full words;

    generating a language component vocabulary VC comprising the word forms in the vocabulary V and the word form components; and

    generating and storing information indicating a correspondence between the word forms in the vocabulary V and corresponding word form components.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×