×

AUTOMATIC SPOKEN LANGUAGE IDENTIFICATION BASED ON PHONEME SEQUENCE PATTERNS

  • US 20110035219A1
  • Filed: 08/04/2009
  • Published: 02/10/2011
  • Est. Priority Date: 08/04/2009
  • Status: Active Grant
First Claim
Patent Images

1. A language identification engine, comprising:

  • a front end module having an input configured to receive an audio stream consisting of a spoken language of at least one of a set of two or more potential languages being spoken in the audio stream under analysis;

    a universal phoneme decoder that contains a universal phoneme set representing both

         1) all phonemes occurring in the set of two or more spoken languages, and

         2) captures phoneme correspondences across languages, such that a set of unique phoneme patterns and probabilities are calculated in order to identify a most likely phoneme occurring each time in the audio stream in the set of two or more potential languages in which the universal phoneme decoder was trained on;

    one or more statistical language models having logic configured to supply to a run-time language identifier module probabilities of how linguistically likely a particular uttered phoneme identified by the universal phoneme decoder comes from a particular spoken language based on an identified sequence of phonemes, wherein each statistical language model uses linguistic features from the identified phonemes from the universal phoneme decoder including the set of unique phoneme patterns created for each language in the set to distinguish between spoken human languages in the set of two or more spoken languages;

    a bank of human language specific databases for the one or more statistical language models to reference, where of the databases were filled with phoneme and phoneme sequences being trained on for a particular language in the set of two or more spoken languages, and each of the human language specific databases received the phoneme and phoneme sequences from a phone output from the same universal phoneme decoder independent of which spoken language in the set of two or more potential languages was being trained on; and

    the run-time language identifier module identifies a particular human language being spoken in the received audio stream in the set of two or more potential languages by utilizing the one or more statistical language models.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×