Operating method for an automated language recognizer intended for the speaker-independent language recognition of words in different languages and automated language recognizer

US 7,974,843 B2
Filed: 01/02/2003
Issued: 07/05/2011
Est. Priority Date: 01/17/2002
Status: Expired due to Fees

First Claim

Patent Images

1. A method for automated language recognition of words from different languages said method embodied as computer program instructions encoded in tangible, non-transitory computer readable media associated with a mobile device and comprising the steps of:

(a) loading a phoneme set associated with a language specified as a mother tongue into a mother tongue language recognizer;

(b) for each of a plurality of words, determining phonetic transcripts for the word for N various languages not specified as the mother tongue to generate N first phoneme sequences for the word corresponding to N first pronunciation variants, each of the N first phoneme sequences formed from phonemes associated with one of the N different languages;

(c) determining a phoneme map by mapping the generated first phoneme sequences of each of said N languages to a relevant phoneme set of the mother tongue;

(d) for each of the plurality of words, applying the phoneme map to each of the N first phoneme sequences for that word in order to translate the N first phoneme sequences into N second phoneme sequences, each of the N second phoneme sequences formed from phonemes associated with the mother tongue language,wherein each of the N first phoneme sequences of the N various language is translated into a corresponding second phoneme sequence of the mother tongue language (a) regardless of whether the mobile device includes a speech model for each of the N various languages, and (b) regardless of whether the mother tongue language is the most acoustically similar to each of the N various languages, with respect to the respective first and second phoneme sequences, andsuch that for each word, two different phonetic transcripts are generated for each of the N different languages, including (1) the N first phoneme sequences for the word, each formed from phonemes associated with one of the N different languages, and (2) the N second phoneme sequences for the word, each formed by applying the phoneme map to translate one of the N first phoneme sequences formed from phonemes associated with one of the N different languages into a sequence of phonemes associated with the mother tongue language; and

(e) processing said N second phoneme sequences with the phoneme set associated with the language specified as the mother tongue to identify at least one of a matching word and a similar word.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The invention relates to an operating method for an automated language recognizer intended for the speaker-independent language recognition of words from different languages, particularly for recognizing names from different languages. The method is based on a language defined as the mother tongue and has an input phase for establishing a language recognizer vocabulary. Phonetic transcripts are determined for words in various languages in order to obtain phoneme sequences for pronunciation variants. The phonemes of each relevant phoneme set of the mother tongue are then specifically mapped to determine phoneme sequences that correspond to pronunciation variants.

34 Citations

View as Search Results

18 Claims

1. A method for automated language recognition of words from different languages said method embodied as computer program instructions encoded in tangible, non-transitory computer readable media associated with a mobile device and comprising the steps of:
- (a) loading a phoneme set associated with a language specified as a mother tongue into a mother tongue language recognizer;
  
  (b) for each of a plurality of words, determining phonetic transcripts for the word for N various languages not specified as the mother tongue to generate N first phoneme sequences for the word corresponding to N first pronunciation variants, each of the N first phoneme sequences formed from phonemes associated with one of the N different languages;
  
  (c) determining a phoneme map by mapping the generated first phoneme sequences of each of said N languages to a relevant phoneme set of the mother tongue;
  
  (d) for each of the plurality of words, applying the phoneme map to each of the N first phoneme sequences for that word in order to translate the N first phoneme sequences into N second phoneme sequences, each of the N second phoneme sequences formed from phonemes associated with the mother tongue language,wherein each of the N first phoneme sequences of the N various language is translated into a corresponding second phoneme sequence of the mother tongue language (a) regardless of whether the mobile device includes a speech model for each of the N various languages, and (b) regardless of whether the mother tongue language is the most acoustically similar to each of the N various languages, with respect to the respective first and second phoneme sequences, andsuch that for each word, two different phonetic transcripts are generated for each of the N different languages, including (1) the N first phoneme sequences for the word, each formed from phonemes associated with one of the N different languages, and (2) the N second phoneme sequences for the word, each formed by applying the phoneme map to translate one of the N first phoneme sequences formed from phonemes associated with one of the N different languages into a sequence of phonemes associated with the mother tongue language; and
  
  (e) processing said N second phoneme sequences with the phoneme set associated with the language specified as the mother tongue to identify at least one of a matching word and a similar word.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
- - 2. The method according to claim 1, further comprising a step of adding the N second phoneme sequences for each word in a language recognition vocabulary located in the mother tongue language recognizer.
  - 3. The method according to claim 1, further determining distances to the N second pronunciation variants based at least on the processed N second phoneme sequences.
  - 4. The method according to claim 3, further comprising a step of classifying each N second phoneme sequences to identify respective distances.
  - 5. The method according to claim 4, further comprising a step of eliminating any N second phoneme sequences that do not exceed a predetermined threshold.
  - 6. The method according to claim 5, wherein the distances are Leveshtein distances.
  - 7. The method according to claim 1, further comprising the step of determining probabilities that each word for N various languages not specified as the mother tongue belong to a specified set of languages, said step of determining probabilities occurring before step (a).
  - 8. The method according to claim 7, further comprising the step of eliminating languages from said specified set that do not exceed a predetermined threshold.
  - 9. The method according to claim 1, wherein the step of determining the phonetic transcripts of each word for N various languages not specified as the mother tongue is performed by at least one neural network.
  - 10. The method according to claim 1, wherein processing said N second phoneme sequences with the phoneme set associated with the language specified as a mother tongue is performed using a Hidden Markov Model.

11. An automatic language recognizing apparatus, including computer program modules encoded in tangible, non-transitory computer readable media associated with a mobile device, the computer program modules comprising:
- a mother tongue language recognizer, said recognizer storing a phoneme set of a predetermined mother tongue;
  
  a first processing module for determining phonetic transcripts for each word of a plurality of words from N various languages in order to obtain N first phoneme sequences for each word corresponding to N first pronunciation variants, each of the N first phoneme sequences formed from phonemes associated with one of the N different languages;
  
  a second processing module for implementing a mapping of first phoneme sequence of each of N various languages to a particular phoneme set of the mother tongue;
  
  a third processing module for applying the implemented mapping of phonemes to translate the N first phoneme sequences for each word determined by means of the first processing module into N second phoneme sequences corresponding to N second pronunciation variants being obtained for each word, the N second phoneme sequences formed from phonemes associated with the mother tongue language and being recognized by the mother tongue language recognizer;
  
  wherein the third processing module translates each of the N first phoneme sequences of the N various language into a corresponding second phoneme sequence of the mother tongue language (a) regardless of whether the mobile device includes a speech model for each of the N various languages, and (b) regardless of whether the mother tongue language is the most acoustically similar to each of the N various languages, with respect to the respective first and second phoneme sequences, andsuch that for each word, two different phonetic transcripts are generated for each of the N different languages, including (1) the N first phoneme sequences for the word, each formed from phonemes associated with one of the N different languages, and (2) the N second phoneme sequences for the word, each formed by applying the phoneme map to translate one of the N first phoneme sequences formed from phonemes associated with one of the N different languages into a sequence of phonemes associated with the mother tongue language; and
  
  a fourth processing module for creating a language recognizable vocabulary with the N second phoneme sequences for each word, obtained by the third processing module, for the mother tongue language recognizer.
- View Dependent Claims (12, 13, 14, 15, 16, 17, 18)
- - 12. The automatic language recognizing apparatus according to claim 11, further comprising a fifth processing module for processing the N second phoneme sequences corresponding to the N second pronunciation variants of each word to obtain distances for each N second phoneme sequence.
  - 13. The automatic language recognizing apparatus according to claim 12, wherein said distances are Levenshtein distances.
  - 14. The automatic language recognizing apparatus according to claim 13, wherein the N second phoneme sequence distances not exceeding a predetermined threshold are eliminated from further processing.
  - 15. The automatic language recognizing apparatus according to claim 11, further comprising a language identifier, coupled to the first processing module, wherein the language identifier determines a probability of each word belonging to each of the N various languages.
  - 16. The automatic language recognizing apparatus according to claim 15, further comprising a language reducer that reduces the number of languages from the first processing module to be processed if said probability does not exceed a predetermined thresholds.
  - 17. The automatic language recognizing apparatus according to claim 11, wherein the first processing module comprises at least one neural network for determining the phonetic transcripts.
  - 18. The automatic language recognizing apparatus according to claim 11, wherein the mother tongue language recognizer comprises a Hidden Markov model that has been created for the phoneme set of the predetermined mother tongue.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Siemens AG
Original Assignee
Siemens AG
Inventors
Schneider, Tobias
Primary Examiner(s)
SAINT CYR, LEONARD

Application Number

US10/501,700
Publication Number

US 20050033575A1
Time in Patent Office

3,106 Days
Field of Search

704/246, 704/251, 704/252, 704/254, 704/256
US Class Current

704/254
CPC Class Codes

G10L 15/005   Language recognition

G10L 15/142   Hidden Markov Models [HMMs]

G10L 15/16   using artificial neural net...

G10L 2015/025   Phonemes, fenemes or fenone...

Operating method for an automated language recognizer intended for the speaker-independent language recognition of words in different languages and automated language recognizer

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

34 Citations

18 Claims

Specification

Solutions

Use Cases

Quick Links

Operating method for an automated language recognizer intended for the speaker-independent language recognition of words in different languages and automated language recognizer

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

34 Citations

18 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links