Multi-lingual speech synthesis
Abstract
A method for speech synthesis of a word in a first language, comprising dividing the word into a first sequence of pronunciation phonemes in the first language, mapping the first phoneme sequence to a second sequence of pronunciation phonemes in at least one second language, and generating an audio output of the phonemes in the second phoneme sequence using prosody models adapted for the at least one second language. According to this method, an audio output of a word in a first language can be generated by a speech synthesizing engine not having actual support for this language. Instead, the pronunciation phonemes of the word are mapped onto phonemes of at least one second language, for which the speech synthesizing engine does have support.
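The three steps of the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the lexicon `LEXICON_A`, the mapping table `A_TO_B`, the ASCII phoneme symbols, and the function names are all hypothetical, and the final step only returns a description of the request that a language-B engine would receive instead of producing audio. Allowing one source phoneme to map to several target phonemes is likewise an assumption made for the example.

```python
from typing import Dict, List

# Hypothetical pronunciation lexicon for first language A (illustrative only).
LEXICON_A: Dict[str, List[str]] = {
    "katu": ["k", "a", "t", "u"],
}

# Hypothetical table mapping each language-A phoneme to language-B phonemes.
# Values are lists so that one source phoneme may expand to several targets
# (an assumption of this sketch, not something stated in the source).
A_TO_B: Dict[str, List[str]] = {
    "k": ["k"],
    "a": ["A:"],
    "t": ["t"],
    "u": ["u:"],
    "y": ["j", "u:"],
}


def divide_into_phonemes(word: str) -> List[str]:
    """Step 1: look up the language-A phoneme sequence for the word."""
    return list(LEXICON_A[word.lower()])


def map_phonemes(seq_a: List[str]) -> List[str]:
    """Step 2: map the language-A sequence onto language-B phonemes."""
    seq_b: List[str] = []
    for phoneme in seq_a:
        if phoneme not in A_TO_B:
            raise KeyError(f"no language-B mapping for phoneme {phoneme!r}")
        seq_b.extend(A_TO_B[phoneme])
    return seq_b


def synthesize(seq_b: List[str], prosody_language: str = "B") -> str:
    """Step 3 (placeholder): describe the request for a language-B engine.

    A real engine would apply its language-B prosody models and return audio.
    """
    return f"[prosody={prosody_language}] " + " ".join(seq_b)


if __name__ == "__main__":
    first_sequence = divide_into_phonemes("katu")
    second_sequence = map_phonemes(first_sequence)
    print(synthesize(second_sequence))
```

The point of the design is that only the mapping table depends on the language pair; the engine itself needs support for language B alone.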
202 Citations
15 Claims
1. A method for speech synthesis of a word (20) in a first language (A), comprising:
- dividing said word (20) into a first sequence (21) of pronunciation phonemes in said first language (A),
- mapping said first phoneme sequence (21) to a second sequence (22) of pronunciation phonemes in at least one second language (B), and
- generating an audio output (23) of the phonemes in said second phoneme sequence (22) using prosody models for said at least one second language (B).
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9.
10. A speech synthesizer (6) for speech synthesis of a word (20) in a first language (A), comprising:
- a pronunciation module (11) for dividing said word (20) into a first sequence (21) of pronunciation phonemes in said first language (A),
- processing means (13) for mapping said first phoneme sequence (21) to a second sequence (22) of pronunciation phonemes in at least one second language (B), and
- a speech synthesis engine (15) for generating an audio output (23) of the phonemes in said second phoneme sequence (22) using prosody models for said at least one second language (B).
Dependent claims: 11, 12, 13, 14, 15.
Specification