Multi-lingual speech synthesis

US 20050144003A1
Filed: 12/08/2003
Published: 06/30/2005
Est. Priority Date: 12/08/2003
Status: Abandoned Application

First Claim

Patent Images

1. A method for speech synthesis of a word (20) in a first language (A), comprising:

dividing said word (20) into a first sequence (21) of pronunciation phonemes in said first language (A), mapping said first phoneme sequence (21) to a second sequence (22) of pronunciation phonemes in at least one second language (B), and generating an audio output (23) of the phonemes in said second phoneme sequence (22) using prosody models for said at least one second language (B).

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method for speech synthesis of a word in a first language, comprising dividing the word into a first sequence of pronunciation phonemes in the first language, mapping the first phoneme sequence to a second sequence of pronunciation phonemes in at least one second language, and generating an audio output of the phonemes in the second phoneme sequence using prosody models adapted for the at least one second language. According to this method, an audio output of a word in a first language can be generated by a speech synthesizing engine not having actual support for this language. Instead, the pronunciation phonemes of the word are mapped onto phonemes of at least one second language, for which the speech synthesizing engine does have support.

202 Citations

15 Claims

1. A method for speech synthesis of a word (20) in a first language (A), comprising:
- dividing said word (20) into a first sequence (21) of pronunciation phonemes in said first language (A), mapping said first phoneme sequence (21) to a second sequence (22) of pronunciation phonemes in at least one second language (B), and generating an audio output (23) of the phonemes in said second phoneme sequence (22) using prosody models for said at least one second language (B).
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
- - 2. The method according to claim 1, further comprising selecting said at least one second language (B) in dependence of said first language (A).
  - 3. The method in claim 1, wherein said second sequence (22) of phonemes belong to a plurality of different languages.
  - 4. The method according to claims 1, wherein said mapping is performed so as to optimize the sound correspondence between said first and said second sequence (21, 22) of phonemes.
  - 5. The method according to claim 1, wherein said mapping includes using a look-up table.
  - 6. The method in claim 1, wherein said prosody models are provided by a text-to-speech (TTS) engine (11) adapted for said at least one second language (B).
  - 7. The method according to claim 1, further comprising smoothening transitions between different phonemes in said second phoneme sequence (22).
  - 8. A computer program product, loadable into memory (3) of a computer (2), said computer program product comprising computer code portions (11, 13, 15) for performing the method according to claim 1 when executed by said computer.
  - 9. The computer program product in claim 8, stored on a computer readable medium (3).

10. A speech synthesizer (6) for speech synthesis of a word (20) in a first language (A) comprising:
- a pronunciation module (11) for dividing said word (20) into a first sequence (21) of pronunciation phonemes in said first language (A), processing means (13) for mapping said first phoneme sequence (21) to a second sequence (22) of pronunciation phonemes in at least one second language (B), and a speech synthesis engine (15) for generating an audio output (23) of the phonemes in said second phoneme sequence (22) using prosody models for said at least one second language (B).
- View Dependent Claims (11, 12, 13, 14, 15)
- - 11. The speech synthesizer in claim 10, wherein said processing means (13) has access to a look-up table (17).
  - 12. The speech synthesizer in claim 11, wherein said look-up table is stored in a memory (3).
  - 13. The speech synthesizer in claim 10, further comprising post processing means, for smoothening transitions between different phonemes in said second phoneme sequence (22).
  - 14. A communication device comprising a speech synthesizer (6) according to claim 10.
  - 15. The communication device in claim 14, further comprising a voice recognition system (5).

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Nokia Corporation
Original Assignee
Nokia Corporation
Inventors
Iso-Sipila, Juha

Application Number

US10/730,373
Publication Number

US 20050144003A1
Time in Patent Office

Days
Field of Search
US Class Current

704/269
CPC Class Codes

G10L 13/08 Text analysis or generation...

Multi-lingual speech synthesis

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

202 Citations

15 Claims

Specification

Use Cases

Quick Links

Others

Multi-lingual speech synthesis

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

202 Citations

15 Claims

Specification

Subscription Required

Use Cases

Quick Links

Others