SYSTEM AND METHOD FOR ENRICHING SPOKEN LANGUAGE TRANSLATION WITH PROSODIC INFORMATION
Abstract
Disclosed herein are systems, methods, and computer-readable media for enriching spoken language translation with prosodic information in a statistical speech translation framework. The method includes receiving speech for translation to a target language, generating pitch accent labels representing segments of the received speech which are prosodically prominent, and injecting pitch accent labels with word tokens within the translation engine to create enriched target language output text. A further step may be added of synthesizing speech in the target language based on the prosody enriched target language output text. An automatic prosody labeler can generate pitch accent labels. An automatic prosody labeler can exploit lexical, syntactic, and prosodic information of the speech. A maximum entropy model may be used to determine which segments of the speech are prosodically prominent. A pitch accent label can include an indication of certainty that a respective segment of the speech is prosodically prominent and/or an indication of prosodic prominence of a respective segment of speech.
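The injection step described in the abstract can be illustrated with a minimal sketch. It assumes that each word already carries a certainty score from some prosody labeler and that a label is attached to its word token with a separator character. The names `LabeledWord`, `inject_pitch_accents`, the tags `ACC` and `NONE`, the `|` separator, and the 0.5 threshold are all hypothetical choices made for illustration; the source does not specify a token format or a threshold.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LabeledWord:
    """A source word plus a hypothetical certainty that it is prosodically prominent."""
    word: str
    accent_prob: float  # assumed to come from an automatic prosody labeler


def inject_pitch_accents(
    words: List[LabeledWord],
    threshold: float = 0.5,      # illustrative cut-off, not taken from the source
    accent_tag: str = "ACC",     # hypothetical tag for a pitch-accented word
    no_accent_tag: str = "NONE", # hypothetical tag for an unaccented word
    sep: str = "|",              # hypothetical word/label separator
) -> List[str]:
    """Attach a pitch accent label to every word token.

    The enriched tokens could then be handed to a translation engine in
    place of the plain word tokens.
    """
    enriched = []
    for w in words:
        tag = accent_tag if w.accent_prob >= threshold else no_accent_tag
        enriched.append(f"{w.word}{sep}{tag}")
    return enriched


if __name__ == "__main__":
    utterance = [
        LabeledWord("i", 0.10),
        LabeledWord("want", 0.80),
        LabeledWord("a", 0.05),
        LabeledWord("ticket", 0.90),
    ]
    print(" ".join(inject_pitch_accents(utterance)))
```

Keeping the certainty score alongside the word until the last moment leaves room for either form of label mentioned in the abstract: a hard prominence decision, as here, or a label that carries the certainty itself.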
20 Claims
1. A method of enriching spoken language translation with prosodic information in a statistical speech translation framework, the method comprising:
receiving speech for translation to a target language;
generating pitch accent labels representing segments of the received speech which are prosodically prominent; and
injecting pitch accent labels with word tokens within a translation engine to create enriched target language output text.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A system for enriching spoken language translation with prosodic information in a statistical speech translation framework, the system comprising:
a module configured to receive speech for translation to a target language;
a module configured to generate pitch accent labels representing segments of the received speech which are prosodically prominent; and
a module configured to inject pitch accent labels with word tokens within a translation engine to create enriched target language output text.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A computer-readable medium storing a computer program having instructions for enriching spoken language translation with prosodic information in a statistical speech translation framework, the instructions comprising:
receiving speech for translation to a target language;
generating pitch accent labels representing segments of the received speech which are prosodically prominent; and
injecting pitch accent labels with word tokens within a translation engine to create enriched target language output text.
Dependent claims: 16, 17, 18, 19, 20
Specification