Method and Apparatus to Model and Transfer the Prosody of Tags across Languages
First Claim
Patent Images
1. A method and Apparatus to Model and Transfer the Prosody of Tags across Languages comprising the steps of a first person in speaking in language one (L1);
- where the L1 speech is recognized by the ASR;
searching the speech for a known tag;
searching the pieces of text that have common, cpnsistent, or idiomatic intonation patterns, translating the text to language number two (L2);
examine the speech signal of L! to find the segments that correspond to the tag;
extract the fundamental frequency from those sigments and fit a smooth contour such as a cubic spline;
map the stylized smooth contour into the corresponding part of the pitch range of the intended L2 synthesized speech;
stretch or contract stylized smooth contour over time because the duration of the translation will be different;
align the contour with the corresponding L2 segments and impose it on the synthesized L2 speech.
2 Assignments
0 Petitions
Accused Products
Abstract
Identify, Capture, Retain and Synthesize Non-Linguistic and Discourse Components of Speech across Languages
-
Citations
1 Claim
-
1. A method and Apparatus to Model and Transfer the Prosody of Tags across Languages comprising the steps of a first person in speaking in language one (L1);
- where the L1 speech is recognized by the ASR;
searching the speech for a known tag;
searching the pieces of text that have common, cpnsistent, or idiomatic intonation patterns, translating the text to language number two (L2);
examine the speech signal of L! to find the segments that correspond to the tag;extract the fundamental frequency from those sigments and fit a smooth contour such as a cubic spline;
map the stylized smooth contour into the corresponding part of the pitch range of the intended L2 synthesized speech;
stretch or contract stylized smooth contour over time because the duration of the translation will be different;
align the contour with the corresponding L2 segments and impose it on the synthesized L2 speech.
- where the L1 speech is recognized by the ASR;
Specification