Automated speech alignment for image synthesis
First Claim
Patent Images
1. A computerized method for synchronizing audio signals to computer generated visual images comprising the steps of:
- analyzing an audio signal to produce a stream of time aligned acoustic-phonetic units, there is one acoustic-phonetic unit for each portion of audio signal determined to be phonetically distinct, each acoustic phonetic unit having a starting time and an ending time of the phonetically distinct portion of the audio signal;
translating each acoustic-phonetic unit to a corresponding time aligned image unit representative of the acoustic-phonetic unit; and
displaying an image including the time aligned image units while synchronizing to the audio signal.
3 Assignments
0 Petitions
Accused Products
Abstract
In a computerized method, speech signals are analyzed using statistical trajectory modeling to produce time aligned acoustic-phonetic units. There is one acoustic-phonetic unit for each portion of the speech signal determined to be phonetically distinct. The acoustic-phonetic units are translated to corresponding time aligned image units representative of the acoustic-phonetic units. An image including the time aligned image units is displayed. The display of the time aligned image units is synchronized to a replaying of the digitized natural speech signal.
124 Citations
44 Claims
-
1. A computerized method for synchronizing audio signals to computer generated visual images comprising the steps of:
-
analyzing an audio signal to produce a stream of time aligned acoustic-phonetic units, there is one acoustic-phonetic unit for each portion of audio signal determined to be phonetically distinct, each acoustic phonetic unit having a starting time and an ending time of the phonetically distinct portion of the audio signal; translating each acoustic-phonetic unit to a corresponding time aligned image unit representative of the acoustic-phonetic unit; and displaying an image including the time aligned image units while synchronizing to the audio signal. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method for synchronizing audio signals to computer generated visual images, the method comprising the steps of:
-
analyzing an audio signal to produce time aligned acoustic-phonetic units; translating each one of the time aligned acoustic-phonetic units to a time aligned image unit representative of the each one to produce a stream of time aligned image units; and displaying the stream of time aligned image units wherein the stream of time aligned image units is synchronized to the audio signal. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A computing device for synchronizing audio signals to computer generated visual images, the computing device comprising:
-
an audio signal; and a processor coupled to the audio signal, the processor configured to; analyze the audio signal to produce time aligned acoustic-phonetic units; to translate each one of the time aligned acoustic-phonetic units with a time aligned image unit representative of the each one to produce a stream of time aligned image units; and to display the stream of time aligned image units wherein the stream of time aligned image units is synchronized to the audio signal. - View Dependent Claims (31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44)
-
Specification