Navigation and orientation tools for speech synthesis
First Claim
1. A method for synchronizing speech output and display output of a text, said text being synthesized to the speech output, the method comprising:
- a. receiving a text portion of the text, wherein display of an entirety of said text portion requires at least two text areas;
b. receiving a start event indicating a next text unit of said text portion to be displayed and synthesized, wherein the start event comprises moving a time indicator to a new position on a time line indicating a corresponding position of said next text unit in the text portion to be synthesized;
c. in response to the new position in a time line, calculating display parameters associated with said next text unit, wherein said display parameters are designated to synchronize, on the basis of phonemes, the speech output and the display output of said next text unit, said display parameters including;
1) a position of the next text unit in the text portion, and2) a position of the next text unit on a time line indicating a respective point in time of the next text unit in an entire playback time of the speech output of the text portion, wherein said entire playback time is calculated by multiplying an average time required to play back synthesized speech output of a single character by a number of total characters in the text portion;
d. synchronizing, on the basis of phonemes, the speech output and the display output of the next text unit, including;
displaying an indication of the next text unit according to said display parameters, executing a text to speech synthesis of the next text unit indicated by the new position on the time line, and outputting the speech output of the next text unit, said displaying including;
i. portraying a text indicator indicating the position of the next text unit in the text portion, andii. portraying said time indicator indicating the position of the next text unit on the time line; and
e. repeating steps (c)-(d) with a subsequent text unit following the next text unit, the subsequent text unit becoming the next text unit of (c) upon repetition thereof.
0 Assignments
0 Petitions
Accused Products
Abstract
TTS is a well known technology for decades used for various applications from Artificial Call centers attendants to PC software that allows people with visual impairments or reading disabilities to listen to written works on a home computer. However to date TTS is not widely adopted for PC and Mobile users for daily reading tasks such as reading emails, reading pdf and word documents, reading through website content, and for reading books. The present invention offers new user experience for operating TTS for day to day usage. More specifically this invention describes a synchronization technique for following text being read by TTS engines and specific interfaces for touch pads, touch and multi touch screens. Nevertheless this invention also describes usage of other input methods such as touchpad, mouse, and keyboard.
43 Citations
17 Claims
-
1. A method for synchronizing speech output and display output of a text, said text being synthesized to the speech output, the method comprising:
-
a. receiving a text portion of the text, wherein display of an entirety of said text portion requires at least two text areas; b. receiving a start event indicating a next text unit of said text portion to be displayed and synthesized, wherein the start event comprises moving a time indicator to a new position on a time line indicating a corresponding position of said next text unit in the text portion to be synthesized; c. in response to the new position in a time line, calculating display parameters associated with said next text unit, wherein said display parameters are designated to synchronize, on the basis of phonemes, the speech output and the display output of said next text unit, said display parameters including; 1) a position of the next text unit in the text portion, and 2) a position of the next text unit on a time line indicating a respective point in time of the next text unit in an entire playback time of the speech output of the text portion, wherein said entire playback time is calculated by multiplying an average time required to play back synthesized speech output of a single character by a number of total characters in the text portion; d. synchronizing, on the basis of phonemes, the speech output and the display output of the next text unit, including; displaying an indication of the next text unit according to said display parameters, executing a text to speech synthesis of the next text unit indicated by the new position on the time line, and outputting the speech output of the next text unit, said displaying including; i. portraying a text indicator indicating the position of the next text unit in the text portion, and ii. portraying said time indicator indicating the position of the next text unit on the time line; and e. repeating steps (c)-(d) with a subsequent text unit following the next text unit, the subsequent text unit becoming the next text unit of (c) upon repetition thereof. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method for synchronizing speech output and display output of a text, said text being synthesized to the speech output, the method comprising:
-
a) providing a display of a time line indicating an entire playback time of the speech output of the text, wherein said entire playback time is calculated by multiplying an average time required to play back synthesized speech output of a single character by a number of total characters in the text; b) providing a display of a time indicator on the time line, wherein a position of the time indicator on the time line indicates a point in time within the entire playback time of the speech output of the text; c) synchronizing the display of a text unit within the text that is next to be synthesized to the speech output, constituting a next text unit, with the position of the time indicator on the time line such that changing the displayed next text unit will cause the time indicator to be moved to the point on the time line corresponding to the time of the speech output corresponding to that of the next text unit and changing the position of the time indicator on the time line will cause the displayed next text unit to be that which occurs at the indicated position on the time line; d) displaying a selected next text unit; e) feeding the selected next text unit to a text-to-speech engine and executing a text to speech synthesis of the selected next text unit, as indicated by the position of the time indicator on the time line, thereby generating a speech output of the next text unit; f) outputting said speech output, and g) repeating steps (d)-(f) with a subsequent text unit following the next text unit, the subsequent text unit becoming the selected next text unit of (d) upon repetition thereof. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
Specification