Audio synchronization for document narration with user-selected playback
First Claim
Patent Images
1. A method comprises:
- applying speech recognition by a hand-held device that includes a processor device and memory to an audio recording to generate a text file including as text in the file recognized speech from the audio recording and determine an elapsed time period from a reference time in the audio recording to text in the file of recognized speech;
comparing by the hand-held device text of recognized speech to expected text;
generating by the hand-held device a timing file that is stored on a computer-readable storage medium, the timing file comprising the elapsed time information for the expected text;
rendering on a display device associated with the hand-held device, text;
rendering on the display device, a menu that displays graphics of multiple characters each of which is associated with a different audio recording;
receiving by the hand-held device an indication of user-selected text, the user-selected text corresponding to portions of the text that are rendered aloud from corresponding portions of the audio recording for a first user selected character;
determining by the hand-held device an elapsed time in the audio recording by referencing the timing file associated with the user-selected text; and
providing by the mobile device an audible output corresponding to the audio in the audio recording at the determined elapsed time in the audio recording for the user selected portions of text for narration with the corresponding portions of the audio recording.
4 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are techniques and systems to provide a narration of a text. In some aspects, the techniques and systems described herein include generating a timing file that includes elapsed time information for expected portions of text that provides an elapsed time period from a reference time in an audio recording to each portion of text in recognized portions of text.
147 Citations
22 Claims
-
1. A method comprises:
-
applying speech recognition by a hand-held device that includes a processor device and memory to an audio recording to generate a text file including as text in the file recognized speech from the audio recording and determine an elapsed time period from a reference time in the audio recording to text in the file of recognized speech; comparing by the hand-held device text of recognized speech to expected text; generating by the hand-held device a timing file that is stored on a computer-readable storage medium, the timing file comprising the elapsed time information for the expected text; rendering on a display device associated with the hand-held device, text; rendering on the display device, a menu that displays graphics of multiple characters each of which is associated with a different audio recording; receiving by the hand-held device an indication of user-selected text, the user-selected text corresponding to portions of the text that are rendered aloud from corresponding portions of the audio recording for a first user selected character; determining by the hand-held device an elapsed time in the audio recording by referencing the timing file associated with the user-selected text; and providing by the mobile device an audible output corresponding to the audio in the audio recording at the determined elapsed time in the audio recording for the user selected portions of text for narration with the corresponding portions of the audio recording. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprises:
-
one or more processor devices; memory coupled to the one or more processor devices; and a computer readable hardware storage device storing a computer program product comprising instructions for causing the one or more processors to; determine an elapsed time period from a reference time in an audio recording to text in a file of recognized speech from the audio recording; compare text of recognized speech to expected text; generate a timing file comprising the elapsed time information for the expected text; render on a display device associated with the system, text; render on the display device, a menu that displays graphics of multiple characters each of which is associated with a different audio recording; receive an indication of user-selected text, the user-selected text corresponding to portions of the text that are rendered aloud from corresponding portions of the audio recording for a first user selected character; determine an elapsed time in the audio recording by referencing the timing file associated with the user-selected text; and provide an audible output corresponding the audio in the audio recording at the determined elapsed time in the audio recording. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
-
17. A computer program product tangibly stored on a computer readable hardware storage device, the computer program product comprising instructions for causing a processor to:
-
determine an elapsed time period from a reference time in an audio recording to text in a file of recognized speech; compare text of recognized speech to expected text; generate a timing file comprising the elapsed time information for the expected text; render on a display device, text; render on the display device, a menu that displays graphics of multiple characters each of which is associated with a different audio recording; receive an indication of user-selected text, the user-selected text corresponding to portions of the text that are rendered aloud from corresponding portions of the audio recording for a first user selected character; determine an elapsed time in the audio recording by referencing the timing file associated with the user-selected text; and provide an audible output corresponding the audio in the audio recording at the determined elapsed time in the audio recording. - View Dependent Claims (18, 19, 20, 21, 22)
-
Specification