Audio synchronization for document narration with user-selected playback
First Claim
Patent Images
1. A computer implemented method comprising:
- applying speech recognition by one or more computer systems to an audio recording to generate a text version of recognized portions of text;
providing an audible output corresponding to the audio recording;
displaying, on a user interface rendered on a display device, an expected portion of text that corresponds to the words in the audio recording, the displayed expected portion of text including at least a portion of the expected portion of text that is currently being provided on the audible output;
providing visual indicia for the displayed text that corresponds to;
the audio that is currently being provided on the audible output, if the recognized portion of text matches the corresponding expected portion of text; and
otherwise one or more portions of text which does not match the recognized portion of text, if the recognized portion of text does not match the corresponding expected portion of text.
8 Assignments
0 Petitions
Accused Products
Abstract
Disclosed are techniques and systems to provide a narration of a text. In some aspects, the techniques and systems described herein include generating a timing file that includes elapsed time information for expected portions of text that provides an elapsed time period from a reference time in an audio recording to each portion of text in recognized portions of text.
-
Citations
20 Claims
-
1. A computer implemented method comprising:
-
applying speech recognition by one or more computer systems to an audio recording to generate a text version of recognized portions of text; providing an audible output corresponding to the audio recording; displaying, on a user interface rendered on a display device, an expected portion of text that corresponds to the words in the audio recording, the displayed expected portion of text including at least a portion of the expected portion of text that is currently being provided on the audible output; providing visual indicia for the displayed text that corresponds to; the audio that is currently being provided on the audible output, if the recognized portion of text matches the corresponding expected portion of text; and
otherwise one or more portions of text which does not match the recognized portion of text, if the recognized portion of text does not match the corresponding expected portion of text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer implemented method comprising:
-
applying speech recognition by one or more computer systems to an audio recording to generate a text version of recognized portions of text; comparing by the one or more computer systems the recognized portion of text to an expected portion of text; providing an audible output corresponding to the audio recording; determining by the one or more computer systems a recognized portion of text corresponding to a currently audible portion of the audio recording; displaying an expected portion of text on a user interface rendered on a display device such that the displayed expected portion of text includes at least an expected portion of text previous to the determined currently audible portion of the audio recording; and providing visual indicia for the displayed expected portion of text according to whether there is a match between expected and recognized text. - View Dependent Claims (11, 12)
-
-
13. A computer program product tangibly stored on a computer readable hardware storage device, the computer program product comprising instructions to cause a processor to:
-
apply speech recognition by one or more computer systems to an audio recording to generate a text version of recognized portions of text; provide an audible output corresponding to the audio recording; display, on a user interface rendered on a display device, an expected portion of text that corresponds to the words in the audio recording, the displayed expected portion of text including at least a portion of the expected portion of text that is currently being provided on the audible output; provide visual indicia for the displayed text that corresponds to; the audio that is currently being provided on the audible output, if the recognized portion of text matches the corresponding expected portion of text; and
otherwiseone or more portions of text which does not match the recognized portion of text, if the recognized portion of text does not match the corresponding expected portion of text. - View Dependent Claims (14, 15, 16)
-
-
17. A device comprises:
-
a processor; a display in communication with the processor; a memory in communication with the processor; and a computer readable hardware storage device storing a computer program product to configure the processor to; apply speech recognition by one or more computer systems to an audio recording to generate a text version of recognized portions of text; provide an audible output corresponding to the audio recording; display, on a user interface rendered on a display device, an expected portion of text that corresponds to the words in the audio recording, the displayed expected portion of text including at least a portion of the expected portion of text that is currently being provided on the audible output; provide visual indicia for the displayed text that corresponds to; the audio that is currently being provided on the audible output, if the recognized portion of text matches the corresponding expected portion of text; and
otherwiseone or more portions of text which does not match the recognized portion of text, if the recognized portion of text does not match the corresponding expected portion of text. - View Dependent Claims (18, 19, 20)
-
Specification