Content-based audio playback emphasis
First Claim
1. A method comprising steps of:
- (A) identifying an estimate of a likelihood that a region of a document correctly represents content in a corresponding region of a spoken audio stream; and
(B) identifying, based on the identified likelihood, an emphasis factor for modifying emphasis placed on the region of the spoken audio stream when played back.
11 Assignments
0 Petitions
Accused Products
Abstract
Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelihood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.
-
Citations
52 Claims
-
1. A method comprising steps of:
-
(A) identifying an estimate of a likelihood that a region of a document correctly represents content in a corresponding region of a spoken audio stream; and
(B) identifying, based on the identified likelihood, an emphasis factor for modifying emphasis placed on the region of the spoken audio stream when played back. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 27, 35, 41)
-
-
26. A method comprising steps of:
-
(A) identifying an estimate of a likelihood that a region of a document correctly represents content in a corresponding region of a spoken audio stream;
(B) identifying a measure of relevance of the region of the spoken audio stream; and
(C) identifying, based on the identified likelihood and the identified measure of relevance, a timescale adjustment factor for adjusting a playback rate of the region of the spoken audio stream when played back.
-
-
28. An apparatus comprising:
-
first identification means for identifying an estimate of a likelihood that a region of a document correctly represents content in a corresponding region of a spoken audio stream; and
second identification means for identifying, based on the identified likelihood, an emphasis factor for modifying emphasis placed on the region of the spoken audio stream when played back. - View Dependent Claims (29, 30, 31, 32, 33, 34, 36, 37, 38, 39, 40)
-
-
42. A method comprising steps of:
-
(A) identifying an estimate of a likelihood that a region of a document correctly represents particular content;
(B) identifying, based on the identified likelihood, an emphasis factor; and
(C) using a text-to-speech engine to play an audio stream representing the region of the document with an emphasis specified by the emphasis factor. - View Dependent Claims (43, 44, 45, 46, 47)
-
-
48. An apparatus comprising:
-
first identification means for identifying an estimate of a likelihood that a region of a document correctly represents particular content;
second identification means for identifying, based on the identified likelihood, an emphasis factor; and
a text-to-speech engine to play an audio stream representing the region of the document with an emphasis specified by the emphasis factor. - View Dependent Claims (49, 50, 51, 52)
-
Specification