SYSTEM FOR GENERATING CAPTIONS FOR LIVE VIDEO BROADCASTS
First Claim
Patent Images
1. A method of performing distributed caption generation, the method comprising:
- selecting first and second respeakers to perform respeaking with a voice recognition engine for a broadcast program based at least in part on past performance ratings of the first and second respeakers;
receiving first text generated by the first respeaker for inclusion in the broadcast program;
receiving second text generated by the second respeaker for inclusion in the broadcast program, wherein the second text is being received as backup in case receipt of the first text is interrupted;
outputting the first text for inclusion as captions in the broadcast program;
determining whether receipt of the first text is interrupted;
in response to determining that receipt of the first text is interrupted, outputting the second text for inclusion in the broadcast program; and
calculating new performance ratings for the first and second respeakers, the new performance ratings configured to be used to assign the first or second respeaker to a subsequent broadcast program;
wherein at least said determining is implemented by a computer system comprising computer hardware.
35 Assignments
0 Petitions
Accused Products
Abstract
An adaptive workflow system can be used to implement captioning projects, such as projects for creating captions or subtitles for live and non-live broadcasts. Workers can repeat words spoken during a broadcast program or other program into a voice recognition system, which outputs text that may be used as captions or subtitles. The process of workers repeating these words to create such text can be referred to as respeaking. Respeaking can be used as an effective alternative to more expensive and hard-to-find stenographers for generating captions and subtitles.
81 Citations
34 Claims
-
1. A method of performing distributed caption generation, the method comprising:
-
selecting first and second respeakers to perform respeaking with a voice recognition engine for a broadcast program based at least in part on past performance ratings of the first and second respeakers; receiving first text generated by the first respeaker for inclusion in the broadcast program; receiving second text generated by the second respeaker for inclusion in the broadcast program, wherein the second text is being received as backup in case receipt of the first text is interrupted; outputting the first text for inclusion as captions in the broadcast program; determining whether receipt of the first text is interrupted; in response to determining that receipt of the first text is interrupted, outputting the second text for inclusion in the broadcast program; and calculating new performance ratings for the first and second respeakers, the new performance ratings configured to be used to assign the first or second respeaker to a subsequent broadcast program; wherein at least said determining is implemented by a computer system comprising computer hardware. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of performing distributed caption generation, the method comprising:
-
receiving first text generated by a first respeaker with a voice recognition engine for inclusion in an audio program; receiving second text generated by a second respeaker with the voice recognition engine for inclusion in the audio program, the second text being received as backup in case the first text is no longer received; outputting the first text for inclusion in the audio program; determining whether an interruption has occurred related to receipt of the first text; and in response to determining that the interruption has occurred, outputting the second text of the second respeaker for inclusion in the audio program; wherein at least said determining is implemented by a computer system comprising computer hardware. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A system for performing distributed caption generation, the system comprising:
-
a project network application comprising a respeaking module configured to; provide functionality for first and second respeakers to generate text responsive to audio of a broadcast, receive first text generated by the first respeaker, and receive second text generated by the second respeaker as backup in case the first text is no longer received; and a failover module comprising computer hardware, the failover module configured to; output the first text for inclusion in the broadcast, determining whether an interruption has occurred related to receipt of the first text, and in response to determining that the interruption has occurred related to receipt of the first text, output the second text of the second respeaker for inclusion in the broadcast. - View Dependent Claims (13, 14)
-
-
15. Non-transitory physical computer storage comprising instructions stored thereon for implementing, in one or more processors, operations for performing distributed caption generation, the operations comprising:
-
receiving first text generated by a first respeaker with a voice recognition engine for inclusion in an audio program; receiving second text generated by a second respeaker with the voice recognition engine for inclusion in the audio program, the second text being received as backup in case the first text is no longer received; outputting the first text for inclusion in the audio program; determining whether the first text is no longer being received; and in response to determining that the first text is no longer being received, outputting the second text of the second respeaker for inclusion in the audio program. - View Dependent Claims (16, 17)
-
-
18. A method of performing distributed caption generation, the method comprising:
-
selecting a respeaker to perform respeaking with a voice recognition engine for a broadcast program based at least in part on a past performance rating of the respeaker; receiving text generated by the respeaker for inclusion in the broadcast program; outputting the text for inclusion in the broadcast program; calculating a new performance rating for the respeaker, the new performance rating configured to be used to evaluate whether to assign the respeaker to a subsequent broadcast program; and wherein at least said calculating is implemented by a computer system comprising computer hardware. - View Dependent Claims (19, 20, 21, 22)
-
-
23. A system for performing distributed caption generation, the system comprising:
-
a project management module configured to select a respeaker to perform respeaking with a voice recognition engine for a broadcast based at least in part on a past performance rating of the respeaker; a project network application comprising a respeaking module configured to; provide functionality for the respeaker to generate text responsive to audio from a broadcast, receive text generated by the respeaker, and output the text for inclusion in the broadcast; and a worker ratings calculator comprising computer hardware, the worker ratings calculator configured to calculate a new performance rating for the respeaker, the new performance rating configured to be used to evaluate whether to assign the respeaker to a subsequent broadcast program. - View Dependent Claims (24)
-
-
25. Non-transitory physical computer storage comprising instructions stored thereon for implementing, in one or more processors, operations for performing distributed caption generation, the operations comprising:
-
selecting a respeaker to perform respeaking with a voice recognition engine for a program based at least in part on a past performance rating of the respeaker; receiving text generated by the respeaker for inclusion in the program; outputting the text for inclusion in the program; and calculating a new performance rating for the respeaker, the new performance rating configured to be used to evaluate whether to assign the respeaker to a subsequent program. - View Dependent Claims (26, 27, 28, 29, 30, 31)
-
-
32. Non-transitory physical computer storage comprising instructions stored thereon for implementing, in one or more processors, operations for performing distributed caption generation, the operations comprising:
-
receiving speech audio from a respeaker user with a voice recognition engine, the voice recognition engine comprising a plurality of voice recognition systems, the speech audio corresponding to speech output by the respeaker user in order to transcribe broadcast audio; providing the speech audio to the plurality of voice recognition systems; receiving text output from each of the voice recognition systems; receiving a calculated probability of accuracy for the output text from each of the voice recognition systems; and selecting the output text from one of the voice recognition systems based on the calculated probability of accuracy. - View Dependent Claims (33, 34)
-
Specification