System for generating captions for live video broadcasts
First Claim
Patent Images
1. A method of performing distributed caption generation, the method comprising:
- programmatically selecting first and second respeakers to perform respeaking with a voice recognition engine for a broadcast program based at least in part on past performance ratings of the first and second respeakers;
receiving first text generated by the first respeaker for inclusion in the broadcast program;
receiving second text generated by the second respeaker for inclusion in the broadcast program, wherein the second text is being received as backup in case receipt of the first text is interrupted;
electronically outputting the first text for inclusion as captions in the broadcast program;
determining whether receipt of the first text is interrupted;
in response to determining that receipt of the first text is interrupted, outputting the second text for inclusion in the broadcast program; and
programmatically calculating new performance ratings for the first and second respeakers, the new performance ratings configured to be used to assign the first or second respeaker to a subsequent broadcast program, said calculating the new performance ratings for the first and second respeakers comprising taking into account at least the following factors;
network connections of the first and second respeakers;
whether the first and second respeakers train the voice recognition engine, subsequent to the broadcast program, with words incorrectly transcribed during the broadcast program; and
whether the first and second respeakers train the voice recognition engine with new words not previously recognized by the voice recognition engine;
wherein the new performance ratings are negatively impacted if the first and second respeakers fail to train the voice recognition engine and are positively impacted if the first and second respeakers train the voice recognition engine; and
wherein at least said determining and said calculating are implemented by a computer system comprising computer hardware.
35 Assignments
0 Petitions
Accused Products
Abstract
An adaptive workflow system can be used to implement captioning projects, such as projects for creating captions or subtitles for live and non-live broadcasts. Workers can repeat words spoken during a broadcast program or other program into a voice recognition system, which outputs text that may be used as captions or subtitles. The process of workers repeating these words to create such text can be referred to as respeaking. Respeaking can be used as an effective alternative to more expensive and hard-to-find stenographers for generating captions and subtitles.
79 Citations
18 Claims
-
1. A method of performing distributed caption generation, the method comprising:
-
programmatically selecting first and second respeakers to perform respeaking with a voice recognition engine for a broadcast program based at least in part on past performance ratings of the first and second respeakers; receiving first text generated by the first respeaker for inclusion in the broadcast program; receiving second text generated by the second respeaker for inclusion in the broadcast program, wherein the second text is being received as backup in case receipt of the first text is interrupted; electronically outputting the first text for inclusion as captions in the broadcast program; determining whether receipt of the first text is interrupted; in response to determining that receipt of the first text is interrupted, outputting the second text for inclusion in the broadcast program; and programmatically calculating new performance ratings for the first and second respeakers, the new performance ratings configured to be used to assign the first or second respeaker to a subsequent broadcast program, said calculating the new performance ratings for the first and second respeakers comprising taking into account at least the following factors; network connections of the first and second respeakers; whether the first and second respeakers train the voice recognition engine, subsequent to the broadcast program, with words incorrectly transcribed during the broadcast program; and whether the first and second respeakers train the voice recognition engine with new words not previously recognized by the voice recognition engine; wherein the new performance ratings are negatively impacted if the first and second respeakers fail to train the voice recognition engine and are positively impacted if the first and second respeakers train the voice recognition engine; and wherein at least said determining and said calculating are implemented by a computer system comprising computer hardware. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A method of performing distributed caption generation, the method comprising:
-
selecting a respeaker to perform respeaking with a voice recognition engine for a broadcast program based at least in part on a past performance rating of the respeaker; receiving text generated by the respeaker for inclusion in the broadcast program; electronically outputting the text for inclusion in the broadcast program; calculating a new performance rating for the respeaker, the new performance rating configured to be used to evaluate whether to assign the respeaker to a subsequent broadcast program, said calculating the new performance rating comprising taking into account at least the following factors; a network connection of the respeaker; whether the respeaker trains the voice recognition engine, subsequent to the broadcast program, with words incorrectly transcribed during the broadcast program; whether the respeaker trains the voice recognition engine with new words not previously recognized by the voice recognition engine; and wherein the new performance rating is negatively impacted if the respeaker fails to train the voice recognition engine and is positively impacted if the respeaker trains the voice recognition engine; and wherein at least said calculating is implemented by a computer system comprising computer hardware. - View Dependent Claims (8, 9)
-
-
10. A system for performing distributed caption generation, the system comprising:
-
a project management module configured to select a respeaker to perform respeaking with a voice recognition engine for a broadcast based at least in part on a past performance rating of the respeaker; a project network application comprising a respeaking module configured to; provide functionality for the respeaker to generate text responsive to audio from a broadcast, receive text generated by the respeaker, and electronically output the text for inclusion in the broadcast; and a worker ratings calculator comprising computer hardware, the worker ratings calculator configured to calculate a new performance rating for the respeaker, the new performance rating configured to be used to evaluate whether to assign the respeaker to a subsequent broadcast program, wherein the worker rating calculator is further configured to calculate the new performance rating by taking into account at least the following factors; a network connection of the respeaker; whether the respeaker trains the voice recognition engine, subsequent to the broadcast program, with words incorrectly transcribed during the broadcast program; and whether the respeaker trains the voice recognition engine with new words not previously recognized by the voice recognition engine; wherein the new performance rating is negatively impacted if the respeaker fails to train the voice recognition engine and is positively impacted if the respeaker trains the voice recognition engine. - View Dependent Claims (11)
-
-
12. Non-transitory physical computer storage comprising instructions stored thereon for implementing, in one or more processors, operations for performing distributed caption generation, the operations comprising:
-
selecting a respeaker to perform respeaking with a voice recognition engine for a first program based at least in part on a past performance rating of the respeaker; receiving text generated by the respeaker for inclusion in the first program; electronically outputting the text for inclusion in the first program; and programmatically calculating a new performance rating for the respeaker, the new performance rating configured to be used to evaluate whether to assign the respeaker to a subsequent program, wherein said calculating the new performance rating comprises taking into account at least the following factors; a network connection of the respeaker; whether the respeaker trains the voice recognition engine, subsequent to the first program, with words incorrectly transcribed during the broadcast program; and whether the respeaker trains the voice recognition engine with new words not previously recognized by the voice recognition engine; wherein the new performance rating is negatively impacted if the respeaker fails to train the voice recognition engine and is positively impacted if the respeaker trains the voice recognition engine. - View Dependent Claims (13, 14, 15, 16, 17, 18)
-
Specification