Generation of subtitles or captions for moving pictures
First Claim
1. A method of parsing an electronic text file to identify different components thereof, comprising:
- identifying blocks of text in an input electronic text file;
providing a plurality of possible script format properties for the blocks;
providing a definition of each of the possible components of the text file;
in relation to each block, determining the value of each script format property;
for each block, determining from the script format properties of the block and the component definitions a probability value that the block is each of the component types;
selecting the component type for each block on the basis of the probabilities being each of the component types; and
generating output file based on the selecting.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for generating subtitles for audiovisual material received and analyses a text file containing dialogue spoken in audiovisual material and provides a signal representative of the text. The text information and audio signal are aligned in time using time alignment speech recognition and the text and timing information are then output to a subtitle file. Colors can be assigned to different speakers or groups of speakers. Subtitles are derived by receiving and analyzing a text file containing dialogue spoken by considering each word in turn and the next information signal, assigning a score to each subtitle in a plurality of different possible subtitle formatting options which lead to that word. The steps are then repeated until all the words in the text information signal have been used and the subtitle formatting option which gives the best overall score is then derived.
-
Citations
4 Claims
-
1. A method of parsing an electronic text file to identify different components thereof, comprising:
-
identifying blocks of text in an input electronic text file; providing a plurality of possible script format properties for the blocks; providing a definition of each of the possible components of the text file; in relation to each block, determining the value of each script format property; for each block, determining from the script format properties of the block and the component definitions a probability value that the block is each of the component types; selecting the component type for each block on the basis of the probabilities being each of the component types; and generating output file based on the selecting. - View Dependent Claims (2, 3, 4)
-
Specification