Integrated speech recognition, closed captioning, and translation system and method
First Claim
1. A method for processing a speech portion of audio signals from multiple speakers in a broadcast program signal comprising the steps of:
- (a) receiving said audio signal from said broadcast program signal comprising at least a speech portion, wherein said speech portion of said audio signal is not previously processed by a human operator for syntax or context;
(b) processing said speech portion of said audio signal for a speaker without any input from a human operator;
(c) converting said speech portion of said audio signal for said speaker to text without any input from a human operator;
(d) transmitting said text to a first closed caption channel;
(e) translating said text in real time to produce translated text for said speaker;
(f) transmitting said translated text to a second closed caption channel; and
(g) converting said translated text to an audio signal comprising speech generated for said speaker according to said translated text.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method that integrates automated voice recognition technology and speech-to-text technology with automated translation and closed captioning technology to provide translations of “live” or “real-time” television content is disclosed. It converts speech to text, translates the converted text to other languages, and provides captions through a single device that may be installed at the broadcast facility. The device accepts broadcast quality audio, recognizes the speaker'"'"'s voice, converts the audio to text, translates the text, processes the text for multiple caption outputs, and then sends multiple text streams out to caption encoders and/or other devices in the proper format. Because it automates the process, it dramatically reduces the cost and time traditionally required to package television programs for broadcast into foreign or multi-language U.S. markets.
-
Citations
5 Claims
-
1. A method for processing a speech portion of audio signals from multiple speakers in a broadcast program signal comprising the steps of:
-
(a) receiving said audio signal from said broadcast program signal comprising at least a speech portion, wherein said speech portion of said audio signal is not previously processed by a human operator for syntax or context; (b) processing said speech portion of said audio signal for a speaker without any input from a human operator; (c) converting said speech portion of said audio signal for said speaker to text without any input from a human operator; (d) transmitting said text to a first closed caption channel; (e) translating said text in real time to produce translated text for said speaker; (f) transmitting said translated text to a second closed caption channel; and (g) converting said translated text to an audio signal comprising speech generated for said speaker according to said translated text. - View Dependent Claims (2, 3, 4, 5)
-
Specification