EFFICIENT CONVERSION OF VOICE MESSAGES INTO TEXT
First Claim
1. A method for transcribing verbal messages into text, comprising the steps of:
- (a) receiving verbal messages over a network and queuing the verbal messages in a queue for processing into text;
(b) automatically processing at least portions of successive verbal messages from the queue with online processors using an automated speech recognition (ASR) program to produce corresponding text;
(c) assigning whole verbal messages or segments of the verbal messages that have been automatically processed to selected workbench stations for further editing and transcription by operators at the workbench stations;
(d) enabling the operators at the workbench stations to which the whole or the segments of the verbal messages have been assigned to listen to the verbal messages, correct errors in the text that was produced by the automatic processing, and transcribe portions of the verbal messages that have not been automatically processed by the ASR program, producing final text messages or segments of final text messages corresponding to the verbal messages that were in the queue; and
(e) assembling segments of the text messages produced by the operators at the workbench stations from the segments of the verbal messages that were processed and using whole text messages corresponding to the whole verbal messages that were processed, producing final output text messages.
21 Assignments
0 Petitions
Accused Products
Abstract
A system and method for efficiently transcribing verbal messages transmitted over the Internet (or other network) into text. The verbal messages are initially checked to ensure that they are in a valid format and include a return network address, and if so, are processed either as whole verbal messages or split into segments. These whole verbal messages and segments are processed by an automated speech recognition (ASR) program, which produces automatically recognized text. The automatically recognized text messages or segments are assigned to selected workbenches for manual editing and transcription, producing edited text. The segments of edited text are reassembled to produce whole edited text messages, undergo post processing to correct minor errors and output as an email, an SMS message, a file, or an input to a program. The automatically recognized text and manual edits thereof are returned as feedback to the ASR program to improve its accuracy.
-
Citations
33 Claims
-
1. A method for transcribing verbal messages into text, comprising the steps of:
-
(a) receiving verbal messages over a network and queuing the verbal messages in a queue for processing into text; (b) automatically processing at least portions of successive verbal messages from the queue with online processors using an automated speech recognition (ASR) program to produce corresponding text; (c) assigning whole verbal messages or segments of the verbal messages that have been automatically processed to selected workbench stations for further editing and transcription by operators at the workbench stations; (d) enabling the operators at the workbench stations to which the whole or the segments of the verbal messages have been assigned to listen to the verbal messages, correct errors in the text that was produced by the automatic processing, and transcribe portions of the verbal messages that have not been automatically processed by the ASR program, producing final text messages or segments of final text messages corresponding to the verbal messages that were in the queue; and (e) assembling segments of the text messages produced by the operators at the workbench stations from the segments of the verbal messages that were processed and using whole text messages corresponding to the whole verbal messages that were processed, producing final output text messages. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 31)
-
-
18. A system for efficiently transcribing verbal messages that are provided to the system over a network, to produce corresponding text, comprising:
-
(a) a plurality of processors coupled to the network, for receiving and processing verbal messages to be transcribed to text; (b) one or more of the plurality of processors processing the verbal messages using an automatic speech recognition (ASR) program to produce automatically recognized text; (c) one or more of the plurality of processors on corresponding one or more workbench stations each providing a graphical interface on a display to enable operators using the one or more workbench stations to review and edit the automatically recognized text, and to further transcribe the verbal messages to produce edited text; and (d) one or more of the plurality of processors reassembling text segments comprising the edited text, producing final output text messages that can be conveyed to an end user. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 32, 33)
-
Specification