System and method for the secure, real-time, high accuracy conversion of general-quality speech into text
First Claim
1. A system, comprising:
- a receiving element to receive audio segments which are portions of audio streams, the receiving element creating sub-segments from the audio segments;
a mixing element to receive the sub-segments from the audio streams and randomize the order of the sub-segments;
a transmitting element to send the randomized sub-segments to a plurality of transcribers, each of the randomized sub-segments to be transcribed into text by the transcriber which received the randomized sub-segment; and
a text receiving element to receive the transcribed text from each of the transcribers.
3 Assignments
0 Petitions
Accused Products
Abstract
The system is designed to interface with external devices and services, to transcribe audio that may be stored elsewhere such as a wireless phone'"'"'voice mail, or occurring between two or more parties such as a conference call. An audio stream is separated into many audio shreds, each of which has duration of only a few seconds and cannot reveal the context of the conversation. A workforce of geographically distributed transcription agents who transcribe the audio shreds is able to generate transcription in real time, with many agents working in parallel on a single conversation. No one agent (or group of agents) receives a sufficient number of audio shreds to reconstruct the context of any conversation. The use of human transcribers allows the system to overcome limitations typical of computer-based speech recognition and permits accurate transcription of general-quality speech even in acoustically hostile environments.
-
Citations
33 Claims
-
1. A system, comprising:
-
a receiving element to receive audio segments which are portions of audio streams, the receiving element creating sub-segments from the audio segments; a mixing element to receive the sub-segments from the audio streams and randomize the order of the sub-segments; a transmitting element to send the randomized sub-segments to a plurality of transcribers, each of the randomized sub-segments to be transcribed into text by the transcriber which received the randomized sub-segment; and a text receiving element to receive the transcribed text from each of the transcribers. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A method, comprising:
-
receiving audio segments which are portions of audio streams; creating sub-segments from the audio segments; randomizing the order of the sub-segments; sending the randomized sub-segments to a plurality of transcribers, each of the randomized sub-segments being transcribed into text by the transcriber which received the randomized sub-segment; and receiving the transcribed text from each of the transcribers. - View Dependent Claims (24, 25, 26, 27)
-
-
28. A system comprising:
-
a receiving element to create a plurality of audio shreds; a mixing element to randomize the order of the audio shreds; a transmitting element to send the randomized audio shreds to a plurality of transcribers, each of the randomized audio shreds to be transcribed into text by the transcriber which received the randomized audio shred; and a reassembling element to receive text corresponding to the randomized audio shreds and to combine the text to create a text file. - View Dependent Claims (29, 30, 31, 32, 33)
-
Specification