REAL-TIME TRANSCRIPTION
First Claim
1. A system, comprising a processor and a memory in communication with the processor, the memory storing programming instructions executable by the processor to:
- cause a first chunk of audio data to be played for a first analyst, where the first chunk of audio data represents a segment of a first audio stream;
accept input from the first analyst sufficient to indicate a transcription of the segment of the first audio stream; and
cause the transcription to be displayed for a user in substantially real time relative to the capture of the segment of the first audio stream.
17 Assignments
0 Petitions
Accused Products
Abstract
A computing system accepts audio from one or more sources, parses the audio into chunks, and transcribes the chunks in substantially real time. Some transcription is performed automatically, while other transcription is performed by humans who listen to the audio and enter the words spoken and/or the intent of the caller (such as directions given to the system). The system provides for participants a user interface that is updated in substantially real time with the transcribed text from the audio stream(s). A single audio line can be used for simple transcription, and multiple audio lines are used to provide a real-time transcript of a conference call, deposition, or the like. A pool of analysts creates, checks, and/or corrects transcription, and callers/observers can even assist in the correction process through their respective user interfaces. Ads derived from the transcript are displayed together with the text in substantially real time.
228 Citations
20 Claims
-
1. A system, comprising a processor and a memory in communication with the processor, the memory storing programming instructions executable by the processor to:
-
cause a first chunk of audio data to be played for a first analyst, where the first chunk of audio data represents a segment of a first audio stream; accept input from the first analyst sufficient to indicate a transcription of the segment of the first audio stream; and cause the transcription to be displayed for a user in substantially real time relative to the capture of the segment of the first audio stream. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system comprising a processor and a memory in communication with the processor, the memory storing programming instructions executable by the processor to:
-
play each of a plurality of audio chunks, each for at least one of a plurality of analysts, where the audio chunks were each captured from a single line in a conference call, and together represent speech arriving on all lines of the conference call; accept input from the plurality of analysts, the input indicating a transcript of each of the plurality of chunks, and collectively indicating a transcript of speech on all lines of the conference call; and causing the transcript to be displayed to at least one participant in the conference call, where the display of the transcript of each chunk occurs in substantially real time relative to the capture of that chunk. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification