Speech transcription tool for efficient speech transcription
First Claim
1. A speech transcription tool comprising:
- control logic configured to play back portions of an audio stream;
an input device configured to receive text from a user defining a transcription of the portions of the audio stream and receive annotation information from the user further defining the text; and
a graphical user interface including a first section configured to display a graphical representation of a waveform corresponding to the audio stream, and a second section configured to display the text and representations of the annotation information for the text.
4 Assignments
0 Petitions
Accused Products
Abstract
A transcription tool [115] includes a graphical user interface [209] that displays the waveform of an input audio signal to a user. The user may define speaker turn segments using the displayed waveform. The graphical user interface further displays a transcription section [302] that includes a textual representation of speech that was transcribed by the user and a graphical representation of annotation information [314] relating to the transcribed text. The user may enter the annotation information on-the-fly while transcribing the text using predefined keyboard shortcut commands or other mechanisms. The graphical user interface may further display a structured representation section [303] that may present the transcribed text as a hierarchical tree structure.
-
Citations
28 Claims
-
1. A speech transcription tool comprising:
-
control logic configured to play back portions of an audio stream;
an input device configured to receive text from a user defining a transcription of the portions of the audio stream and receive annotation information from the user further defining the text; and
a graphical user interface including a first section configured to display a graphical representation of a waveform corresponding to the audio stream, and a second section configured to display the text and representations of the annotation information for the text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method comprising:
-
receiving an audio stream containing speech data;
receiving text from a user defining a transcription of the speech data;
receiving annotation information from the user further defining the text;
displaying the text; and
displaying symbolic representations of the annotation information with the text. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computing device for transcribing an audio file that includes speech, the computing device comprising:
-
an audio output device;
a processor; and
a computer memory coupled to the processor and containing programming instructions that when executed by the processor cause the processor to;
play a current one of a plurality of segments of the audio file through the audio output device, receive transcription information for speech segments of the segments of the audio file played through the audio output device, receive annotation information relating to the transcription information, and display the transcription information in an output section of a graphical user interface, and display the annotation information as graphical icons in the output section of the graphical user interface. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26)
-
-
27. A computer-readable medium containing program instructions for execution by a processor, the program instructions comprising:
-
instructions for obtaining an audio stream containing speech data;
instructions for receiving text from a user that defines a transcription of the speech data;
instructions for receiving annotation information from the user further defining the text;
instructions for presenting the text; and
instructions for providing symbolic representations of the annotation information with the text.
-
-
28. A device comprising:
-
means for receiving an audio stream containing speech data;
means for receiving text from a user defining a transcription of the speech data;
means for receiving annotation information from the user further defining the text;
means for displaying the text; and
means for displaying symbolic representations of the annotation information as graphical icons associated with the text.
-
Specification