Methods and systems for correcting transcribed audio files
First Claim
Patent Images
1. A method comprising:
- receiving audio data from an audio data source;
segmenting the audio data into a plurality of audio data segments and transcribing a first one of the audio data segments based on a voice model to generate first text data;
making the first text data available to at least one user over an electronic network;
receiving second corrected text data that is different than the first text data over the electronic network responsive to making the first text data available to the at least one user over the electronic network;
modifying the voice model based on the second corrected text data;
in response to modifying the voice model, performing additional transcription based on the modified voice model to generate third text data representing a portion of the received audio data, said portion of the received audio data including audio data of a second audio data segment of the plurality of audio data segments that is different than the first audio data segment.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and systems for correcting transcribed text. One method includes receiving audio data from one or more audio data sources and transcribing the audio data based on a voice model to generate text data. The method also includes making the text data available to a plurality of users over at least one computer network and receiving corrected text data over the at least one computer network from the plurality of users. In addition, the method can include modifying the voice model based on the corrected text data.
133 Citations
20 Claims
-
1. A method comprising:
-
receiving audio data from an audio data source; segmenting the audio data into a plurality of audio data segments and transcribing a first one of the audio data segments based on a voice model to generate first text data; making the first text data available to at least one user over an electronic network; receiving second corrected text data that is different than the first text data over the electronic network responsive to making the first text data available to the at least one user over the electronic network; modifying the voice model based on the second corrected text data; in response to modifying the voice model, performing additional transcription based on the modified voice model to generate third text data representing a portion of the received audio data, said portion of the received audio data including audio data of a second audio data segment of the plurality of audio data segments that is different than the first audio data segment. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A memory device having instructions stored thereon that, in response to execution by a processing device, cause the processing device to perform operations comprising:
-
segmenting audio data from an audio source into a plurality of audio data segments and transcribing a first one of the audio data segments based on a voice model to generate first text data; making the first text data available to at least one user over an electronic network; receiving second corrected text data that is different than the first text data over the electronic network responsive to making the first text data available to the at least one user over the electronic network; modifying the voice model based on the second corrected text data; and performing additional transcription based on the modified voice model to generate third text data representing a portion of the received audio data, said portion of the received audio data including audio data of a second audio data segment of the plurality of audio data segments that is different than the first audio segment. - View Dependent Claims (12, 13, 14, 15)
-
-
16. An apparatus, comprising:
-
means for segmenting audio data from an audio source into a plurality of audio data segments; means for transcribing a first one of the audio data segments based on a voice model to generate first text data; means for making the first text data available to a plurality of users over an electronic network; means for modifying the voice model based on second corrected text data that is different from the first text data and received responsive to making the first text data available to the plurality of users over the electronic network; and means for transcribing based on the modified voice model to generate third text data representing a portion of the audio data, said portion of the audio data including audio data of a second audio data segment of the plurality of audio data segments that is different than the first audio data segment. - View Dependent Claims (17, 18, 19, 20)
-
Specification