Training a transcription system
First Claim
Patent Images
1. A method comprising:
- accessing recorded voice data of a user from one or more sources, the recorded voice data comprising a plurality of voice samples;
accessing a transcript of the recorded voice data, the transcript comprising text representing one or more words of each voice sample;
identifying an origin of a voice sample, the origin being a device used to input the voice sample;
determining that the origin is associated with the user;
determining that the voice sample matches a voice profile of the user, wherein the voice profile comprises voice signal characteristics to identify the voice of the user and user speech information to convert the voice sample to corresponding text and;
providing electronic mail and a text message generated by the user to identify one or more words commonly used by the user, the transcript, and the recorded voice data to a transcription system to generate an updated voice profile for the user;
determining portions of the transcript that are transcribed at a low confidence of accuracy;
flagging the portions of the transcript that are transcribed at a low confidence of accuracy; and
communicating the flagged portions of the transcript to a transcript refiner.
1 Assignment
0 Petitions
Accused Products
Abstract
According to certain embodiments, training a transcription system includes accessing recorded voice data of a user from one or more sources. The recorded voice data comprises voice samples. A transcript of the recorded voice data is accessed. The transcript comprises text representing one or more words of each voice sample. The transcript and the recorded voice data are provided to a transcription system to generate a voice profile for the user. The voice profile comprises information used to convert a voice sample to corresponding text.
-
Citations
15 Claims
-
1. A method comprising:
-
accessing recorded voice data of a user from one or more sources, the recorded voice data comprising a plurality of voice samples; accessing a transcript of the recorded voice data, the transcript comprising text representing one or more words of each voice sample; identifying an origin of a voice sample, the origin being a device used to input the voice sample; determining that the origin is associated with the user; determining that the voice sample matches a voice profile of the user, wherein the voice profile comprises voice signal characteristics to identify the voice of the user and user speech information to convert the voice sample to corresponding text and; providing electronic mail and a text message generated by the user to identify one or more words commonly used by the user, the transcript, and the recorded voice data to a transcription system to generate an updated voice profile for the user; determining portions of the transcript that are transcribed at a low confidence of accuracy; flagging the portions of the transcript that are transcribed at a low confidence of accuracy; and communicating the flagged portions of the transcript to a transcript refiner. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. One or more non-transitory computer readable media storing one or more instructions, when executed by one or more processors, configured to:
-
access recorded voice data of a user from one or more sources, the recorded voice data comprising a plurality of voice samples; access a transcript of the recorded voice data, the transcript comprising text representing one or more words of each voice sample; identify an origin of a voice sample, the origin being a device used to input the voice sample; determine that the origin is associated with the user; determine that the voice sample matches a voice profile of the user, wherein the voice profile comprises voice signal characteristics to identify a voice of the user and user speech information to convert the voice sample to corresponding text; provide electronic mail and a text message generated by the user to identify one or more words commonly used by the user, the transcript, and the recorded voice data to a transcription system to generate an updated voice profile for the user; determine portions of the transcript that are transcribed at a low confidence of accuracy; flag the portions of the transcript that are transcribed at a low confidence of accuracy; and communicate the flagged portions of the transcript to a transcript refiner. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. An apparatus comprising:
-
a memory configured to store computer executable instructions; and one or more processors coupled to the memory, the processors configured, when executing the instructions, to; access recorded voice data of a user from one or more sources, the recorded voice data comprising a plurality of voice samples; access a transcript of the recorded voice data, the transcript comprising text representing one or more words of each voice sample; identify an origin of a voice sample, the origin being a device used to input the voice sample; determine that the origin is associated with the user; determine that the voice sample matches a voice profile of the user, wherein the voice profile comprises voice signal characteristics to identify a voice of the user and user speech information to convert the voice sample to corresponding text; provide electronic mail and a text message generated by the user to identify one or more words commonly used by the user, the transcript, and the recorded voice data to a transcription system to generate an updated voice profile for the user; determine portions of the transcript that are transcribed at a low confidence of accuracy; and flag the portions of the transcript that are transcribed at a low confidence of accuracy; and communicate the flagged portions of the transcript to a transcript refiner. - View Dependent Claims (14, 15)
-
Specification