System and method of automated evaluation of transcription quality
First Claim
Patent Images
1. A method of automated evaluation of a transcription quality, the method comprising:
- obtaining audio data;
segmenting the audio data into a plurality of utterances with a voice activity detector operating on a computer processor, wherein each of the plurality of utterances is separated by non-speech segments in the audio data;
transcribing the plurality of utterances into at least one word lattice with a large vocabulary continuous speech recognition system operating on the processor;
applying, by the processor, a minimum Bayes risk decoder to the at least one word lattice to create at least one confusion network representing the at least one word lattice as a plurality sequential word bins and ε
-bins; and
calculating, by the processor, at least one conformity ratio from the least one confusion network, wherein the at least one conformity ratio is an automated indication of transcription quality.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods automatedly evaluate a transcription quality. Audio data is obtained. The audio data is segmented into a plurality of utterances with a voice activity detector operating on a computer processor. The plurality of utterances are transcribed into at least one word lattice with a large vocabulary continuous speech recognition system operating on the processor. A minimum Bayes risk decoder is applied to the at least one word lattice to create at least one confusion network. At least conformity ratio is calculated from the at least one confusion network.
27 Citations
20 Claims
-
1. A method of automated evaluation of a transcription quality, the method comprising:
-
obtaining audio data; segmenting the audio data into a plurality of utterances with a voice activity detector operating on a computer processor, wherein each of the plurality of utterances is separated by non-speech segments in the audio data; transcribing the plurality of utterances into at least one word lattice with a large vocabulary continuous speech recognition system operating on the processor; applying, by the processor, a minimum Bayes risk decoder to the at least one word lattice to create at least one confusion network representing the at least one word lattice as a plurality sequential word bins and ε
-bins; andcalculating, by the processor, at least one conformity ratio from the least one confusion network, wherein the at least one conformity ratio is an automated indication of transcription quality. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system for automated evaluation of transcription quality, the system comprising:
-
an audio data source upon which a plurality of audio data files are stored; a processor that receives the plurality of audio data files, segments the audio data files into a plurality of utterances and applies at least one transcription model to the plurality of utterances to transcribe the plurality of utterances into at least one word lattice, wherein each of the plurality of utterances is separated by non-speech segments in the audio data; and a non-transient computer readable medium communicatively connected to the processor and programmed with computer readable code that when executed by the processor causes the processor to; apply a minimum Bayes risk decoder to the at least one word lattice to create at least one confusion network representing the at least one word lattice as a plurality of sequential word bins and ε
-bins; andcalculate at least one conformity ratio from the at least one confusion network, wherein the at least one conformity ratio is an automated indication of transcription quality. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification