System and Method for a High Performance Audio Codec
First Claim
1. A system for high performance audio codec comprising:
- a CELP-based codec;
- an ASR engine; and
- a text comparator.
Abstract
A system for a high performance audio codec provides higher voice quality and higher recognition accuracy from an ASR engine at an increased data rate and computational power. Embodiments include those having a CELP-based codec, an ASR engine, a text comparator, an encoder, a decoder, an LPC computation and formant analysis module, a dual stage data rate determination module, a VQ of LSP coefficients module, a pitch synthesis and optimal pitch parameter search module, and an excitation codebook parameter search module. A method for a high performance audio codec includes three stages. An ASR engine yields transcribed text from each of an uncompressed reference signal and a decompressed signal that has passed through an encoder. The transcribed text is compared with the original text to determine word error rates in an iterative process whereby both voice quality and recognition accuracy are optimized.
51 Claims
1. A system for high performance audio codec comprising:
- a CELP-based codec;
- an ASR engine; and
- a text comparator.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19
20. A system for high performance audio codec including an encoder and a decoder and further comprising:
- an LPC computation and formant analysis module;
- a dual stage data rate determination module;
- an LPC to LSP conversion module;
- a VQ of LSP coefficients module;
- an interpolation and LSP to LPC conversion module;
- a pitch synthesis and optimal pitch parameter search module;
- an excitation codebook parameter search module; and
- a data packing module.
Dependent claims: 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41
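The claim names an LPC computation module without saying how the coefficients are obtained. As an illustration only, the following minimal sketch uses the standard autocorrelation method with the Levinson-Durbin recursion, which is common in CELP-family coders; the function names `autocorrelation` and `levinson_durbin` are hypothetical and are not taken from the patent.

```python
def autocorrelation(frame, max_lag):
    """Autocorrelation r[0..max_lag] of one speech frame (list of floats)."""
    n = len(frame)
    return [
        sum(frame[i] * frame[i + lag] for i in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def levinson_durbin(r, order):
    """Solve the LPC normal equations by the Levinson-Durbin recursion.

    Returns (a, err), where a[0] == 1.0 and the prediction-error filter is
    A(z) = 1 + a[1] z^-1 + ... + a[order] z^-order, and err is the final
    prediction-error energy. This is the textbook method, assumed here;
    the patent does not state which method its module uses.
    """
    a = [1.0] + [0.0] * order
    err = float(r[0])
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # degenerate (silent) frame: stop early
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient for this stage
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a, err


# Usage: autocorrelation values of an idealized first-order process.
coeffs, residual_energy = levinson_durbin([1.0, 0.5, 0.25], order=2)
```

In a full encoder, the resulting coefficients would then feed the LPC to LSP conversion and vector quantization modules listed in the claim; those stages are not sketched here.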
42. A method for high performance audio Codec comprising the steps of:
For Stage 1:
- input speech as an uncompressed reference signal is sent to an ASR engine, bypassing the audio Codec, whereby the ASR engine yields transcribed text from the uncompressed reference signal;
- the transcribed text from the uncompressed reference signal is also sent to the text comparator, which compares the transcribed text from the uncompressed reference signal received from the ASR engine with the original text in order to determine a percent word error rate, % WER REF, with respect to the uncompressed reference signal;
For Stage 2:
- input speech is sent to an encoder of the audio Codec as an uncompressed reference signal;
- the encoder yields compressed speech;
- the compressed speech from the encoder is sent to a decoder yielding a decoded signal in the form of a decompressed reference signal;
- the decompressed reference signal is sent to an ASR engine yielding transcribed text from the decompressed reference signal;
- the transcribed text from the decompressed reference signal is sent to a text comparator, which compares the transcribed text from the decompressed reference signal received from the ASR engine with the original text in order to determine a percent word error rate, % WER DEC, with respect to the decompressed signal;
For Stage 3:
- a Δ WER is computed as a function of the % WER REF and the % WER DEC.
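The text comparator and Stage 3 of claim 42 can be illustrated with a short sketch. It assumes the conventional word-level edit-distance definition of word error rate, and it assumes that Δ WER is the simple difference % WER DEC minus % WER REF; the claim says only that Δ WER is "a function of" the two rates, so that choice is hypothetical. The function names and sample sentences are also illustrative and do not come from the patent.

```python
def word_edit_distance(reference_words, hypothesis_words):
    """Minimum substitutions + deletions + insertions between word lists."""
    prev = list(range(len(hypothesis_words) + 1))
    for i, ref_word in enumerate(reference_words, 1):
        cur = [i]
        for j, hyp_word in enumerate(hypothesis_words, 1):
            cost = 0 if ref_word == hyp_word else 1
            cur.append(min(prev[j] + 1,          # deletion
                           cur[j - 1] + 1,       # insertion
                           prev[j - 1] + cost))  # match or substitution
        prev = cur
    return prev[-1]


def percent_wer(original_text, transcribed_text):
    """Text comparator: percent word error rate against the original text."""
    ref = original_text.lower().split()
    hyp = transcribed_text.lower().split()
    if not ref:
        raise ValueError("original text must contain at least one word")
    return 100.0 * word_edit_distance(ref, hyp) / len(ref)


def delta_wer(wer_ref, wer_dec):
    """Stage 3, assuming the function is a plain difference (assumption)."""
    return wer_dec - wer_ref


# Hypothetical transcripts standing in for ASR engine output.
original = "the quick brown fox jumps"
from_uncompressed = "the quick brown fox jumps"   # Stage 1 transcript
from_decompressed = "the quick brown box jumps"   # Stage 2 transcript

wer_ref = percent_wer(original, from_uncompressed)
wer_dec = percent_wer(original, from_decompressed)
degradation = delta_wer(wer_ref, wer_dec)
```

Under this assumed definition, a Δ WER near zero would indicate that the codec adds few recognition errors beyond those the ASR engine already makes on the uncompressed signal.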
- 44. The method for high performance audio Codec of claim 43 further comprising the uncompressed reference signal being a pulse code modulated reference signal, PCM REF.
Specification