Frame erasure concealment technique for a bitstream-based feature extractor
First Claim
Patent Images
1. A method comprising:
- receiving, at a speech processing system, a speech signal representing audible speech;
measuring, via a processor of the speech processing system, a distance between first line spectrum pair coefficients of a first frame and second line spectrum pair coefficients of a second frame representative of the speech signal in a bitstream, wherein the first frame and the second frame are adjacent frames;
comparing, via the processor of the speech processing system, the distance to a steady-state threshold distance, wherein the steady-state threshold distance identifies a steady-state region;
when the distance is one of less than or equal to the steady-state threshold distance, deleting one of the first line spectrum pair coefficients and the second line spectrum pair coefficients from the bitstream to yield a modified bitstream that represents the speech signal; and
performing automatic speech recognition of the speech signal by processing, via the processor of the speech processing system, the modified bitstream.
6 Assignments
0 Petitions
Accused Products
Abstract
A frame erasure concealment technique for a bitstream-based feature extractors in a speech recognition system particularly suited for use in a wireless communication system operates to “delete” each frame in which an erasure is declared. The deletions thus reduce the length of the observation sequence, but have been found to provide for sufficient speech recognition based on both single word and “string” tests of the deletion technique.
16 Citations
17 Claims
-
1. A method comprising:
-
receiving, at a speech processing system, a speech signal representing audible speech; measuring, via a processor of the speech processing system, a distance between first line spectrum pair coefficients of a first frame and second line spectrum pair coefficients of a second frame representative of the speech signal in a bitstream, wherein the first frame and the second frame are adjacent frames; comparing, via the processor of the speech processing system, the distance to a steady-state threshold distance, wherein the steady-state threshold distance identifies a steady-state region; when the distance is one of less than or equal to the steady-state threshold distance, deleting one of the first line spectrum pair coefficients and the second line spectrum pair coefficients from the bitstream to yield a modified bitstream that represents the speech signal; and performing automatic speech recognition of the speech signal by processing, via the processor of the speech processing system, the modified bitstream. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A speech processing system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising; receiving a speech signal representing audible speech; measuring a distance between first line spectrum pair coefficients of a first frame and second line spectrum pair coefficients of a second frame representative of the speech signal in a bitstream, wherein the first frame and the second frame are adjacent frames; comparing the distance to a steady-state threshold distance, wherein the steady-state threshold distance identifies a steady-state region; when the distance is one of less than or equal to the steady-state threshold distance, deleting one of the first line spectrum pair coefficients and the second line spectrum pair coefficients from the bitstream to yield a modified bitstream that represents the speech signal; and performing automatic speech recognition of the speech signal by processing the modified bitstream. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer-readable storage device having instructions stored which, when executed by a speech processing computing device, cause the speech processing computing device to perform operations comprising:
-
receiving a speech signal representing audible speech; measuring a distance between first line spectrum pair coefficients of a first frame and second line spectrum pair coefficients of a second frame representative of the speech signal in a bitstream, wherein the first frame and the second frame are adjacent frames; comparing the distance to a steady-state threshold distance, wherein the steady-state threshold distance identifies a steady-state region; when the distance is one of less than or equal to the steady-state threshold distance, deleting one of the first line spectrum pair coefficients and the second line spectrum pair coefficients from the bitstream to yield a modified bitstream that represents the speech signal; and performing automatic speech recognition of the speech signal by processing the modified bitstream. - View Dependent Claims (16, 17)
-
Specification