CODING OF TRANSITIONAL SPEECH FRAMES FOR LOW-BIT-RATE APPLICATIONS
First Claim
Patent Images
1. A method of processing speech signal frames, said method comprising:
- calculating a first position within a first speech signal frame, the first position being a position of a terminal pitch pulse of the frame with respect to one among the first sample of the frame and the last sample of the frame;
generating a first packet that represents the first speech signal frame and includes the first position;
calculating a second position within a second speech signal frame, the second position being a position of a terminal pitch pulse of the frame with respect to one among the first sample of the frame and the last sample of the frame; and
generating a second packet that represents the second speech signal frame and includes a third position within the second speech signal frame, the third position being a position of said terminal pitch pulse of the frame with respect to the other among the first sample of the frame and the last sample of the frame.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods, and apparatus for low-bit-rate coding of transitional speech frames are disclosed.
94 Citations
57 Claims
-
1. A method of processing speech signal frames, said method comprising:
-
calculating a first position within a first speech signal frame, the first position being a position of a terminal pitch pulse of the frame with respect to one among the first sample of the frame and the last sample of the frame; generating a first packet that represents the first speech signal frame and includes the first position; calculating a second position within a second speech signal frame, the second position being a position of a terminal pitch pulse of the frame with respect to one among the first sample of the frame and the last sample of the frame; and generating a second packet that represents the second speech signal frame and includes a third position within the second speech signal frame, the third position being a position of said terminal pitch pulse of the frame with respect to the other among the first sample of the frame and the last sample of the frame. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An apparatus for processing speech signal frames, said apparatus comprising:
-
means for calculating a first position within a first speech signal frame, the first position being a position of a terminal pitch pulse of the frame with respect to one among the first sample of the frame and the last sample of the frame; means for generating a first packet that represents the first speech signal frame and includes the first position; means for calculating a second position within a second speech signal frame, the second position being a position of a terminal pitch pulse of the frame with respect to one among the first sample of the frame and the last sample of the frame; and means for generating a second packet that represents the second speech signal frame and includes a third position within the second speech signal frame, the third position being a position of said terminal pitch pulse of the frame with respect to the other among the first sample of the frame and the last sample of the frame. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. An apparatus for processing speech signal frames, said apparatus comprising:
-
a pitch pulse position calculator configured to calculate a first position within a first speech signal frame, the first position being a position of a terminal pitch pulse of the frame with respect to one among the first sample of the frame and the last sample of the frame; a packet generator configured to generate a first packet that represents the first speech signal frame and includes the first position; wherein said pitch pulse calculator is configured to calculate a second position within a second speech signal frame, the second position being a position of a terminal pitch pulse of the frame with respect to one among the first sample of the frame and the last sample of the frame; and wherein said packet generator is configured to generate a second packet that represents the second speech signal frame and includes a third position within the second speech signal frame, the third position being a position of said terminal pitch pulse of the frame with respect to the other among the first sample of the frame and the last sample of the frame. - View Dependent Claims (16, 17, 18, 19, 20)
-
-
21. A computer-readable medium comprising instructions which when executed by a processor cause the processor to:
-
calculate a first position within a first speech signal frame, the first position being a position of a terminal pitch pulse of the frame with respect to one among the first sample of the frame and the last sample of the frame; generate a first packet that represents the first speech signal frame and includes the first position; calculate a second position within a second speech signal frame, the second position being a position of a terminal pitch pulse of the frame with respect to one among the first sample of the frame and the last sample of the frame; and generate a second packet that represents the second speech signal frame and includes a third position within the second speech signal frame, the third position being a position of said terminal pitch pulse of the frame with respect to the other among the first sample of the frame and the last sample of the frame. - View Dependent Claims (22, 23, 24, 25, 26)
-
-
27. A method of decoding packets of an encoded speech signal, said method comprising:
-
from a first packet that conforms to a template having a first set of bit positions and a second set of bit positions, the first and second sets being disjoint, extracting a first value from the first set of bit positions; comparing the first value to a mode value; in response to a result of said comparing the first value, arranging a pitch pulse within a first excitation signal according to the first value; from a second packet that conforms to the template, extracting a second value from the first set of bit positions; comparing the second value to the mode value; extracting a third value from the second set of bit positions of the second packet; and in response to a result of said comparing the second value, arranging a pitch pulse within a second excitation signal according to the third value. - View Dependent Claims (28, 29, 30)
-
-
31. A method of encoding a shape of a pitch pulse, said method comprising:
-
estimating a pitch period of a speech signal frame; based on the estimated pitch period, selecting one of a plurality of tables of pulse shape vectors; and based on information from at least one pitch pulse of the speech signal frame, selecting a pulse shape vector in the selected table of pulse shape vectors, wherein the length of each pulse shape vector in the selected table of pulse shape vectors is equal to a first value, and wherein the length of each pulse shape vector in another of the plurality of tables of pulse shape vectors is equal to a second value different than the first value. - View Dependent Claims (32, 33, 34, 35, 36, 37, 38)
-
-
39. An apparatus for encoding a shape of a pitch pulse, said apparatus comprising:
-
means for estimating a pitch period of a speech signal frame; means for selecting, based on the estimated pitch period, one of a plurality of tables of pulse shape vectors; and means for selecting, based on information from at least one pitch pulse of the speech signal frame, a pulse shape vector in the selected table of pulse shape vectors, wherein the length of each pulse shape vector in the selected table of pulse shape vectors is equal to a first value, and wherein the length of each pulse shape vector in another of the plurality of tables of pulse shape vectors is equal to a second value different than the first value. - View Dependent Claims (40, 41, 42, 43)
-
-
44. A computer-readable medium comprising instructions which when executed by a processor cause the processor to:
-
estimate a pitch period of a speech signal frame; select, based on the estimated pitch period, one of a plurality of tables of pulse shape vectors; and select, based on information from at least one pitch pulse of the speech signal frame, a pulse shape vector in the selected table of pulse shape vectors, wherein the length of each pulse shape vector in the selected table of pulse shape vectors is equal to a first value, and wherein the length of each pulse shape vector in another of the plurality of tables of pulse shape vectors is equal to a second value different than the first value. - View Dependent Claims (45, 46, 47, 48)
-
-
49. An apparatus for encoding a shape of a pitch pulse, said apparatus comprising:
-
a pitch period estimator configured to estimate a pitch period of a speech signal frame; a vector table selector configured to select, based on the estimated pitch period, one of a plurality of tables of pulse shape vectors; and a pulse shape vector selector configured to select, based on information from at least one pitch pulse of the speech signal frame, a pulse shape vector in the selected table of pulse shape vectors, wherein the length of each pulse shape vector in the selected table of pulse shape vectors is equal to a first value, and wherein the length of each pulse shape vector in another of the plurality of tables of pulse shape vectors is equal to a second value different than the first value. - View Dependent Claims (50, 51, 52, 53)
-
-
54. A method of decoding a shape of a pitch pulse, said method comprising:
-
extracting an encoded pitch period value from a first packet of an encoded speech signal; based on the encoded pitch period value, selecting one of a plurality of tables of pulse shape vectors; extracting a first index from said first packet; and based on said first index, obtaining a pulse shape vector from the selected table of pulse shape vectors. - View Dependent Claims (55, 56, 57)
-
Specification