Speech encoding apparatus and speech encoding and decoding apparatus
First Claim
1. A speech encoding apparatus for dividing an input speech into spectrum envelope information and excitation signal information and for encoding said excitation signal information by the frame, said speech encoding apparatus comprising:
- target speech generation means for generating from said input speech a target speech vector of a vector length corresponding to a delay parameter;
an adaptive codebook for generating from previously generated excitation signals an adaptive vector of said vector length corresponding to said delay parameter;
adaptive code search means for evaluating the distortion of a synthesis vector obtained from said adaptive vector with respect to said target speech vector so as to search for an adaptive vector conducive to the least distortion; and
frame excitation generation means for generating an excitation signal of a frame length from said adaptive vector conducive to the least distortion,wherein said vector length of said target speech vector and said vector length of said adaptive vector are less than said frame length.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech encoding apparatus capable of averting the deterioration of synthesis speech quality in encoding the input speech and of generating a high-quality synthesis output speech through small quantities of computation. The apparatus includes a target speech generation part for generating from the input speech a target speech vector of a vector length corresponding to a delay parameter; an adaptive codebook for generating from previously generated excitation signals an adaptive vector of the vector length corresponding to the delay parameter; an adaptive code search part for evaluating the distortion of a synthesis vector obtained from the adaptive vector with respect to the target speech vector so as to search for the adaptive vector conducive to the least distortion; and a frame code generation part for generating an excitation signal of a frame length from the adaptive vector conducive to the least distortion.
-
Citations
47 Claims
-
1. A speech encoding apparatus for dividing an input speech into spectrum envelope information and excitation signal information and for encoding said excitation signal information by the frame, said speech encoding apparatus comprising:
-
target speech generation means for generating from said input speech a target speech vector of a vector length corresponding to a delay parameter; an adaptive codebook for generating from previously generated excitation signals an adaptive vector of said vector length corresponding to said delay parameter; adaptive code search means for evaluating the distortion of a synthesis vector obtained from said adaptive vector with respect to said target speech vector so as to search for an adaptive vector conducive to the least distortion; and frame excitation generation means for generating an excitation signal of a frame length from said adaptive vector conducive to the least distortion, wherein said vector length of said target speech vector and said vector length of said adaptive vector are less than said frame length. - View Dependent Claims (2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
3. A speech encoding apparatus for dividing an input speech into spectrum envelope information and excitation signal information and for encoding said excitation signal information by the frame, said speech encoding apparatus comprising:
-
target speech generation means for generating from said input speech a target speech vector of a vector length corresponding to a delay parameter; a random codebook for generating a random vector of said vector length corresponding to said delay parameter; random code search means for evaluating the distortion of a synthesis vector obtained from said random vector with respect to said target speech vector so as to search for a random vector conducive to the least distortion; and frame excitation generation means for generating an excitation signal of a frame length from said random vector conducive to the least distortion, wherein said vector length of said target speech vector and said vector length of said random vector are less than said length. - View Dependent Claims (4)
-
-
21. A speech encoding and decoding apparatus for dividing an input speech into spectrum envelope information and excitation signal information, encoding said excitation signal information by the frame, and decoding the encoded excitation signal information so as to generate an output speech, the encoding side of said speech encoding and decoding apparatus comprising:
-
target speech generation means for generating from said input speech a target speech vector of a vector length corresponding to a delay parameter; an adaptive codebook for generating from previously generated excitation signals an adaptive vector of said vector length corresponding to said delay parameter; adaptive code search means for evaluating the distortion of a synthesis vector obtained from said adaptive vector with respect to said target speech vector so as to search for an adaptive vector conducive to the least distortion; and frame excitation generation means for generating an excitation signal of a frame length from said adaptive vector conducive to the least distortion; the decoding side of said speech encoding and decoding apparatus comprising; an adaptive codebook for generating said adaptive vector of said vector length corresponding to said delay parameter; and frame excitation generation means for generating said excitation signal of said frame length from said adaptive vector, wherein said vector length of said target speech vector and said vector length of said adaptive vector are less than said frame length. - View Dependent Claims (22)
-
-
23. A speech encoding and decoding apparatus for dividing an input speech into spectrum envelope information and excitation signal information, encoding said excitation signal information by the frame, and decoding the encoded excitation signal information so as to generate an output speech, the encoding side of said speech encoding and decoding apparatus comprising:
-
target speech generation means for generating from said input speech a target speech vector of a vector length corresponding to a delay parameter; a random codebook for generating a random vector of said vector length corresponding to said delay parameter; random code search means for evaluating the distortion of a synthesis vector obtained from said random vector with respect to said target speech vector so as to search for a random vector conducive to the least distortion; and frame excitation generation means for generating an excitation signal of a frame length from said random vector conducive to the least distortion; the decoding side of said speech encoding and decoding apparatus comprising; a random codebook for generating said random vector of said vector length corresponding to said delay parameter; and frame excitation generation means for generating said excitation signal of said frame length from said random vector, wherein said vector length of said target speech vector and said vector length of said random vector are less than said frame length.
-
-
24. A speech encoding apparatus for dividing an input speech into spectrum envelope information and excitation signal information and for encoding said excitation signal information by frame, said speech encoding apparatus comprising:
-
an adaptive codebook for generating, from previously generated excitation signals of a frame length, an adaptive vector of a vector length corresponding to a delay parameter; and adaptive code search means for evaluating the distortion of a synthesis vector from said adaptive vector to determine an adaptive vector conducive to the least distortion of a vector length corresponding to a delay parameter conducive to the least distortion, wherein said vector length of said adaptive vector is less than said frame length, and said vector length of said adaptive vector conductive to the least distortion is less than said frame length.
-
-
25. A speech encoding apparatus for dividing an input speech into spectrum envelope information and excitation signal information and for encoding said excitation signal information by the frame, said speech encoding apparatus comprising:
-
target speech generation means for generating from said input speech a target speech vector of a vector length corresponding to a delay parameter; an adaptive codebook for generating from previously generated excitation signals an adaptive vector of said vector length corresponding to said delay parameter; adaptive code search means for evaluating the distortion of a synthesis vector obtained from said adaptive vector with respect to said target speech vector so as to search for an adaptive vector conducive to the least distortion; and frame excitation generation means for generating an excitation signal of a frame length from said adaptive vector conducive to the least distortion, wherein said target speech generation means divides an input speech in a frame into portions each having said vector length corresponding to said delay parameter, and computes a weighted mean of the input speech portions each having said vector length so as to generate said target speech vector. - View Dependent Claims (26, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43)
-
-
27. A speech encoding apparatus for dividing an input speech into spectrum envelope information and excitation signal information and for encoding said excitation signal information by the frame, said speech encoding apparatus comprising:
-
target speech generation means for generating from said input speech a target speech vector of a vector length corresponding to a delay parameter; a random codebook for generating a random vector of said vector length corresponding to said delay parameter; random code search means for evaluating the distortion of a synthesis vector obtained from said random vector with respect to said target speech vector so as to search for the random vector conducive to the least distortion; and frame excitation generation means for generating an excitation signal of a frame length from said random vector conducive to the least distortion, wherein said target speech generation means divides an input speech in a frame into portions each having said vector length corresponding to said delay parameter, and computes a weighted mean of the input speech portions each having said vector length so as to generate said target speech vector. - View Dependent Claims (28)
-
-
44. A speech encoding and decoding apparatus for dividing an input speech into spectrum envelope information and excitation signal information, encoding said excitation signal information by the frame, and decoding the encoded excitation signal information so as to generate an output speech, the encoding side of said speech encoding and decoding apparatus comprising:
-
target speech generation means for generating from said input speech a target speech vector of a vector length corresponding to a delay parameter; an adaptive codebook for generating from previously generated excitation signals an adaptive vector of said vector length corresponding to said delay parameter; adaptive code search means for evaluating the distortion of a synthesis vector obtained from said adaptive vector with respect to said target speech vector so as to search for an adaptive vector conducive to the least distortion; and frame excitation generation means for generating an excitation signal of a frame length from said adaptive vector conducive to the least distortion; wherein said target speech generation means divides an input speech in a frame into portions each having said vector length corresponding to said delay parameter, and computes a weighted mean of the input speech portions each having said vector length so as to generate said target speech vector; the decoding side of said speech encoding and decoding apparatus comprising; an adaptive codebook for generating said adaptive vector of said vector length corresponding to said delay parameter; and frame excitation generation means for generating said excitation signal of said frame length from said adaptive vector. - View Dependent Claims (45)
-
-
46. A speech encoding and decoding apparatus for dividing an input speech into spectrum envelope information and excitation signal information, encoding said excitation signal information by the frame, and decoding the encoded excitation signal information so as to generate an output speech, the encoding side of said speech encoding and decoding apparatus comprising:
-
target speech generation means for generating from said input speech a target speech vector of a vector length corresponding to a delay parameter; a random codebook for generating a random vector of said vector length corresponding to said delay parameter; random code search means for evaluating the distortion of a synthesis vector obtained from said random vector with respect to said target speech vector so as to search for a random vector conducive to the least distortion; and frame excitation generation means for generating an excitation signal of a frame length from said random vector conducive to the least distortion; wherein said target speech generation means divides an input speech in a frame into portions each having said vector length corresponding to said delay parameter, and computes a weighted mean of the input speech portions each having said vector length so as to generate said target speech vector; the decoding side of said speech encoding and decoding apparatus comprising; a random codebook for generating said random vector of said vector length corresponding to said delay parameter; and frame excitation generation means for generating said excitation signal of said frame length from said random vector.
-
-
47. A speech encoding apparatus for dividing an input speech into spectrum envelope information and excitation signal information and for encoding said excitation signal information by frame, said speech encoding apparatus comprising:
-
an adaptive codebook for generating, from previously generated excitation signals of a frame length, an adaptive vector of a vector length corresponding to a delay parameter; and adaptive code search means for evaluating the distortion of a synthesis vector from said adaptive vector to determine an adaptive vector conducive to the least distortion, of a vector length corresponding to a delay parameter conducive to the least distortion, wherein said target speech generation means divides an input speech in a frame into portions each leaving said vector length corresponding to said delay parameter, and computes a weighted mean of the input speech portions each having said vector length so as to generate said target speech vector.
-
Specification