Method for generating a vector codebook, method and device for compressing data, and distributed speech recognition system
First Claim
1. A method, implemented by a computer, for generating a vector codebook providing low data compression computational effort starting from a vector codebook comprising a set of N reference vectors each comprising Q features, wherein N and Q are positive integers higher than 1, comprising the steps of:
- a) sorting, by the computer, the set of N reference vectors in ascending or descending order with respect to values associated with a predetermined feature of the set of N reference vectors;
b) subdividing, by the computer, the set of sorted reference vectors in sub-sets;
c) associating, by the computer, with each of said sub-sets a respective threshold value for the predetermined feature;
d) sorting, by the computer, the reference vectors of each of said sub-sets in ascending or descending order with respect to values associated with a further predetermined feature of the set of N reference vectors;
e) subdividing, by the computer, each of said sub-sets of sorted reference vectors in further sub-sets;
f) associating, by the computer, with each of said further sub-sets a respective further threshold value for the further predetermined feature; and
g) extending at least one of the sub-sets obtained in step b) with at least one reference vector of an adjacent sub-set,wherein steps d) and e) being carried out on the sub-sets extended according to step g),wherein the set of N reference vectors sorted in step a) are arranged in a table form including N rows and Q columns, each row representing a reference vector having Q features.
1 Assignment
0 Petitions
Accused Products
Abstract
A method for compressing data, the data being represented by an input vector having Q features, wherein Q is an integer higher than 1, including the steps of 1) providing a vector codebook of sub-sets of indexed Q-feature reference vectors and threshold values associated with the sub-sets for a prefixed feature; 2) identifying a sub-set of reference vectors among the sub-sets by progressively comparing the value of a feature of the input vector which corresponds to the prefixed feature, with the threshold values associated with the sub-sets; and 3) identifying the reference vector which, within the sub-set identified in step 2), provides the lowest distortion with respect to the input vector.
-
Citations
23 Claims
-
1. A method, implemented by a computer, for generating a vector codebook providing low data compression computational effort starting from a vector codebook comprising a set of N reference vectors each comprising Q features, wherein N and Q are positive integers higher than 1, comprising the steps of:
-
a) sorting, by the computer, the set of N reference vectors in ascending or descending order with respect to values associated with a predetermined feature of the set of N reference vectors; b) subdividing, by the computer, the set of sorted reference vectors in sub-sets; c) associating, by the computer, with each of said sub-sets a respective threshold value for the predetermined feature; d) sorting, by the computer, the reference vectors of each of said sub-sets in ascending or descending order with respect to values associated with a further predetermined feature of the set of N reference vectors; e) subdividing, by the computer, each of said sub-sets of sorted reference vectors in further sub-sets; f) associating, by the computer, with each of said further sub-sets a respective further threshold value for the further predetermined feature; and g) extending at least one of the sub-sets obtained in step b) with at least one reference vector of an adjacent sub-set, wherein steps d) and e) being carried out on the sub-sets extended according to step g), wherein the set of N reference vectors sorted in step a) are arranged in a table form including N rows and Q columns, each row representing a reference vector having Q features. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 21)
-
-
12. A method, implemented by a computer, for compressing data, said data being represented by an input vector having Q features, wherein Q is an integer higher than 1, comprising the steps of:
-
1) providing a vector codebook, as generated by a method for generating a vector codebook providing low data compression computational effort starting from a vector codebook comprising a set of N reference vectors each comprising Q features, wherein N and Q are positive integers higher than 1, comprising the steps of; a) sorting, by the computer, the set of N reference vectors in ascending or descending order with respect to values associated with a predetermined feature of the set of N reference vectors; b) subdividing, by the computer, the set of sorted reference vectors in sub-sets; c) associating, by the computer, with each of said sub-sets a respective threshold value for the predetermined feature; d) sorting, by the computer, the reference vectors of each of said sub-sets in ascending or descending order with respect to values associated with a further predetermined feature of the set of N reference vectors; e) subdividing, by the computer, each of said sub-sets of sorted reference vectors in further sub-sets; f) associating, by the computer, with each of said further sub-sets a respective further threshold value for the further predetermined feature; and g) extending at least one of the sub-sets obtained in step b) with at least one reference vector of an adjacent sub-set, wherein steps d) and e) being carried out on the sub-sets extended according to step g), wherein the set of N reference vectors sorted in step a) are arranged in a table form including N rows and Q columns, each row representing a reference vector having Q features; 2) identifying, by the computer, a sub-set of reference vectors among said sub-sets by progressively comparing a value of a feature of the input vector, which corresponds to said predetermined feature, with the threshold values associated with said sub-sets; and 3) identifying, by the computer, the reference vector which, within the sub-set identified in step
2) provides the lowest distortion with respect to the input vector. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
-
22. A distributed speech recognition system comprising:
-
a user device suitable for data compression comprising; a feature extraction unit for processing a digital input data so as to provide Q-feature vectors, wherein Q is an integer higher than 1; a memory unit for storing at least one vector codebook; a data compression unit for identifying, for each input Q-feature vector provided by the feature extraction unit, a reference vector which is in the vector codebook stored in the memory unit that provides the lowest distortion with respect to the input feature vector; a codebook processing unit to carry out the steps of method for generating said at least one vector codebook providing low data compression computational effort starting from a vector codebook comprising a set of N reference vectors each comprising Q features, wherein N and Q are positive integers higher than 1, comprising the steps of; a) sorting the set of N reference vectors in ascending or descending order with respect to the values of a predetermined feature of the set of N reference vectors; b) subdividing the set of sorted reference vectors in sub-sets; c) associating with each of said sub-sets a respective threshold value for the predetermined feature; d) sorting the reference vectors of each of said sub-sets in ascending or descending order with respect to values associated with a further predetermined feature of the set of N reference vectors; e) subdividing each of said sub-sets of sorted reference vectors in further sub-sets; f) associating with each of said further sub-sets a respective further threshold value for the further predetermined feature; and g) extending at least one of the sub-sets obtained in step b) with at least one reference vector of an adjacent sub-set, wherein steps d) and e) being carried out on the sub-sets extended according to step g), and to store the generated vector codebook in the memory unit; a recognition device for reconstructing and/or recognizing the digital input data; and a transmission channel for data transmission between the user device and the recognition device. - View Dependent Claims (23)
-
Specification