System and method for lossy compression of voice recognition models
First Claim
Patent Images
1. A method of voice recognition (VR), comprising:
- recording a plurality of utterances;
extracting features of the plurality of utterances to generate extracted features of the plurality of utterances;
creating a plurality of VR models from the extracted features of the plurality of utterances; and
lossy-compressing the plurality of VR models using A-law compression to quantize information bits of the plurality of VR models.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system that improves voice recognition by improving storage of voice recognition (VR) templates. The improved storage means that more VR models can be stored in memory. The more VR models that are stored in memory, the more robust the VR system and therefore the more accurate the VR system. Lossy compression techniques are used to compress VR models. In one embodiment, A-law compression and A-law expansion are used to compress and expand VR models. In another embodiment, Mu-law compression and Mu-law expansion are used to compress and expand VR models. VR models are compressed during a training process and they are expanded during voice recognition.
34 Citations
16 Claims
-
1. A method of voice recognition (VR), comprising:
-
recording a plurality of utterances;
extracting features of the plurality of utterances to generate extracted features of the plurality of utterances;
creating a plurality of VR models from the extracted features of the plurality of utterances; and
lossy-compressing the plurality of VR models using A-law compression to quantize information bits of the plurality of VR models. - View Dependent Claims (3, 4, 5, 6, 7, 8)
-
-
2. A method of voice recognition (VR), comprising:
-
recording a plurality of utterances;
extracting features of the plurality of utterances to generate extracted features of the plurality of utterances;
creating a plurality of VR models from the extracted features of the plurality of utterances; and
lossy-compressing the plurality of VR models using mu-law compression to quantize information bits of the plurality of VR models. - View Dependent Claims (9, 10, 11, 12)
-
- 13. A voice recognition (VR) system comprising a training module configured to extract features of a plurality of utterances to generate extracted features of the utterances, create a plurality of VR models from the extracted features of the utterances, and lossy-compress the plurality of VR models using A-law compression to quantize information bits of the plurality of VR models.
-
15. A voice recognition (VR) system, comprising:
-
a plurality of lossy-compressed VR models, wherein A-law compression is used to quantize the information bits of the VR models;
a feature extraction module configured to extract features of a test utterance to generate extracted features of a test utterance;
an expansion module configured to expand a lossy-compressed VR model from the plurality of lossy-compressed VR models to generate an expanded VR model; and
a pattern-matching module that matches the extracted features of the test utterance to the expanded VR model to generate a recognition hypothesis.
-
-
16. A voice recognition (VR) training system comprising:
- a feature extraction module configured to extract features of a plurality of utterances and generate a plurality of VR models for the extracted features of the plurality of utterances; and
a compression module configured to lossy-compress the plurality of VR models using mu-law compression to quantize information bits of the plurality of VR models.
- a feature extraction module configured to extract features of a plurality of utterances and generate a plurality of VR models for the extracted features of the plurality of utterances; and
Specification