Speech compression method and apparatus
First Claim
1. A method for encoding speech, the method comprising:
- processing an input speech signal using an encoder, resulting in a compressed encoder representation of the input speech signal; and
if a speech recognizer identifies, in the input speech signal, a corresponding dictionary speech element that approximates the input speech signal,determining, with an electronic device, a compressed recognizer representation of the corresponding dictionary speech element,calculating, with the electronic device, one or more differences between the compressed encoder representation and the compressed recognizer representation, andcompiling, with the electronic device, compressed speech information that includes representations of the one or more differences,wherein the encoder and the speech recognizer are implemented with the electronic device.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for encoding speech includes processing an input speech signal using an encoder, resulting in a compressed encoder representation of the input speech signal. The method also includes, if a speech recognizer identifies, in the input speech signal, a corresponding dictionary speech element that approximates the input speech signal, determining, with an electronic device, a compressed recognizer representation of the corresponding dictionary speech element, calculating, with the electronic device, one or more differences between the compressed encoder representation and the compressed recognizer representation, and compiling, with the electronic device, compressed speech information that includes representations of the one or more differences. The encoder and the speech recognizer are implemented with the electronic device.
-
Citations
27 Claims
-
1. A method for encoding speech, the method comprising:
-
processing an input speech signal using an encoder, resulting in a compressed encoder representation of the input speech signal; and if a speech recognizer identifies, in the input speech signal, a corresponding dictionary speech element that approximates the input speech signal, determining, with an electronic device, a compressed recognizer representation of the corresponding dictionary speech element, calculating, with the electronic device, one or more differences between the compressed encoder representation and the compressed recognizer representation, and compiling, with the electronic device, compressed speech information that includes representations of the one or more differences, wherein the encoder and the speech recognizer are implemented with the electronic device.
-
-
2. A method for encoding an input speech signal, the method comprising:
-
processing the input speech signal using a first encoder configured (i) to compress the input speech signal by removing natural redundant information in the input speech signal, and (ii) to generate an encoded representation of the input speech signal, wherein the first encoder is implemented with an electronic device; processing the input speech signal using a second encoder configured to compress the input speech signal by recognizing known speech elements in the input speech signal, the known speech elements stored in a memory associated with the second encoder, wherein the second encoder is implemented with the electronic device; when the second encoder identifies a known speech element that approximates the input speech signal, determining, with the electronic device, an encoded representation of the known speech element, calculating, with the electronic device, one or more differences between (i) the encoded representation of the input speech signal generated by the first encoder, and (ii) the encoded representation of the known speech element generated by the second encoder, and compiling, with the electronic device, compressed speech information that includes (i) representations of the one or more differences, and (ii) an indication of the known speech element; and when the second encoder does not identify a corresponding known speech element, compiling, with the electronic device, the compressed speech information to include the encoded representation of the input speech signal generated by the first encoder. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. An apparatus, comprising:
-
a first speech encoder configured (i) to compress an input speech signal by removing natural redundant information in the input speech signal, and (ii) to generate an encoded representation of the input speech signal; a memory to store known speech elements; a second speech encoder coupled to the memory, the second speech encoder configured to recognize, in the input speech signal, known speech elements from the memory, and, when a known speech element that approximates the input speech signal is identified, determine an encoded representation of the known speech element; a difference encoder configured to, when the second encoder identifies a known speech element that approximates the input speech signal, calculate one or more differences between (i) the encoded representation of the input speech signal generated by the first encoder, and (ii) the encoded representation of the known speech element generated by the second encoder, and compile compressed speech information that includes (i) representations of the one or more differences, and (ii) an indication of the known speech element; and a transmitter configured (i) to transmit, when the second speech encoder identifies the corresponding known speech element, the compressed speech information that includes representations of the one or more differences, and (ii) to transmit, when the second speech encoder does not identify a known speech element that approximates the input speech signal, the encoded representation of the input speech signal generated by the first speech encoder. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A tangible, non-transitory computer readable storage medium storing instructions that, when executed by a processor, cause the processor to:
-
process an input speech signal using a first encoder configured (i) to compress the input speech signal by removing natural redundant information in the input speech signal, and (ii) to generate an encoded representation of the input speech signal; process the input speech signal using a second encoder configured to compress the input speech signal by recognizing known speech elements in the input speech signal, the known speech elements stored in a memory associated with the second encoder; when the second encoder identifies a known speech element that approximates the input speech signal, determine an encoded representation of the known speech element, calculate one or more differences between (i) the encoded representation of the input speech signal generated by the first encoder, and (ii) the encoded representation of the known speech element generated by the second encoder, and compile compressed speech information that includes (i) representations of the one or more differences, and (ii) an indication of the known speech element; and when the second encoder does not identify a corresponding known speech element, compile the compressed speech information to include the encoded representation of the input speech signal generated by the first encoder. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27)
-
Specification