Speech compression method and apparatus
First Claim
1. A method for encoding speech comprising:
- processing an input speech signal using an encoder, resulting in a compressed encoder representation of the input speech signal,if a speech recognizer identifies a corresponding dictionary speech element, which approximates the input speech signal,determining a compressed recognizer representation of the corresponding dictionary speech element,calculating one or more differences between the compressed encoder representation and the compressed recognizer representation,compiling compressed speech information that includes representations of the one or more differences; and
the method further comprising, if the speech recognizer does not identify a corresponding dictionary speech element, compiling the compressed speech information to include the compressed encoder representation of the input speech signal, and not to include the one or more differences.
4 Assignments
0 Petitions
Accused Products
Abstract
A system for encoding speech includes a speech encoder (106, FIG. 1), a speech recognizer (110), and a difference encoder (108). When the speech recognizer (110) recognizes a word, phoneme or feature within an input speech signal (122), the difference encoder (108) calculates the differences between speech parameters (140, 142) derived by the speech encoder (106) and speech parameters (146, 148) derived by the speech recognizer (110). The difference encoder (108) quantizes the differences (128), which replace corresponding encoder-derived parameters to be transmitted over a channel (130). In one embodiment, the difference encoder representation (128) of the speech parameters consumes fewer bits than the encoder-derived representation (124). Accordingly, the resulting bandwidth consumed by a single channel can be decreased.
24 Citations
30 Claims
-
1. A method for encoding speech comprising:
-
processing an input speech signal using an encoder, resulting in a compressed encoder representation of the input speech signal, if a speech recognizer identifies a corresponding dictionary speech element, which approximates the input speech signal, determining a compressed recognizer representation of the corresponding dictionary speech element, calculating one or more differences between the compressed encoder representation and the compressed recognizer representation, compiling compressed speech information that includes representations of the one or more differences; and the method further comprising, if the speech recognizer does not identify a corresponding dictionary speech element, compiling the compressed speech information to include the compressed encoder representation of the input speech signal, and not to include the one or more differences. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. An apparatus comprising:
-
speech encoder means for processing an input speech signal, resulting in a compressed encoder representation of the input speech signal; speech recognizer means for processing the input speech signal; and difference encoder means, responsive to the speech recognizer means, for determining a compressed recognizer representation of a corresponding dictionary speech element that approximates the input speech signal when the speech recognizer means identifies the corresponding dictionary speech element, calculating one or more differences between the compressed encoder representation and the compressed recognizer representation, and compiling compressed speech information that includes representations of the one or more differences; and a transmitter to transmit the compressed speech information that includes representations of the one or more differences when the speech recognizer means identifies the corresponding dictionary speech element and to transmit the compressed encoder representation of the input speech signal when the speech recognizer means does not identify a dictionary speech element that approximates the input speech signal. - View Dependent Claims (14, 15, 16, 17, 18)
-
-
19. An apparatus comprising:
-
a speech encoder, which processes an input speech signal, resulting in a compressed encoder representation of the input speech signal; a speech recognizer, which processes the input speech signal; and a difference encoder, which determines a compressed recognizer representation of a corresponding dictionary speech element that approximates the input speech signal when the speech recognizer identifies the corresponding dictionary speech element, calculates one or more differences between the compressed encoder representation and the compressed recognizer representation, and compiles compressed speech information that includes representations of the one or more differences; and a transmitter, which transmits the compressed speech information that includes representations of the one or more differences when the speech recognizer identifies the corresponding dictionary speech element and transmits the compressed encoder representation of the input speech signal when the speech recognizer does not identify a dictionary speech element that approximates the input speech signal. - View Dependent Claims (20, 21, 22, 23, 24)
-
-
25. A system comprising:
-
a communication channel operably connected to a first communication device and a second communication device; the first communication device, which includes a speech encoder, which processes an input speech signal, resulting in a compressed encoder representation of the input speech signal, a speech recognizer, and a difference encoder, which determines a compressed recognizer representation of a corresponding dictionary speech element that approximates the input speech signal when the speech recognizer identifies the corresponding dictionary speech element, calculates one or more differences between the compressed encoder representation and the compressed recognizer representation, and compiles compressed speech information that includes representations of the one or more differences; wherein the first communication device further includes a transmitter, which transmits the compressed speech information that includes representations of the one or more differences when the speech recognizer identifies the corresponding dictionary speech element and transmits the compressed encoder representation of the input speech signal when the speech recognizer does not identify a dictionary speech element that approximates the input speech signal; and wherein the system further comprises the second communication device, which constructs an output speech signal based on the compressed speech information, and information associated with the corresponding dictionary speech element, and the compressed encoder information. - View Dependent Claims (26, 27)
-
-
28. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform a method for encoding speech, the method comprising:
-
processing an input speech signal using an encoder, resulting in a compressed encoder representation of the input speech signal; processing the input speech signal using a speech recognizer; if the speech recognizer identifies a corresponding dictionary speech element, which approximates the input speech signal, determining a compressed recognizer representation of the corresponding dictionary speech element; calculating one or more differences between the compressed encoder representation and the compressed recognizer representation; and compiling compressed speech information that includes representations of the one or more differences, and the method further comprising, if the speech recognizer does not identify a corresponding dictionary speech element, compiling the compressed speech information to include the compressed encoder representation of the input speech signal, and not to include the one or more differences. - View Dependent Claims (29, 30)
-
Specification