Phoneme recognition and difference signal for speech coding/decoding
Abstract
An analog-to-digital converter (20) forms a digital signal based upon an analog speech signal. A phoneme parser (22) parses the digital signal into at least one phoneme. A phoneme recognizer (24) assigns a symbolic code to each phoneme based upon recognition of the phonemes from a predetermined set. A read-only memory (34) contains a standard waveform representation of each phoneme from the predetermined set. A difference processor (32) forms a difference signal between a user-spoken phoneme waveform and a corresponding waveform from the read-only memory (34). The difference signal is stored in a storage device (40). A multiplexer (30) provides a bit stream signal based upon the symbolic code and the difference signal. A synchronizer (70) extracts the symbolic code and the difference signal from the bit stream. A phoneme generator (76) forms the speech signal based upon the symbolic code and the difference signal.
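As a rough illustration of the difference processor (32) and phoneme generator (76) described above, the sketch below forms a difference signal by sample-wise subtraction and then reverses it. The abstract does not say how the difference is computed, so the subtraction, the equal-length requirement, the toy waveform values, and the names `STANDARD_WAVEFORMS`, `difference_signal`, and `apply_difference` are all assumptions made for this example.

```python
from typing import Dict, List

# Hypothetical stand-in for the standard waveform set held in read-only memory.
# Keys are symbolic codes; values are toy sample lists, not real phoneme data.
STANDARD_WAVEFORMS: Dict[int, List[int]] = {
    0: [0, 10, 20, 10, 0],
    1: [0, -5, -10, -5, 0],
}


def difference_signal(code: int, user_waveform: List[int]) -> List[int]:
    """Encoder side: user-spoken waveform minus the standard waveform.

    Assumption: both waveforms have the same number of samples.
    """
    standard = STANDARD_WAVEFORMS[code]
    if len(standard) != len(user_waveform):
        raise ValueError("waveforms must have the same length in this sketch")
    return [u - s for u, s in zip(user_waveform, standard)]


def apply_difference(code: int, diff: List[int]) -> List[int]:
    """Decoder side: standard waveform plus the difference signal."""
    standard = STANDARD_WAVEFORMS[code]
    return [s + d for s, d in zip(standard, diff)]


user_spoken = [1, 12, 19, 9, 0]
diff = difference_signal(0, user_spoken)
# Adding the difference back to the standard waveform recovers the input.
assert apply_difference(0, diff) == user_spoken
```

Under these assumptions the decoder needs only the symbolic code and the stored difference signal to approximate the speaker's own waveform, which is the division of labor the abstract describes.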
33 Claims
1. A system for encoding a speech signal into a bit stream, the system comprising:
a phoneme parser which parses the speech signal into at least one phoneme;
a phoneme recognizer, coupled to the phoneme parser, which assigns a symbolic code to each of the at least one phoneme based upon recognition of the at least one phoneme from a predetermined phoneme set; and
a difference processor, coupled to the phoneme parser, which forms a difference signal between a user-spoken phoneme waveform and a corresponding phoneme waveform from a standard waveform set;
wherein the difference signal is stored during a "training" mode and transmitted in the bit stream during a "transmit initiate" mode.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9

10. A system for encoding an analog speech signal into a bit stream, the system comprising:
an analog-to-digital converter which forms a digital signal based upon the analog speech signal;
a phoneme parser which parses the digital signal into at least one phoneme;
a phoneme recognizer, coupled to the phoneme parser, which assigns a symbolic code to each of the at least one phoneme based upon recognition of the at least one phoneme from a predetermined phoneme set;
a first storage device which contains a standard waveform representation of each phoneme from the predetermined phoneme set;
a difference processor, coupled to the phoneme parser and to the first storage device, which during a "training" mode and during encoding forms a difference signal between a user-spoken phoneme waveform and a corresponding phoneme waveform from the first storage device;
a second storage device, coupled to the difference processor, in which a representation of the difference signal is stored for use in a header at the initiation of transmission; and
a multiplexer, coupled to the phoneme recognizer and to the second storage device, which provides the bit stream based upon the symbolic code and the representation of the difference signal.

11. A method of encoding a speech signal into a bit stream, the method comprising the steps of:
parsing the speech signal into at least one phoneme;
recognizing the at least one phoneme from a predetermined phoneme set;
assigning a symbolic code to each of the at least one phoneme;
forming during a "training" mode and during encoding a difference signal between a user-spoken phoneme waveform and a corresponding phoneme waveform from a standard waveform set and storing the difference signal in a header for use at initiation of a transmission; and
forming the bit stream based upon the difference signal and the symbolic code of each of the at least one phoneme.
Dependent claims: 12, 13, 14, 15, 16, 17, 18

19. A method of encoding an analog speech signal into a bit stream, the method comprising the steps of:
converting the analog speech signal into a digital signal;
parsing the digital signal into at least one phoneme;
recognizing the at least one phoneme from a predetermined phoneme set;
assigning a symbolic code to each of the at least one phoneme;
forming during a "training" mode and during encoding a difference signal between a user-spoken phoneme waveform and a corresponding phoneme waveform from a standard waveform set;
storing a representation of the difference signal;
transmitting the stored difference signal in a header; and
multiplexing the symbolic code with the representation of the difference signal to form the bit stream.

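The encoding claims (10, 11 and 19) recite a stored difference signal sent in a header at the start of transmission and multiplexed with the symbolic codes. The sketch below shows one way such a bit stream could be laid out. The claims do not define a format, so the byte layout here (a one-byte entry count, then per-entry code, sample count, and signed 16-bit samples, followed by one byte per symbolic code) and the function name `build_bit_stream` are purely hypothetical.

```python
import struct
from typing import Dict, List


def build_bit_stream(stored_differences: Dict[int, List[int]],
                     symbolic_codes: List[int]) -> bytes:
    """Multiplex a header of stored difference signals with symbolic codes.

    Assumed (hypothetical) layout, all big-endian:
      header:  1 byte  number of difference entries
               per entry: 1 byte symbolic code, 1 byte sample count,
                          then that many signed 16-bit samples
      payload: 1 byte per symbolic code, in spoken order
    """
    header = bytearray([len(stored_differences)])
    for code, diff in sorted(stored_differences.items()):
        header += struct.pack(f">BB{len(diff)}h", code, len(diff), *diff)
    payload = bytes(symbolic_codes)  # each code must fit in 0..255
    return bytes(header) + payload


# Difference signal for phoneme code 0, captured earlier (e.g. in training),
# followed by a two-phoneme utterance that uses code 0 twice.
stream = build_bit_stream({0: [1, -1]}, [0, 0])
print(stream.hex())
```

Sending the difference signals once in the header means the payload carries only one small code per phoneme, which is where the bandwidth saving in this scheme would come from.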
20. A system for recreating a speech signal from a bit stream representative of an encoded speech signal, the system comprising:
a synchronizer which extracts at least one symbolic code from the bit stream, wherein each of the at least one symbolic code is representative of a corresponding phoneme from a predetermined phoneme set, the synchronizer further extracting at least one difference signal representative of a difference between a first phoneme waveform and a second phoneme waveform; and
a phoneme generator, coupled to the synchronizer, which forms the speech signal by generating a corresponding phoneme waveform for each of the at least one symbolic code extracted by the synchronizer in dependence upon the at least one difference signal.
Dependent claims: 21, 22, 23, 24, 25

26. A system for recreating a speech signal from a bit stream representative of an encoded speech signal, the system comprising:
a synchronizer which extracts at least one symbolic code from the bit stream, wherein each of the at least one symbolic code is representative of a corresponding phoneme from a predetermined phoneme set, the synchronizer further extracting at least one difference signal representative of a difference between a user-spoken phoneme waveform and a corresponding phoneme waveform from a standard waveform set;
a storage device, coupled to the synchronizer, which stores a representation of the at least one difference signal;
a phoneme generator, coupled to the synchronizer and to the storage device, which forms a digital recreated speech signal by generating a corresponding phoneme waveform from the standard waveform set for each of the at least one symbolic code extracted by the synchronizer, wherein the digital recreated speech signal is modified in dependence upon the at least one difference signal; and
a digital-to-analog converter, coupled to the phoneme generator, which forms an analog recreated speech signal from the digital recreated speech signal.

27. A method of recreating a speech signal from a bit stream representative of an encoded speech signal, the method comprising the steps of:
extracting at least one symbolic code from the bit stream, wherein each of the at least one symbolic code is representative of a corresponding phoneme from a predetermined phoneme set; and
extracting at least one difference signal from the bit stream, wherein the at least one difference signal is representative of a difference between a first phoneme waveform and a second phoneme waveform; and
forming the speech signal by generating a corresponding phoneme waveform for each of the at least one symbolic code in dependence upon the at least one difference signal.
Dependent claims: 28, 29, 30, 31, 32

33. A method of recreating a speech signal from a bit stream representative of an encoded speech signal, the method comprising the steps of:
extracting at least one difference signal from the bit stream, wherein the at least one difference signal is representative of a difference between a user-spoken phoneme waveform and a corresponding phoneme waveform from a standard waveform set;
storing a representation of the at least one difference signal;
extracting at least one symbolic code from the bit stream, wherein each of the at least one symbolic code is representative of a corresponding phoneme from a predetermined phoneme set;
forming a digital speech signal by generating a corresponding phoneme waveform from the standard waveform set for each of the at least one symbolic code;
modifying the digital speech signal in dependence upon the at least one difference signal; and
converting the digital speech signal to an analog speech signal.

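The decoding claims (20 through 33) recite extracting difference signals and symbolic codes from the bit stream, storing the difference signals, and modifying standard waveforms accordingly. The sketch below walks through those steps. It assumes a hypothetical byte layout (a one-byte entry count; per entry a code byte, a sample-count byte, and signed 16-bit big-endian samples; then one byte per symbolic code), sample-wise addition as the modification, and toy waveform values. None of these details come from the claims, and the digital-to-analog conversion step is omitted.

```python
import struct
from typing import Dict, List, Tuple

# Hypothetical standard waveform set shared by encoder and decoder.
STANDARD_WAVEFORMS: Dict[int, List[int]] = {
    0: [0, 10, 20, 10, 0],
    1: [0, -5, -10, -5, 0],
}


def extract(stream: bytes) -> Tuple[Dict[int, List[int]], List[int]]:
    """Synchronizer step: split the stream into difference signals and codes."""
    count = stream[0]
    pos = 1
    differences: Dict[int, List[int]] = {}
    for _ in range(count):
        code, length = struct.unpack_from(">BB", stream, pos)
        pos += 2
        differences[code] = list(struct.unpack_from(f">{length}h", stream, pos))
        pos += 2 * length  # each sample occupies two bytes
    return differences, list(stream[pos:])


def recreate_speech(stream: bytes) -> List[int]:
    """Phoneme generator step: standard waveform plus any stored difference."""
    differences, codes = extract(stream)
    speech: List[int] = []
    for code in codes:
        standard = STANDARD_WAVEFORMS[code]
        # A phoneme with no stored difference is played back unmodified.
        diff = differences.get(code, [0] * len(standard))
        speech.extend(s + d for s, d in zip(standard, diff))
    return speech


# One header entry (code 0, five samples) followed by the codes 0 and 1.
example = bytes([1, 0, 5]) + struct.pack(">5h", 1, 2, -1, -1, 0) + bytes([0, 1])
print(recreate_speech(example))
```

In this example phoneme 0 is rendered with the speaker-specific correction while phoneme 1, which has no header entry, falls back to the standard waveform.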
Specification