Speech packetizing based linguistic processing to improve voice quality
Abstract
An embodiment of the present invention is a technique for establishing a telephone communication using a packet switching communications network. Digitized voice information is received from a speaker. The voice information is placed into a payload of a first packet. The first packet is transmitted to a recipient. A significance to voice quality of the voice information contained in the first packet is calculated. One or more additional packets containing the voice information are transmitted to the recipient if the significance of the voice information is above a threshold level. To calculate the significance, one or more phonemes contained in the voice information are identified. For each identified phoneme, a value representing the significance to voice quality of that phoneme is retrieved from memory. The measure of significance for the voice information is set to the maximum of the values for all of the phonemes contained in the voice information.
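The significance calculation and the retransmission decision summarized in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the table `PHONEME_SIGNIFICANCE`, its values, the default for unknown phonemes, the threshold of 0.7, and the function names are all assumptions made for the example. The source states only that a value is retrieved from memory per phoneme, that the maximum is taken, and that additional packets are sent when the result is above a threshold.

```python
# Hypothetical per-phoneme significance values. The source says only that
# such values are retrieved from memory, not what they are.
PHONEME_SIGNIFICANCE = {"s": 0.9, "t": 0.8, "m": 0.5, "ah": 0.3}
DEFAULT_SIGNIFICANCE = 0.0  # assumption: unknown phonemes count as insignificant


def significance(phonemes):
    """Return the maximum stored value over the phonemes in one payload."""
    values = (PHONEME_SIGNIFICANCE.get(p, DEFAULT_SIGNIFICANCE) for p in phonemes)
    return max(values, default=DEFAULT_SIGNIFICANCE)


def packets_to_send(payload, phonemes, threshold=0.7, extra_copies=1):
    """Return the list of packets to transmit for one payload.

    The first packet is always sent. Extra copies carrying the same voice
    information are added only when the significance is above the threshold.
    """
    packets = [payload]
    if significance(phonemes) > threshold:
        packets.extend([payload] * extra_copies)
    return packets
```

With these assumed values, a payload containing the phonemes "ah" and "s" is sent twice because its maximum value is above the threshold, while one containing only "ah" and "m" is sent once.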
24 Claims
1. A method of establishing a telephone communication using a packet switching communications network, comprising:
digitizing voice information received from a speaker;
placing the voice information into a payload of a first packet;
transmitting the first packet to a recipient;
calculating a significance to voice quality of the voice information contained in the first packet; and
transmitting one or more additional packets to the recipient containing the voice information if the significance of the voice information is above a threshold level;
wherein calculating the significance to voice quality of the voice information comprises:
identifying one or more phonemes contained in the voice information;
retrieving a value from memory for each identified phoneme representing the significance to voice quality of that phoneme; and
setting the measure of significance for the voice information to the maximum of the values for all of the phonemes contained in the voice information.
10. A computing device comprising a processor for determining a significance to voice quality of voice information contained in a first packet and for transmitting one or more additional packets containing the voice information if the significance of the voice information is above a threshold level;
wherein said processor is capable of:
identifying one or more phonemes contained in the voice information;
retrieving a value from memory for each identified phoneme representing the significance to voice quality of that phoneme; and
setting the measure of significance for the voice information to the maximum of the values for all of the phonemes contained in the voice information.
a network interface for transmitting and receiving packets; and
a microprocessor for receiving said first packet from said network interface, for determining the significance to voice quality of the voice information contained in a packet, and for transmitting through said network interface one or more additional packets containing the voice information if the significance of the voice information is above a threshold level.
18. The computing device of claim 17, further including a digital signal co-processor for assisting in determining the significance to voice quality of the voice information contained in a packet.
19. The computing device of claim 10, wherein said processor comprises:
a speech recognition system for identifying one or more linguistic units of the voice information;
a speech information significance evaluator for evaluating the significance of the identified one or more linguistic units to voice quality;
a packet retransmission decision node for generating a control signal if said significance is above said threshold; and
a packet transmission control for transmitting one or more additional packets in response to said control signal.
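The four elements recited in claim 19 can be pictured as stages wired together in one object. The sketch below is illustrative only: the class `RedundantSender`, its field names, and `fake_recognizer` are hypothetical, the recognizer is a stand-in that reads space-separated labels from the payload rather than performing speech recognition, and the evaluator takes the maximum value as recited in parent claim 10.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


def fake_recognizer(payload: bytes) -> List[str]:
    """Stand-in for the speech recognition system (assumption for the demo):
    treats the payload as space-separated linguistic unit labels."""
    return payload.decode("ascii").split()


@dataclass
class RedundantSender:
    recognize: Callable[[bytes], List[str]]  # speech recognition system
    values: Dict[str, float]                 # table read by the significance evaluator
    threshold: float                         # level used by the decision node
    send: Callable[[bytes], None]            # underlying packet send primitive
    extra_copies: int = 1                    # assumption: one redundant copy

    def evaluate(self, units: List[str]) -> float:
        """Significance evaluator: maximum stored value over the units."""
        return max((self.values.get(u, 0.0) for u in units), default=0.0)

    def decide(self, significance: float) -> bool:
        """Retransmission decision node: the returned flag is the control signal."""
        return significance > self.threshold

    def transmit(self, payload: bytes) -> None:
        """Transmission control: send once, then resend if the signal is set."""
        self.send(payload)
        if self.decide(self.evaluate(self.recognize(payload))):
            for _ in range(self.extra_copies):
                self.send(payload)
```

Passing the recognizer and the send primitive in as callables keeps the decision logic testable without a network or a real recognizer; for example, `send` can be the `append` method of a list.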
20. The computing device of claim 19, wherein said speech recognition system comprises:
a spectral analyzer for identifying frequency responses of said voice information;
a vector quantization table for storing a list of codewords associated with prototypical frequency responses;
a codeword designator for selecting optimal codewords from said list of codewords whose frequency response best matches said frequency response of said voice information; and
a recognizer engine for generating said one or more linguistic units from said optimal codewords.
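The spectral analyzer and codeword designator of claim 20 can be illustrated with a nearest-prototype lookup. This is a sketch under stated assumptions: the claim says only that the selected codeword's frequency response "best matches" that of the voice information, so the squared Euclidean distance, the naive DFT, the function names, and the table contents used below are all hypothetical. The recognizer engine that turns codewords into linguistic units is not shown.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 of a real-valued frame."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def best_codeword(spectrum, table):
    """Return the codeword whose prototype spectrum is closest to `spectrum`.

    `table` maps codeword -> prototype spectrum of the same length as
    `spectrum`. Squared Euclidean distance is an assumed matching criterion.
    """
    def distance(prototype):
        return sum((a - b) ** 2 for a, b in zip(spectrum, prototype))

    return min(table, key=lambda codeword: distance(table[codeword]))


# Hypothetical three-entry vector quantization table for 4-sample frames.
VQ_TABLE = {
    "cw_low": [2.0, 0.0, 0.0],
    "cw_mid": [0.0, 2.0, 0.0],
    "cw_high": [0.0, 0.0, 2.0],
}
```

For the frame `[1, 0, -1, 0]`, all of the energy falls in the middle bin, so `best_codeword(magnitude_spectrum([1, 0, -1, 0]), VQ_TABLE)` selects `"cw_mid"`.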
21. The computing device of claim 19, wherein said one or more linguistic units comprises one or more phonemes.
22. A computer readable medium comprising a software program including a first routine for calculating the significance to voice quality of voice information contained in a first packet;
and a second routine for transmitting one or more additional packets to the recipient containing the voice information if the significance of the voice information is above a threshold level;
wherein said first routine comprises the following sub-routines:
a first sub-routine for identifying one or more phonemes contained in the voice information;
a second sub-routine for retrieving a value from memory for each identified phoneme representing the significance to voice quality of that phoneme; and
a third sub-routine for setting the measure of significance for the voice information to the maximum of the values for all of the phonemes contained in the voice information.
Specification