Speech manipulation for continuous speech playback over a packet network
Abstract
In a speech communications network, continuous play of audio packets is achieved using a jitter buffer in a receiver. Audio packets are stored in the jitter buffer before decoding the audio packets into an audible output. When the level of stored audio packets approaches the full capacity of the jitter buffer, the rate at which the audio packets are played out of the jitter buffer is increased, signaling a compression operation in the decoder. When the level of stored audio packets approaches an empty level of the jitter buffer, the rate at which the audio packets are played out of the jitter buffer is reduced, signaling an expansion operation in the decoder. Audio packets are not modified when the level of stored audio packets is within a predetermined range. A speed controller is provided to instruct the decoder to decode the audio packets according to a compressed, expanded, or normal audio packet status.
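The decision made by the speed controller can be illustrated with a minimal Python sketch. It is not taken from the patent: the names `PlaybackMode` and `select_mode` and the threshold fractions `low` and `high` are hypothetical, chosen only to show how a fill level might map to a compressed, expanded, or normal status.

```python
from enum import Enum


class PlaybackMode(Enum):
    COMPRESSED = "compressed"  # buffer nearly full: play out faster
    NORMAL = "normal"          # buffer within range: packets unmodified
    EXPANDED = "expanded"      # buffer nearly empty: play out slower


def select_mode(stored: int, capacity: int,
                low: float = 0.25, high: float = 0.75) -> PlaybackMode:
    """Map the jitter buffer fill level to a decoder status.

    `low` and `high` are assumed threshold fractions, not values
    stated in the source.
    """
    if capacity <= 0:
        raise ValueError("capacity must be positive")
    fill = stored / capacity
    if fill >= high:
        return PlaybackMode.COMPRESSED
    if fill <= low:
        return PlaybackMode.EXPANDED
    return PlaybackMode.NORMAL
```

With these assumed thresholds, a 10-packet buffer holding 9 packets selects the compressed status, one holding 1 packet selects the expanded status, and one holding 5 packets leaves playback unchanged.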
20 Claims
1. A method of controlling playback of audio signals over a communication network, the method comprising:
receiving a plurality of audio packets;
storing temporarily the plurality of audio packets;
executing playback of the plurality of audio packets;
compressing the plurality of audio packets to accelerate the playback of the plurality of audio packets when a rate of receipt of audio packets is greater than a predetermined upper replay rate; and
decompressing the plurality of audio packets to decelerate the playback of the plurality of audio packets when the rate of receipt of the plurality of audio packets is less than a predetermined lower replay rate.
decoding the plurality of audio packets.
3. The method of claim 1, the accelerating step further comprising:
compressing an audio packet.
4. The method of claim 3, wherein the compressing step reduces the number of the plurality of audio packets.
5. The method of claim 1, the accelerating step further comprising:
compressing a speech segment represented by an audio packet.
6. The method of claim 1, the decelerating step further comprising:
expanding an audio packet.
7. The method of claim 6, wherein the expanding step increases the number of the plurality of audio packets.
8. The method of claim 1, the decelerating step further comprising:
expanding a speech segment represented by an audio packet.
9. The method of claim 1, further comprising the step of:
detecting the rate of receipt of the plurality of audio packets.
10. The method of claim 9, the plurality of audio packets being stored in a jitter buffer, the detecting step comprising the step of:
determining a location of a jitter buffer using an address pointer of the jitter buffer.
11. The method of claim 10, wherein the jitter buffer address pointer points to an address of the jitter buffer corresponding to a relatively full level of the jitter buffer when the rate of receipt of the audio packets is higher than the predetermined replay rate and the jitter buffer address pointer points to an address of the jitter buffer corresponding to a relatively empty level of the jitter buffer when the rate of receipt of the audio packets is lower than the predetermined replay rate.
12. A receiver configured for continuous playback of audio packets, the receiver comprising:
a jitter buffer to store a plurality of audio packets;
a jitter buffer controller coupled to the jitter buffer to monitor capacity of the jitter buffer, the jitter buffer controller accelerating playback of the plurality of audio packets out of the jitter buffer when a rate of receipt of the plurality of audio packets is greater than a predetermined upper replay rate and decelerating the playback of the plurality of audio packets out of the jitter buffer when a rate of receipt of the plurality of audio packets is lower than a predetermined lower replay rate; and
a decoder to decode the stored audio packets, the decoder compressing an audio packet when a rate of receipt of the plurality of audio packets is greater than a predetermined upper replay rate, the decoder expanding an audio packet when the rate of receipt of the plurality of audio packets is lower than the predetermined lower replay rate.
17. A communications network configured for continuous playback of asynchronously transmitted audio packets, comprising:
a transmitter to transmit an audio packet;
a receiver to receive an audio packet, comprising:
a jitter buffer for storing received audio packets;
a jitter buffer controller coupled to the jitter buffer to monitor capacity of the jitter buffer, the jitter buffer controller accelerating playback of the plurality of audio packets out of the jitter buffer when a rate of receipt of the plurality of audio packets is greater than a predetermined upper replay rate and decelerating the playback of the plurality of audio packets out of the jitter buffer when a rate of receipt of the plurality of audio packets is less than a predetermined lower replay rate;
a decoder to decode the stored audio packets, the decoder compressing a speech segment represented by an audio packet when a rate of receipt of the plurality of audio packets is greater than a predetermined upper replay rate, the decoder expanding a speech segment represented by an audio packet when the rate of receipt of the plurality of audio packets is lower than the predetermined lower replay rate;
a converter for converting the audio packets into an audible signal; and
a playback device for replaying the audible signal at the predetermined replay rate.
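Claims 10 and 11 tie the detection step to a jitter buffer address pointer. One way this could work is sketched below in Python, under the assumption (not stated in the claims) that the jitter buffer is a circular buffer with separate write and read pointers. The function names and the convention that equal pointers mean an empty buffer are hypothetical.

```python
def fill_level(write_ptr: int, read_ptr: int, size: int) -> int:
    """Occupied slots in a circular buffer of `size` slots.

    Assumes equal pointers mean "empty", so at most size - 1 slots
    can be occupied. This convention is an assumption of the sketch.
    """
    if size <= 0:
        raise ValueError("size must be positive")
    # Modulo handles the case where the write pointer has wrapped
    # around and is numerically below the read pointer.
    return (write_ptr - read_ptr) % size


def fill_fraction(write_ptr: int, read_ptr: int, size: int) -> float:
    """Fill level as a fraction of the usable capacity (size - 1)."""
    if size < 2:
        raise ValueError("size must be at least 2")
    return fill_level(write_ptr, read_ptr, size) / (size - 1)
```

A fraction near 1 corresponds to the relatively full level that claim 11 associates with packets arriving faster than the replay rate, and a fraction near 0 to the relatively empty level associated with packets arriving slower.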
Specification