Method and apparatus for enhancing voice intelligibility in voice-over-IP network applications with late arriving packets
First Claim
1. A method for playing out speech received as a sequence of encoded speech packets over a packet-based communications network, the method comprising the steps of:
- determining that a given speech packet has not been received prior to a time when said given speech packet is to be decoded for playout;
replacing said given speech packet with replacement speech data with use of a packet loss concealment technique;
playing out said replacement speech data in place of said given speech packet;
receiving said given speech packet at a time subsequent to said playing out of said replacement speech data;
modifying said given speech packet which has been received to generate a time scale modified version thereof, said time scale modified version of said given speech packet comprising speech having a reduced time length relative to said given speech packet; and
playing out said time scale modified version of said given speech packet after said replacement speech packet has been played out.
7 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for enhancing voice intelligibility for network communications of speech such as, for example, VoIP (Voice-Over-Internet-Protocol), in the presence of packets which arrive too late for normal playout. When a late speech packet is received by a speech decoder, that packet and, if necessary, one or more additional packets subsequent thereto, are played out over a shorter than normal duration so that the decoder can “catch up” with the encoder. Since a voice frame is usually decoded in several sub-frames—typically two or three—this shortened playout may be achieved, for example, by skipping one sub-frame from each frame to be shortened.
30 Citations
20 Claims
-
1. A method for playing out speech received as a sequence of encoded speech packets over a packet-based communications network, the method comprising the steps of:
-
determining that a given speech packet has not been received prior to a time when said given speech packet is to be decoded for playout;
replacing said given speech packet with replacement speech data with use of a packet loss concealment technique;
playing out said replacement speech data in place of said given speech packet;
receiving said given speech packet at a time subsequent to said playing out of said replacement speech data;
modifying said given speech packet which has been received to generate a time scale modified version thereof, said time scale modified version of said given speech packet comprising speech having a reduced time length relative to said given speech packet; and
playing out said time scale modified version of said given speech packet after said replacement speech packet has been played out. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An apparatus for playing out speech received as a sequence of encoded speech packets over a packet-based communications network, the apparatus comprising a processor adapted to:
-
determine that a given speech packet has not been received prior to a time when said given speech packet is to be decoded for playout;
replace said given speech packet with replacement speech data with use of a packet loss concealment technique;
play out said replacement speech data in place of said given speech packet;
receive said given speech packet at a time subsequent to said playing out of said replacement speech data;
modify said given speech packet which has been received to generate a time scale modified version thereof, said time scale modified version of said given speech packet comprising speech having a reduced time length relative to said given speech packet; and
play out said time scale modified version of said given speech packet after said replacement speech packet has been played out. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
Specification