Distributed speech system with buffer flushing on barge-in
First Claim
Patent Images
1. A device comprising:
- a jitter buffer; and
a processor coupled with the jitter buffer, wherein the processor is adapted to receive a stream of audio packets in the jitter buffer,play out the audio packets from the jitter buffer;
receive barge-in sound while playing out the audio packets;
encode and transmitting the barge in sound; and
flush without playing out at least some of the yet unplayed audio packets from the jitter buffer in response to transmitting the barge-in sound.
1 Assignment
0 Petitions
Accused Products
Abstract
A distributed automatic speech recognition and text to speech system has components separated by a network. The components may be a voice interface device with jitter buffer, a voice browser and a text to speech engine. A barge-in detection feature may reside in any one of these components, to implement kill on barge in. The system, softwares and methods of the invention operate to flush the jitter buffer of the voice interface device when a barge-in is detected. Any packets that had been received are therefore not played out.
-
Citations
84 Claims
-
1. A device comprising:
-
a jitter buffer; and a processor coupled with the jitter buffer, wherein the processor is adapted to receive a stream of audio packets in the jitter buffer, play out the audio packets from the jitter buffer; receive barge-in sound while playing out the audio packets; encode and transmitting the barge in sound; and flush without playing out at least some of the yet unplayed audio packets from the jitter buffer in response to transmitting the barge-in sound. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A device comprising:
-
a network interface for coupling to a network; and a processor coupled with the network interface, wherein the processor is adapted to detect a barge-in event; responsive to the barge-in event, generate a purge packet; and transmit the purge packet through a network to a voice interface device having a jitter buffer, wherein the purge packet is for flushing the jitter buffer upon being received. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21)
-
-
22. A device comprising:
-
means for receiving a stream of audio packets in a jitter buffer; means for playing out the audio packets from the jitter buffer; means for receiving barge-in sound while playing out the audio packets; means for encoding and transmitting the barge in sound; and means for flushing without playing out at least some of the yet unplayed audio packets from the jitter buffer in response to transmitting the barge-in sound. - View Dependent Claims (23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33)
-
-
34. A device comprising;
-
means for detecting a barge-in event; means for responsive to the barge-in event, generating a purge packet; and means for transmitting the purge packet through a network to a voice interface device having a jitter buffer, wherein the purge packet is for flushing the jitter buffer upon being received. - View Dependent Claims (35, 36, 37, 38, 39, 40, 41, 42)
-
-
43. An article comprising:
- a storage medium, the storage medium having instructions stored thereon, wherein when the instructions are executed by at least one device, they result in;
receiving a stream of audio packets in a jitter buffer; playing out the audio packets from the jitter buffer; receiving barge-in sound while playing out the audio packets; encoding and transmitting the barge in sound; and flushing without playing out at least some of the yet unplayed audio packets from the jitter buffer in response to transmitting the barge-in sound. - View Dependent Claims (44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54)
- a storage medium, the storage medium having instructions stored thereon, wherein when the instructions are executed by at least one device, they result in;
-
55. An article comprising:
- a storage medium, the storage medium having instructions stored thereon, wherein when the instructions are executed by at least one device, they result in;
detecting a barge-in event; responsive to the barge-in event, generating a purge packet; and transmitting the purge packet through a network to a voice interface device having a jitter buffer, wherein the purge packet is for flushing the jitter buffer upon being received. - View Dependent Claims (56, 57, 58, 59, 60, 61, 62, 63)
- a storage medium, the storage medium having instructions stored thereon, wherein when the instructions are executed by at least one device, they result in;
-
64. A method comprising:
-
receiving a stream of audio packets in a jitter buffer; playing out the audio packets from the jitter buffer; receiving bargain sound while playing out the audio packets; encoding and transmitting the barge in sound; and flushing without playing out at least some of the yet unplayed audio packets from the jitter buffer in response to transmitting the barge-in sound. - View Dependent Claims (65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75)
-
-
76. A method comprising:
-
detecting a barge-in event; responsive to the barge-in event, generating a purge packet; and transmitting the purge packet through a network to a voice interface device having a jitter buffer, wherein the purge packet is for flushing the jitter buffer upon being received. - View Dependent Claims (77, 78, 79, 80, 81, 82, 83, 84)
-
Specification