Method for talk/listen determination and multipoint conferencing system using such method
First Claim
1. A method for determining a talk/listen state using a dynamic threshold, comprising:
- receiving an audio sample representing sound measured during a sample time interval;
generating an audio level from the audio sample, the audio level representing an average power level of the audio sample;
setting a dynamic threshold by processing the audio level to set and maintain a dynamic level and using the dynamic level to determine a value for the dynamic threshold;
comparing the audio level to the dynamic threshold;
determining a talk/listen state to be a talking state if the audio level is above the dynamic threshold and to be a listening state if the audio level is below the dynamic threshold; and
repeating the previous steps for each audio sample in a sequential stream of received audio samples such that the dynamic threshold is maintained and used to determine the talk/listen state.
7 Assignments
0 Petitions
Accused Products
Abstract
A method for determining a talk/listen state using a dynamic threshold is provided that can be implemented in a multipoint control unit (12). An audio sample is received and represents sound measured during a sample time interval. An audio level is generated from the audio sample The audio level represents an average power level of the audio sample. A dynamic threshold is set by processing the audio level. The audio level is compared to the dynamic threshold. A talk/listen state is determined to be a talking state if the audio level is above the dynamic threshold and to be a listening state if the audio level is below the dynamic threshold. The previous steps are repeated for each audio sample in a sequential stream of received audio samples such that the dynamic threshold is maintained and used to determine the talk/listen state.
-
Citations
37 Claims
-
1. A method for determining a talk/listen state using a dynamic threshold, comprising:
-
receiving an audio sample representing sound measured during a sample time interval; generating an audio level from the audio sample, the audio level representing an average power level of the audio sample; setting a dynamic threshold by processing the audio level to set and maintain a dynamic level and using the dynamic level to determine a value for the dynamic threshold; comparing the audio level to the dynamic threshold; determining a talk/listen state to be a talking state if the audio level is above the dynamic threshold and to be a listening state if the audio level is below the dynamic threshold; and repeating the previous steps for each audio sample in a sequential stream of received audio samples such that the dynamic threshold is maintained and used to determine the talk/listen state. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method for determining a talk/listen state using a dynamic threshold, comprising:
-
receiving an audio data packet representing sound measured during a packet time interval; generating an audio level from the audio data packet, the audio level representing an average power level of the audio data packet; setting a long term background level; setting a foreground level by weighting the foreground level with the audio level; setting a background level by weighting the background level with the audio level; setting a dynamic threshold equal to a weighted sum of the long term background level, the foreground level and the background level; comparing the audio level to the dynamic threshold; determining a talk/listen state to be a talking state if the audio level is above the dynamic threshold and to be a listening state if the audio level is below the dynamic threshold; and repeating the previous steps for each audio data packet in a sequential stream of received audio data packets such that the dynamic threshold and the long term background, background and foreground levels are maintained and used to determine the talk/listen state. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
-
21. A multipoint control unit, comprising:
-
a plurality of conference units, each conference unit operable to receive and process each audio sample in a sequential stream of audio samples received from an associated conference site; each conference unit further operable to set and maintain a dynamic level, to maintain a dynamic threshold using the dynamic level, to compare an audio level of each audio sample to the dynamic threshold and to provide a notification signal indicating a talk/listen state of the associated conference site; and a control unit coupled to the plurality of conference units and receiving each notification signal, the control unit operable to control voice activated switching of video between the associated conference sites using the notification signals. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29)
-
-
30. A multipoint conference system, comprising:
-
a plurality of sets of conference equipment, each set of conference equipment located at a conference site and operable to transmit a sequential stream of audio samples, where each audio sample represents sound measured from the conference site for a sampled interval of time; a multipoint control unit coupled to the plurality of sets of conference equipment and receiving each sequential stream of audio samples, the multipoint conference system operable to set and maintain a dynamic level associated with each site, and set a separate dynamic threshold associated with each conference site by processing each dynamic level and audio sample in the sequential stream received from each set of equipment; and the multipoint control unit further operable to compare an audio level of each audio sample to the dynamic threshold to determine a talk/listen state of each conference site and to control voice activated switching of video between the conference sites using the determined talk/listen states of the conference sites. - View Dependent Claims (31, 32, 33, 34, 35, 36, 37)
-
Specification