Spatial sound conference system and method

US 20060133619A1
Filed: 02/16/2006
Published: 06/22/2006
Est. Priority Date: 02/08/1996
Status: Active Grant

3 Assignments

0 Petitions

Abstract

The spatial sound conference system enables participants in a teleconference to distinguish between speakers even during periods of interruption and overtalk, identify speakers based on spatial location cues, understand low volume speech, and block out background noise using spatial sound information. Spatial sound information may be captured using microphones positioned at the ear locations of a dummy head at a conference table, or spatial sound information may be added to a participant's monaural audio signal using head-related transfer functions. Head-related transfer functions simulate the frequency response of audio signals across the head from one ear to the other ear to create a spatial location for a sound. Spatial sound is transmitted across a communication channel, such as ISDN, and reproduced using spatially disposed loudspeakers positioned at the ears of a participant. By inserting a spatial sound component in a teleconference, a speaker other than the loudest speaker may be heard during periods of interruption and overtalk. Additionally, speakers may be more readily identified when they have a spatial sound position, and the perception of background noise is reduced.
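
As a rough illustration of adding spatial information to a monaural signal, the sketch below convolves a mono signal with a left/right pair of impulse responses. It is only a minimal sketch under stated assumptions: `toy_hrir_pair` is a hypothetical stand-in that models a head-related transfer function as a pure delay plus attenuation at the far ear (a spherical-head approximation), whereas a real head-related transfer function is a measured, frequency-dependent filter pair. The function names, the 8 kHz sample rate, the head radius, and the far-ear gain rule are all assumptions made for illustration and do not come from the patent.

```python
import math


def convolve(signal, kernel):
    """Direct-form FIR convolution: len(signal) + len(kernel) - 1 samples."""
    out = [0.0] * (len(signal) + len(kernel) - 1)
    for i, s in enumerate(signal):
        for j, k in enumerate(kernel):
            out[i + j] += s * k
    return out


def toy_hrir_pair(azimuth_deg, sample_rate=8000,
                  head_radius_m=0.0875, speed_of_sound=343.0):
    """Return (left_ir, right_ir) for a source at azimuth_deg in [-90, 90].

    Positive azimuth places the source on the listener's right.
    ASSUMPTION: delay plus gain only; not a measured HRTF.
    """
    az = math.radians(abs(azimuth_deg))
    # Extra travel time to the far ear, expressed in whole samples.
    delay = round(sample_rate * head_radius_m / speed_of_sound
                  * (az + math.sin(az)))
    # Arbitrary illustrative head-shadow attenuation for the far ear.
    far_gain = 1.0 - 0.5 * math.sin(az)
    near = [1.0] + [0.0] * delay
    far = [0.0] * delay + [far_gain]
    return (far, near) if azimuth_deg > 0 else (near, far)


def spatialize(mono, azimuth_deg):
    """Turn a mono sample list into a (left, right) pair of sample lists."""
    left_ir, right_ir = toy_hrir_pair(azimuth_deg)
    return convolve(mono, left_ir), convolve(mono, right_ir)
```

With these assumed parameters, a source placed fully to the right reaches the right ear immediately and the left ear a few samples later at reduced level, which is the kind of cue a listener uses to localize a talker.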

66 Citations

47 Claims

1-27. (canceled)

28. A system comprising:
- a plurality of participant stations, each of the plurality of participant stations associated with at least one conference participant and including at least one microphone configured to transmit a participant audio signal generated based on the at least one conference participant, at least one speaker configured to receive a composite audio signal and convert the composite audio signal to audible sound, at least one video camera configured to transmit a participant video signal generated based on the at least one conference participant, at least one video display configured to receive a transmitted video signal, and a station processing system coupled to the at least one microphone, the at least one speaker, the at least one video camera and the at least one video display, the station processing system configured to receive the participant audio signal from the at least one microphone, receive the participant video signal from the at least one video camera, compress the participant audio signal and the participant video signal, transmit the compressed participant audio signal and compressed participant video signal over a network, receive the composite audio signal and the transmitted video signal in compressed form, decompress the composite audio signal and transmitted video signal from the compressed form, transmit the composite audio signal to the at least one speaker, transmit the transmitted video signal to the at least one display; and
  
  a spatial processing system coupled to the plurality of participant stations via the network, the spatial processing system configured to receive the participant audio signal from each participant station, receive the participant video signal from each participant station, decompress the participant audio signals, apply a first head-related transfer function associated with a first participant station of the plurality of participant stations to the participant audio signal of the first participant station to generate a first spatialized audio signal, apply a second head-related transfer function associated with a second participant station of the plurality of participant stations to the participant audio signal of the second participant station to generate a second spatialized audio signal, combine the first spatialized audio signal and second spatialized audio signal into a third composite audio signal, compress the third composite audio signal, and transmit the third composite audio signal to a third participant station of the plurality of participant stations.
- Dependent claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 46, 47)
  - 29. The system of claim 28, wherein the participant audio signal is a monaural signal.
  - 30. The system of claim 28, wherein the station processing system is further configured to convert the participant audio signal into a digital signal prior to compression.
  - 31. The system of claim 28, wherein the spatial conferencing system is further configured to assign a virtual location associated with each of the plurality of participant stations, and select one of a plurality of head-related transfer functions to be associated with each of the plurality of participant stations based on the virtual location, wherein the first head-related transfer function and the second head-related transfer function are among the plurality of head-related transfer functions.
  - 32. The system of claim 31, wherein the virtual locations simulate one of:
    - participants in a circle, participants in a line, participants in a rectangle, participants in a semicircle.
  - 33. The system of claim 31, wherein each assigned virtual location associated with each of the plurality of participant stations is also associated with a perspective of a specific one of the plurality of participant stations.
  - 34. The system of claim 33, wherein the spatial conference system is configured to assign virtual locations associated with each of the plurality of participant stations and associated with the perspective of each of the plurality of participant stations, and select a head-related transfer function to be associated with each of the plurality of participant stations based on the virtual location.
  - 35. The system of claim 28, wherein the composite audio signal comprises a first audio signal and a second audio signal, and wherein the at least one speaker comprises a first speaker and a second speaker, and the first speaker receives the first audio signal and the second speaker receives the second audio signal.
  - 36. The system of claim 28, wherein the spatial conference system is further configured to apply a third head-related transfer function associated with the third participant station to the participant audio signal of the third participant station to generate a third spatialized audio signal, apply a fourth head-related transfer function associated with the second participant station to the participant audio signal of the second participant station to generate a fourth spatialized audio signal, combine the third spatialized audio signal and fourth spatialized audio signal into a first composite audio signal, compress the first composite audio signal, and transmit the first composite audio signal to the first participant station.
  - 37. The system of claim 28, wherein the spatial conference system is further configured to apply a fifth head-related transfer function associated with the first participant station to the participant audio signal of the first participant station to generate a fifth spatialized audio signal, apply a sixth head-related transfer function associated with the third participant station to the participant audio signal of the third participant station to generate a sixth spatialized audio signal, combine the fifth spatialized audio signal and sixth spatialized audio signal into a second composite audio signal, compress the second composite audio signal, and transmit the second composite audio signal to the second participant station.
  - 46. The system of claim 28, wherein the spatial processing system is further configured to transmit the participant video signals to each of the plurality of participant stations.
  - 47. The system of claim 28, wherein the spatial processing system includes at least one of:
    - echo cancellation facilities, reverberation facilities, and speaker crossover cancellation facilities.
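
Claims 31 to 34 describe assigning each station a virtual location, from the perspective of a given listener, and selecting a head-related transfer function based on that location. One way this could look is sketched below. It is a hedged illustration only: the bank of available azimuths, the function names, and the geometry conventions (positive azimuth to the listener's right, seats numbered counterclockwise) are assumptions, and only the semicircle and circle layouts are shown.

```python
import math

# Hypothetical bank: azimuths in degrees for which filter pairs are assumed
# to exist (positive = listener's right).
HRTF_BANK_AZIMUTHS = [-90, -60, -30, 0, 30, 60, 90]


def nearest_bank_azimuth(azimuth_deg):
    """Select the bank entry closest to the requested virtual location."""
    return min(HRTF_BANK_AZIMUTHS, key=lambda b: abs(b - azimuth_deg))


def semicircle_azimuths(count):
    """Spread `count` talkers evenly across the frontal semicircle."""
    return [-90.0 + 180.0 * (i + 1) / (count + 1) for i in range(count)]


def _seat_xy(seat, seat_count):
    angle = 2.0 * math.pi * seat / seat_count
    return math.cos(angle), math.sin(angle)


def circle_azimuths(seat_count, listener_seat):
    """Bearing of every other seat for a listener who faces the centre."""
    lx, ly = _seat_xy(listener_seat, seat_count)
    fx, fy = -lx, -ly  # facing direction: towards the centre of the circle
    bearings = {}
    for seat in range(seat_count):
        if seat == listener_seat:
            continue
        sx, sy = _seat_xy(seat, seat_count)
        vx, vy = sx - lx, sy - ly
        cross = fx * vy - fy * vx
        dot = fx * vx + fy * vy
        # Negate so that sources on the listener's right are positive.
        bearings[seat] = -math.degrees(math.atan2(cross, dot))
    return bearings


def plan_for_listener(stations, listener, layout="semicircle"):
    """Map each other station to the bank azimuth nearest its virtual seat."""
    others = [s for s in stations if s != listener]
    if layout == "semicircle":
        azimuths = dict(zip(others, semicircle_azimuths(len(others))))
    elif layout == "circle":
        by_seat = circle_azimuths(len(stations), stations.index(listener))
        azimuths = {stations[i]: az for i, az in by_seat.items()}
    else:
        raise ValueError("layout not covered by this sketch: " + layout)
    return {s: nearest_bank_azimuth(az) for s, az in azimuths.items()}
```

Calling `plan_for_listener` once per station yields a separate plan for each listener's perspective, so the same talker can sit at a different virtual location for different listeners.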

38. A method comprising:
- receiving participant audio signals from each of a plurality of participant stations;
  
  receiving participant video signals from each of the plurality of participant stations;
  
  decompressing the participant audio signals;
  
  applying a first head-related transfer function associated with a first participant station of the plurality of participant stations to the participant audio signal of the first participant station to generate a first spatialized audio signal;
  
  applying a second head-related transfer function associated with a second participant station of the plurality of participant stations to the participant audio signal of the second participant station to generate a second spatialized audio signal;
  
  combining the first spatialized audio signal and second spatialized audio signal into a third composite audio signal;
  
  compressing the third composite audio signal; and
  
  transmitting the third composite audio signal to a third participant station of the plurality of participant stations.
- Dependent claims (39, 40, 41, 42, 43, 44, 45)
  - 39. The method of claim 38, further comprising:
    - determining the first head-related transfer function based on a virtual location of the first participant station; and
      
      determining the second head-related transfer function based on a virtual location of the second participant station.
  - 40. The method of claim 39, wherein the virtual location is associated with a perspective of the third participant station.
  - 41. The method of claim 38, further comprising:
    - assigning a virtual location associated with each of the plurality of participant stations; and
      
      selecting one of a plurality of head-related transfer functions to be associated with each of the plurality of participant stations based on the virtual location;
      
      wherein the first head-related transfer function and the second head-related transfer function are among the plurality of head-related transfer functions.
  - 42. The method of claim 41, wherein each virtual location is further associated with a perspective of each of the plurality of participant stations, and the selected one of the plurality of head-related transfer functions is also associated with the perspective of one of the plurality of participant stations based on the virtual location.
  - 43. The method of claim 38, further comprising:
    - applying a third head-related transfer function associated with the third participant station to the participant audio signal of the third participant station to generate a third spatialized audio signal;
      
      applying a fourth head-related transfer function associated with the second participant station to the participant audio signal of the second participant station to generate a fourth spatialized audio signal;
      
      combining the third spatialized audio signal and fourth spatialized audio signal into a first composite audio signal;
      
      compressing the first composite audio signal; and
      
      transmitting the first composite audio signal to the first participant station.
  - 44. The method of claim 38, further comprising:
    - applying a fifth head-related transfer function associated with the first participant station to the participant audio signal of the first participant station to generate a fifth spatialized audio signal;
      
      applying a sixth head-related transfer function associated with the third participant station to the participant audio signal of the third participant station to generate a sixth spatialized audio signal;
      
      combining the fifth spatialized audio signal and sixth spatialized audio signal into a second composite audio signal;
      
      compressing the second composite audio signal; and
      
      transmitting the second composite audio signal to the second participant station.
  - 45. The method of claim 38, further comprising:
    - transmitting the participant video signals to each of the plurality of participant stations.
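
The steps of claims 38, 43 and 44 (decompress, spatialize each talker, combine, compress, and send each station a composite built from the other stations) can be sketched as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the codec is passed in as plain callables (identity functions in the demo), `toy_spatialize` uses fixed left/right gains in place of real head-related transfer functions, and `SEATS`, `TOY_GAINS`, and the sample values are hypothetical.

```python
def mix(stereo_signals):
    """Sum (left, right) pairs sample by sample and clamp to [-1.0, 1.0]."""
    length = max((max(len(l), len(r)) for l, r in stereo_signals), default=0)
    left, right = [0.0] * length, [0.0] * length
    for l, r in stereo_signals:
        for i, s in enumerate(l):
            left[i] += s
        for i, s in enumerate(r):
            right[i] += s

    def clamp(x):
        return max(-1.0, min(1.0, x))

    return [clamp(x) for x in left], [clamp(x) for x in right]


def build_composites(compressed_by_station, decompress, spatialize, compress):
    """Return one compressed composite per listener, excluding its own audio."""
    mono = {sid: decompress(p) for sid, p in compressed_by_station.items()}
    composites = {}
    for listener in mono:
        spatialized = [spatialize(talker, listener, signal)
                       for talker, signal in mono.items()
                       if talker != listener]
        composites[listener] = compress(mix(spatialized))
    return composites


# --- Demo with hypothetical data -------------------------------------------
# ASSUMPTION: fixed gains stand in for head-related transfer functions.
TOY_GAINS = {"left": (1.0, 0.5), "right": (0.5, 1.0)}
# ASSUMPTION: where each listener hears each talker.
SEATS = {
    "A": {"B": "left", "C": "right"},
    "B": {"A": "left", "C": "right"},
    "C": {"A": "left", "B": "right"},
}


def toy_spatialize(talker, listener, mono):
    gain_left, gain_right = TOY_GAINS[SEATS[listener][talker]]
    return [s * gain_left for s in mono], [s * gain_right for s in mono]


def identity(x):
    # Placeholder codec: a real system would compress and decompress here.
    return x


incoming = {"A": [0.5, 0.0], "B": [0.0, 0.25], "C": [0.25, 0.25]}
composites = build_composites(incoming, identity, toy_spatialize, identity)
```

In the demo, the composite for station C contains only the spatialized signals of A and B, mirroring the third composite of claim 38, while the composites for A and B correspond to the first and second composites of claims 43 and 44.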

Current Assignee
Verizon Patent and Licensing Incorporated (Verizon Communications Inc.)
Original Assignee
Verizon Services Corporation (Verizon Communications Inc.)
Inventors
McAllister, Alexander I., Curry, James E., Hatton, Patricia V.

Granted Patent

US 8,170,193 B2
US Class Current

381/26
CPC Class Codes

H04M 3/56   Arrangements for connecting...

H04M 3/568   audio processing specific t...

H04N 7/148   Interfacing a video termina...

H04N 7/15   Conference systems

H04R 27/00   Public address systems circ...
