Automatic voice tracking camera system and method of operation
First Claim
1. An automatic voice tracking camera system, comprising:
a first camera operable to receive control signals for controlling a view of the first camera;
a microphone array comprising a plurality of microphones, the microphone array operable to receive a voice of a speaker and to provide an audio signal representing the voice;
a beamformer coupled to the microphone array, the beamformer operable to receive the audio signal, to generate from the audio signal speaker position data representing a position of the speaker as coordinates for a point in space, and to provide the speaker position data; and
a camera controller coupled to the beamformer and to the first camera, the camera controller operable to receive the speaker position data, to determine a responsive camera movement based upon changes of greater than a predetermined amount in the coordinates for the point in space indicative of speaker position changes, to generate camera control signals based upon the responsive camera movement, and to provide the camera control signals to the first camera such that the view of the first camera automatically tracks the position of the speaker.
Abstract
An automatic voice tracking camera system (12) is provided. System (12) includes a camera (18) operable to receive control signals for controlling a view of the camera (18). A microphone array (14) comprises a plurality of microphones. The microphone array (14) is operable to receive a voice of a speaker and to provide an audio signal representing the voice. A beamformer (30) is coupled to the microphone array (14). The beamformer (30) is operable to receive the audio signal, to generate from the audio signal speaker position data representing a position of the speaker, and to provide the speaker position data. A camera controller (36) is coupled to the beamformer (30) and to the camera (18). The camera controller (36) is operable to receive the speaker position data and to determine an appropriate responsive camera movement. The camera controller (36) is further operable to generate camera control signals and to provide the camera control signals to the camera (18) such that the view of the camera (18) automatically tracks the position of the speaker.
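As a rough illustration of how speaker position data might be turned into a camera movement, the following Python sketch converts a point in space into pan and tilt angles. The coordinate convention (camera at the origin, y straight ahead, x to the right, z up), the function name `pan_tilt_for`, and the use of degrees are assumptions made for this example; the abstract does not specify them.

```python
import math

def pan_tilt_for(x, y, z, camera=(0.0, 0.0, 0.0)):
    """Return (pan, tilt) in degrees that aim a camera at the point (x, y, z).

    Assumed convention (not taken from the patent): y is straight ahead of
    the camera, x is to its right, and z is up.
    """
    dx, dy, dz = x - camera[0], y - camera[1], z - camera[2]
    # Pan: rotation about the vertical axis, 0 degrees when looking along +y.
    pan = math.degrees(math.atan2(dx, dy))
    # Tilt: elevation of the target above the horizontal plane.
    tilt = math.degrees(math.atan2(dz, math.hypot(dx, dy)))
    return pan, tilt
```

Under these assumptions, a speaker at (1, 1, 0) lies 45 degrees to the right of the camera axis at zero elevation.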
208 Citations
30 Claims
-
1. An automatic voice tracking camera system, comprising:
a first camera operable to receive control signals for controlling a view of the first camera;
a microphone array comprising a plurality of microphones, the microphone array operable to receive a voice of a speaker and to provide an audio signal representing the voice;
a beamformer coupled to the microphone array, the beamformer operable to receive the audio signal, to generate from the audio signal speaker position data representing a position of the speaker as coordinates for a point in space, and to provide the speaker position data; and
a camera controller coupled to the beamformer and to the first camera, the camera controller operable to receive the speaker position data, to determine a responsive camera movement based upon changes of greater than a predetermined amount in the coordinates for the point in space indicative of speaker position changes, to generate camera control signals based upon the responsive camera movement, and to provide the camera control signals to the first camera such that the view of the first camera automatically tracks the position of the speaker.
-
-
12. An automatic camera controller for automatically controlling a camera to track a position of a speaker, the automatic camera controller comprising:
a camera controller operable to receive speaker position data representing the position of the speaker as coordinates for a point in space, to determine a speaker position change if changes in the coordinates exceed a predetermined amount, to determine an appropriate responsive camera movement based upon the speaker position change determination, to generate camera control signals based upon the appropriate responsive camera movement, and to provide the camera control signals to at least one camera such that a view of the at least one camera automatically tracks the position of the speaker; and
an interface coupled to the camera controller, the interface operable to communicate with a host video conference system and with the camera controller.
-
-
20. A method of automatically controlling a camera to track a position of a speaker using a voice of the speaker, comprising:
receiving a voice of the speaker and providing an audio signal representing the voice;
processing the audio signal to generate speaker position data representing the position of the speaker as coordinates for a point in space;
determining a speaker position change if changes in the coordinates exceed a predetermined amount;
determining an appropriate responsive camera movement from the speaker position data based upon the speaker position change determination;
generating camera control signals based upon the appropriate responsive camera movement; and
providing the camera control signals to a first camera such that a view of the first camera automatically tracks the position of the speaker.
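The position-change step of claim 20 can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the class name `ChangeDetector`, the use of Euclidean distance as the measure of "changes in the coordinates", and the rule that the first reading always counts as a change are all assumptions.

```python
import math

class ChangeDetector:
    """Report a speaker position change only when the coordinates move
    by more than a predetermined amount (hypothetical helper)."""

    def __init__(self, threshold):
        self.threshold = threshold  # the "predetermined amount"
        self.current = None         # last accepted speaker position

    def update(self, position):
        """Return True if `position` counts as a speaker position change."""
        if self.current is None:
            # Assumption: the first reading is always accepted.
            self.current = position
            return True
        if math.dist(position, self.current) > self.threshold:
            self.current = position
            return True
        # Small movements are ignored so the camera does not jitter.
        return False
```

Only a `True` result would go on to the later steps of determining a camera movement and generating control signals.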
21. The method of claim 20, further comprising:
generating camera control signals to move a second camera to view a new speaker position at a new point in space while the first camera is active; and
switching to the second camera after the new speaker position is viewed by the second camera.
-
-
22. The method of claim 20, wherein determining an appropriate responsive camera movement comprises moving the first camera to view a new speaker position at a new point in space only when the new speaker position is outside of a first window and inside of a second window, said first window and said second window each defined with respect to a current speaker position.
-
23. The method of claim 20, wherein determining an appropriate responsive camera movement further comprises moving a second camera to view a new speaker position when the new speaker position is outside of a first and a second window defined with respect to a current speaker position.
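The two-window logic of claims 22 and 23 might be sketched as below. The claims do not say what shape the windows take; this example assumes each window is a cube centered on the current speaker position and described by a half-width, and the function name and return strings are hypothetical.

```python
def choose_camera_action(current, new, first_window, second_window):
    """Pick a camera action from the new position's offset from the current one.

    Assumption: each window is a cube centered on `current`, given as a
    half-width, with first_window smaller than second_window.
    """
    if first_window >= second_window:
        raise ValueError("first_window must be smaller than second_window")
    # Largest per-axis offset decides which cube the new position falls in.
    offset = max(abs(n - c) for n, c in zip(new, current))
    if offset <= first_window:
        return "hold"                # inside the first window: no movement
    if offset <= second_window:
        return "move_first_camera"   # claim 22: outside first, inside second
    return "move_second_camera"      # claim 23: outside both windows
```

For example, with windows of 0.5 and 2.0, a new position 1.0 away along one axis would move the first camera, while one 3.0 away would move the second camera.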
-
24. The method of claim 20, wherein determining an appropriate responsive camera movement comprises filtering the speaker position data to delay movement of the first camera until a new speaker position is verified.
-
25. The method of claim 24, wherein determining an appropriate responsive camera movement further comprises filtering the speaker position data by storing and averaging a plurality of temporary speaker positions.
-
26. The method of claim 25, wherein the plurality of temporary speaker positions comprises three temporary speaker positions.
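The filtering described in claims 24 through 26 could look something like the following sketch, which stores temporary speaker positions and returns their average once three have been collected. The class name `PositionFilter` and the sliding-window behavior (the oldest sample is dropped as each new one arrives) are assumptions; the claims state only that a plurality of temporary positions is stored and averaged.

```python
from collections import deque

class PositionFilter:
    """Delay camera movement until enough temporary positions are stored,
    then report their average (hypothetical helper)."""

    def __init__(self, count=3):
        # Claim 26 uses three temporary speaker positions.
        self.samples = deque(maxlen=count)

    def add(self, position):
        """Store a temporary position; return the averaged position once
        `count` samples are held, otherwise None."""
        self.samples.append(position)
        if len(self.samples) < self.samples.maxlen:
            return None
        n = len(self.samples)
        # Average each coordinate axis across the stored samples.
        return tuple(sum(axis) / n for axis in zip(*self.samples))
```

A caller would leave the camera where it is while `add` returns `None` and act only on the averaged position.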
-
27. The method of claim 20, wherein determining an appropriate responsive camera movement comprises associating the first camera and a second camera each with one of a plurality of zones and only moving the first camera and the second camera to view the speaker position if the speaker position is in the associated zone.
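The zone association of claim 27 might be illustrated as follows. The rectangular zone representation, the camera names, and the function `cameras_to_move` are assumptions for this sketch; the claim does not define how a zone is described.

```python
def cameras_to_move(position, zones):
    """Return the cameras whose associated zone contains the speaker position.

    `zones` maps a camera name to ((xmin, xmax), (ymin, ymax)); this
    rectangular floor-plan layout is an assumption for illustration.
    """
    x, y = position[0], position[1]
    return [
        name
        for name, ((x0, x1), (y0, y1)) in zones.items()
        if x0 <= x <= x1 and y0 <= y <= y1
    ]
```

With a first camera covering x from 0 to 5 and a second covering x from 5 to 10, a speaker at x = 2 would move only the first camera, and a speaker outside both zones would move neither.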
-
28. The method of claim 20, wherein generating camera control signals further comprises generating zoom control from a host computer as an altered zoom set up configuration for controlling the camera.
-
29. The method of claim 20, wherein processing the audio signal further comprises generating on and off control signals from a host computer for the automatic tracking control of the speaker.
-
30. The method of claim 20, further comprising processing the audio signal representing the voice and the speaker position data for determining an offset between the camera and the microphone array by capturing the speaker while talking.
Specification