Conference support device, conference support method, and computer-readable medium storing conference support program
Abstract
A conference support device includes an image receiving portion that receives captured images from conference terminals, a voice receiving portion that receives, from one of the conference terminals, a voice that is generated by a first participant, a first storage portion that stores the captured images and the voice, a voice recognition portion that recognizes the voice, a text data creation portion that creates text data that express the words that are included in the voice, an addressee specification portion that specifies a second participant, whom the voice is addressing, an image creation portion that creates a display image that is configured from the captured images and in which the text data are associated with the first participant and a specified image is associated with at least one of the first participant and the second participant, and a transmission portion that transmits the display image to the conference terminals.
20 Claims
1. A conference support device that, by controlling communication among a plurality of conference terminals, is configured to support a video conference that is conducted among conference participants who are using the conference terminals, the conference support device comprising:
- a memory configured to store computer-readable instructions; and
- a processor that is configured to execute the computer-readable instructions to:
  - receive, from the plurality of the conference terminals, captured images that are captured by image capture devices of the conference terminals and in each of which at least one of the conference participants is visible;
  - receive, from a first conference terminal that is one of the plurality of the conference terminals, a voice that is generated by a first participant, the first participant being one of the conference participants and using the first conference terminal, the voice being input from a voice input device of the first conference terminal;
  - identify words that are included in the received voice by voice recognition processing;
  - create text data that express the identified words;
  - specify a second participant who is a different conference participant from the first participant based on a result of the voice recognition processing, the second participant corresponding to the identified words and being at least one of the conference participants;
  - create a display image that is to be displayed on display devices of the plurality of the conference terminals from the captured images, and in which the text data are associated with a first portion of the captured image that corresponds to the first participant and a specified image is associated with a second portion of the captured image that corresponds to the second participant, the specified image being an image that indicates that the second participant is addressed by the first participant; and
  - transmit the created display image to the plurality of the conference terminals, in order for the display image to be displayed on the display devices of the plurality of the conference terminals.

Dependent claims: 2, 3, 4, 5, 6, 7
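As an illustration of the "specify a second participant" element of claim 1, the following Python sketch matches the recognized words against participant names and excludes the speaker. The claim does not state how the identified words are mapped to a participant, so name matching, the `Participant` record, and the function name `specify_addressees` are assumptions made for this example only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Participant:
    participant_id: str  # hypothetical identifier, not from the claims
    name: str            # name as it might be spoken in the conference
    terminal_id: str     # conference terminal the participant is using


def specify_addressees(words, speaker, participants):
    """Return participants, other than the speaker, whose name is among the words.

    Assumption: a participant "corresponds to the identified words" when
    their name appears in the recognized speech (case-insensitive).
    """
    spoken = {w.strip(".,!?").lower() for w in words}
    return [
        p for p in participants
        if p.participant_id != speaker.participant_id
        and p.name.lower() in spoken
    ]


participants = [
    Participant("p1", "Alice", "t1"),
    Participant("p2", "Bob", "t2"),
    Participant("p3", "Carol", "t3"),
]
# Stand-in for the output of voice recognition processing.
words = "Bob, could you share the figures?".split()
addressees = specify_addressees(words, participants[0], participants)
print([p.name for p in addressees])  # ['Bob']
```

Returning a list rather than a single participant reflects the claim wording that the second participant is "at least one of the conference participants".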
8. A conference support method that, by controlling communication among a plurality of conference terminals, is configured to support a video conference that is conducted among conference participants who are using the conference terminals, the conference support method comprising:
- receiving, from the plurality of the conference terminals, captured images that are captured by image capture devices of the conference terminals and in each of which at least one of the conference participants is visible;
- receiving, from a first conference terminal that is one of the plurality of the conference terminals, a voice that is generated by a first participant, the first participant being one of the conference participants and using the first conference terminal, the voice being input from a voice input device of the first conference terminal;
- identifying words that are included in the received voice by voice recognition processing;
- creating text data that express the identified words;
- specifying a second participant who is a different conference participant from the first participant based on a result of voice recognition processing, the second participant corresponding to the identified words and being at least one of the conference participants;
- creating a display image that is to be displayed on display devices of the plurality of the conference terminals from the captured images that have been received, and in which the text data that have been created are associated with a first portion of the captured image that corresponds to the first participant and a specified image is associated with a second portion of the captured image that corresponds to the second participant, the specified image being an image that indicates that the second participant is addressed by the first participant; and
- transmitting the created display image to the plurality of the conference terminals, in order for the display image to be displayed on the display devices of the plurality of the conference terminals.

Dependent claims: 9, 10, 11, 12, 13
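The "creating a display image" step of claim 8 can be pictured with a small layout model. The Python sketch below describes the display image as a grid of regions, one per participant, attaches the text data to the speaker's region, and flags the addressee's region so that a marker (the "specified image") can be drawn there. The grid arrangement, the `Region` record, the function name `create_display_layout`, and the tile sizes are all hypothetical; the claim only requires that the text and the specified image be associated with the respective portions.

```python
from dataclasses import dataclass


@dataclass
class Region:
    """One participant's portion of the composed display image."""
    participant_id: str
    x: int
    y: int
    width: int
    height: int
    caption: str = ""        # text data shown with the speaker's portion
    addressed: bool = False  # True if the "specified image" marker goes here


def create_display_layout(participant_ids, speaker_id, text, addressee_ids,
                          tile_width=320, tile_height=240, columns=2):
    """Place each participant in a grid tile and attach caption and marker."""
    regions = []
    for index, pid in enumerate(participant_ids):
        row, col = divmod(index, columns)
        regions.append(Region(
            participant_id=pid,
            x=col * tile_width,
            y=row * tile_height,
            width=tile_width,
            height=tile_height,
            caption=text if pid == speaker_id else "",
            addressed=pid in addressee_ids and pid != speaker_id,
        ))
    return regions


layout = create_display_layout(
    ["p1", "p2", "p3"], "p1", "Bob, could you share the figures?", {"p2"}
)
for region in layout:
    print(region)
```

A renderer would then draw each captured image into its region, overlay the caption where one is set, and draw the marker where `addressed` is true; that drawing step is omitted here.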
14. A non-transitory computer-readable medium that stores a conference support program for a conference support device that, by controlling communication among a plurality of conference terminals, is configured to support a video conference that is conducted among conference participants who are using the conference terminals, the program comprising instructions that cause a computer of the conference support device to perform:
- receiving, from the plurality of the conference terminals, captured images that are captured by image capture devices of the conference terminals and in each of which at least one of the conference participants is visible;
- receiving, from a first conference terminal that is one of the plurality of the conference terminals, a voice that is generated by a first participant, the first participant being one of the conference participants and using the first conference terminal, the voice being input from a voice input device of the first conference terminal;
- identifying words that are included in the received voice by voice recognition processing;
- creating text data that express the identified words;
- specifying a second participant who is a different conference participant from the first participant based on a result of voice recognition processing, the second participant corresponding to the identified words and being at least one of the conference participants;
- creating a display image that is to be displayed on display devices of the plurality of the conference terminals from the captured images that have been received, and in which the text data that have been created are associated with a first portion of the captured image that corresponds to the first participant and a specified image is associated with a second portion of the captured image that corresponds to the second participant, the specified image being an image that indicates that the second participant is addressed by the first participant; and
- transmitting the created display image to the plurality of the conference terminals, in order for the display image to be displayed on the display devices of the plurality of the conference terminals.

Dependent claims: 15, 16, 17, 18, 19, 20
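The order in which the program of claim 14 performs its steps can be sketched as a single function. In the Python outline below, voice recognition is not implemented; it is passed in as a caller-supplied function. The function name `support_conference_step`, the dictionary shapes, and the name-matching rule for finding the addressee are assumptions for illustration and are not taken from the claims.

```python
from typing import Callable, Dict, List


def support_conference_step(
    captured_images: Dict[str, bytes],        # terminal id -> captured image
    speaker_terminal: str,                    # terminal the voice came from
    voice: bytes,                             # voice data from that terminal
    recognize: Callable[[bytes], List[str]],  # caller-supplied recognizer
    names_by_terminal: Dict[str, str],        # terminal id -> participant name
) -> Dict[str, dict]:
    """Run one pass: recognize, caption, find addressee, compose, fan out."""
    words = recognize(voice)                  # identify words
    text = " ".join(words)                    # create text data
    spoken = {w.strip(".,!?").lower() for w in words}
    addressed = {                             # specify the second participant
        terminal for terminal, name in names_by_terminal.items()
        if terminal != speaker_terminal and name.lower() in spoken
    }
    display = {                               # create the display image model
        "tiles": [
            {
                "terminal": terminal,
                "image": image,
                "caption": text if terminal == speaker_terminal else "",
                "addressed": terminal in addressed,
            }
            for terminal, image in captured_images.items()
        ]
    }
    # "Transmit": every terminal is given the same composed display.
    return {terminal: display for terminal in captured_images}


def fake_recognizer(voice: bytes) -> List[str]:
    """Stand-in recognizer that ignores the audio and returns fixed words."""
    return ["Bob,", "hello"]


outgoing = support_conference_step(
    {"t1": b"frame-1", "t2": b"frame-2"},
    "t1",
    b"pcm-audio",
    fake_recognizer,
    {"t1": "Alice", "t2": "Bob"},
)
print(sorted(outgoing))
```

Passing the recognizer in as a parameter keeps the sketch runnable without any speech library and makes the sequencing of the claimed steps the only thing on show.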
Specification