Conversation support apparatus, control method of conversation support apparatus, and program for conversation support apparatus
First Claim
Patent Images
1. A single conversation support apparatus which is formed into a plate shape comprises:
- two or more microphones configured to input speech signals of two or more users;
a processor configured to recognize the inputted speech signals from the two or more microphones, wherein the processor is configured to convert the inputted speech signals into recognition results; and
one display unit configured to display the recognition results of the processor, wherein the two or more microphones are disposed separately from each other in a same plane as the single conversation support apparatus, wherein the two or more microphones and the display unit are included within the single conversation support apparatus;
wherein the processor is configured to;
divide a display area of the one display unit to respectively correspond to the users;
estimate sound source directions of the users based on signal levels of the inputted speech signals by the two or more microphones; and
display the recognition results of the processor to be listed in time-series in the divided display areas, which respectively correspond to the users, of the one display unit at display angles respectively facing front when viewed from the estimated sound source directions, wherein the recognition results in each of the divided display areas are aligned in accordance with the estimated sound source directions so as to face at least one of the users.
1 Assignment
0 Petitions
Accused Products
Abstract
A conversation support apparatus includes: a speech input unit configured to input speech signals of two or more users; a speech recognizing unit configured to recognize the speech signals input from the speech input unit; a display unit configured to display the recognition results of the speech recognizing unit; and an image processing unit configured to set display areas respectively corresponding to the users into an image display area of the display unit.
28 Citations
11 Claims
-
1. A single conversation support apparatus which is formed into a plate shape comprises:
-
two or more microphones configured to input speech signals of two or more users; a processor configured to recognize the inputted speech signals from the two or more microphones, wherein the processor is configured to convert the inputted speech signals into recognition results; and one display unit configured to display the recognition results of the processor, wherein the two or more microphones are disposed separately from each other in a same plane as the single conversation support apparatus, wherein the two or more microphones and the display unit are included within the single conversation support apparatus; wherein the processor is configured to; divide a display area of the one display unit to respectively correspond to the users; estimate sound source directions of the users based on signal levels of the inputted speech signals by the two or more microphones; and display the recognition results of the processor to be listed in time-series in the divided display areas, which respectively correspond to the users, of the one display unit at display angles respectively facing front when viewed from the estimated sound source directions, wherein the recognition results in each of the divided display areas are aligned in accordance with the estimated sound source directions so as to face at least one of the users. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A control method of a single conversation support apparatus which is formed into a plate shape, comprising:
-
inputting speech signals of two or more users at two or more microphones; recognizing the inputted speech signals at a processor, and converting the inputted speech signals into recognition results; dividing one display area of a display unit to respectively correspond to the users, wherein the one display unit is configured to display the recognition results of the processor, wherein the two or more microphones and the display unit are included within the single conversation support apparatus; estimating a sound source direction of the users based on signal levels of the inputted speech signals by the two or more microphones; and displaying the recognition results of the processor to be listed in time-series in the divided display area, which respectively correspond to the users, of the one display unit at display angles respectively facing front when viewed from the estimated sound source directions, wherein the two or more microphones are disposed separately from each other in a same plane as the single conversation support apparatus, wherein the recognition results in each of the divided display areas are aligned in accordance with the estimated sound source directions so as to face at least one of the users.
-
-
11. A non-transitory computer-readable medium encoding instructions that, when executed in hardware, perform a process, the process comprising:
-
inputting speech signals of two or more users at two or more microphones; recognizing the inputted speech signals at a processor, and converting the inputted speech signals into recognition results; dividing one display area of a display unit to respectively correspond to the users, wherein the one display unit is configured to display the recognized speech signals of the processor, wherein the two or more microphones and the display unit are included within a single conversation support apparatus; estimating a sound source direction of the users based on signal levels of the inputted speech signals by the two or more microphones; and displaying the recognized speech signals of the processor are listed in time-series in the divided display area, which respectively correspond to the users, of the display unit at display angles respectively facing front when viewed from the estimated sound source directions, wherein the two or more microphones are disposed separately from each other in a same plane as the single conversation support apparatus, wherein the recognition results in each of the divided display areas are aligned in accordance with the estimated sound source directions so as to face at least one of the users.
-
Specification