Speech recognition terminal device, speech recognition system, and speech recognition method
First Claim
1. A speech recognition terminal device capable of communicating with a speech recognition server that carries out speech recognition, the speech recognition terminal device comprising:
- a speech acquisition device that acquires a speech command spoken by a user;
a request device that requests the speech recognition server to carry out the speech recognition of the speech command acquired by the speech acquisition device;
a prediction device that predicts a present delay time until a result of the speech recognition of the speech command requested from the request device is obtained from the speech recognition server;
a determination device that determines a filler word with a time length in accordance with the present delay time predicted by the prediction device;
a filler speaking device that outputs the filler word determined by the determination device as speech information during a waiting time until the result of the speech recognition requested from the request device is obtained from the speech recognition server;
a response device that, when the result of the speech recognition is acquired from the speech recognition server, executes a process of responding to the user based on the acquired result of the speech recognition; and
an acquiring device that acquires time information expressing past delay times when the communication has been carried out with the speech recognition server in past, whereinbased on the past delay times expressed by the time information acquired by the acquisition device, the prediction device predicts the present delay time until the result of the speech recognition requested from the request device is obtained from the speech recognition server.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognition terminal device communicable with a speech recognition server includes a request device for requesting the speech recognition server to carry out the speech recognition of an acquired speech command, a prediction device for predicting a present delay time until a result of the requested speech recognition is obtained from the speech recognition server, a determination device for determining a filler word with a time length in accordance with the predicted present delay time, a filler speaking device for outputting the determined filler word during a waiting time until the result of the requested speech recognition is obtained from the speech recognition server, and a response device for responding to the user when the result of the speech recognition is acquired from the speech recognition server.
-
Citations
15 Claims
-
1. A speech recognition terminal device capable of communicating with a speech recognition server that carries out speech recognition, the speech recognition terminal device comprising:
-
a speech acquisition device that acquires a speech command spoken by a user; a request device that requests the speech recognition server to carry out the speech recognition of the speech command acquired by the speech acquisition device; a prediction device that predicts a present delay time until a result of the speech recognition of the speech command requested from the request device is obtained from the speech recognition server; a determination device that determines a filler word with a time length in accordance with the present delay time predicted by the prediction device; a filler speaking device that outputs the filler word determined by the determination device as speech information during a waiting time until the result of the speech recognition requested from the request device is obtained from the speech recognition server; a response device that, when the result of the speech recognition is acquired from the speech recognition server, executes a process of responding to the user based on the acquired result of the speech recognition; and an acquiring device that acquires time information expressing past delay times when the communication has been carried out with the speech recognition server in past, wherein based on the past delay times expressed by the time information acquired by the acquisition device, the prediction device predicts the present delay time until the result of the speech recognition requested from the request device is obtained from the speech recognition server. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A speech recognition terminal device capable of communicating with a speech recognition server that carries out speech recognition, the speech recognition terminal device comprising:
-
a speech acquisition device that acquires a speech command spoken by a user; a request device that requests the speech recognition server to carry out the speech recognition of the speech command acquired by the speech acquisition device; a prediction device that predicts a present delay time until a result of the speech recognition of the speech command requested from the request device is obtained from the speech recognition server; a determination device that determines a filler word with a time length in accordance with the present delay time predicted by the prediction device; a filler speaking device that outputs the filler word determined by the determination device as speech information during a waiting time until the result of the speech recognition requested from the request device is obtained from the speech recognition server; a response device that, when the result of the speech recognition is acquired from the speech recognition server, executes a process of responding to the user based on the acquired result of the speech recognition; and an acquiring device that acquires from an external device time information expressing past delay times when the speech recognition server has communicated with the speech recognition terminal device in past, wherein based on the past delay times expressed by the time information acquired from the external device by the acquiring device, the prediction device predicts the present delay time until the result of the speech recognition is obtained from the speech recognition server.
-
-
8. A speech recognition terminal device capable of communicating with a speech recognition server that carries out speech recognition, the speech recognition terminal device comprising:
-
a speech acquisition device that acquires a speech command spoken by a user; a request device that requests the speech recognition server to carry out the speech recognition of the speech command acquired by the speech acquisition device; a prediction device that predicts a present delay time until a result of the speech recognition of the speech command requested from the request device is obtained from the speech recognition server; a determination device that determines a filler word with a time length in accordance with the present delay time predicted by the prediction device; a filler speaking device that outputs the filler word determined by the determination device as speech information during a waiting time until the result of the speech recognition requested from the request device is obtained from the speech recognition server; a response device that, when the result of the speech recognition is acquired from the speech recognition server, executes a process of responding to the user based on the acquired result of the speech recognition; an extension prediction device that, in cases where the result of the speech recognition is not acquired from the speech recognition server at a time when the present delay time predicted by the prediction device elapses, predicts an extension time when the result of the speech recognition will be obtained; an extension determination device that determines an extension filler word with a time length in accordance with the extension time predicted by the extension prediction device; and an extension filler speaking device that outputs the extension filler word determined by the extension determination device as speech information during the waiting time until the result of the speech recognition is obtained from the speech recognition server.
-
-
9. A speech recognition method in a computer system that carries out speech recognition, the speech recognition method comprising:
-
performing an acquisition process including acquiring a speech command spoken by a user; performing a request process including requesting a speech recognition server that carries out speech recognition of the speech command acquired in the acquisition process; performing a prediction process including predicting a present delay time until a result of the speech recognition requested in the request process is obtained from the speech recognition server; performing a determination process including determining a filler word with a time length in accordance with the present delay time predicted in the prediction process; performing a filler speaking process including outputting the filler word, which is determined in the determination process, as speech information during a waiting time until the result of the speech recognition requested in the request process is obtained from the speech recognition server; when the result of the speech recognition is acquired from the speech recognition server, performing a response process including responding to the user based on the acquired result of the speech recognition; and performing an acquiring process including acquiring time information expressing past delay times when communication has been carried out with the speech recognition server in past, wherein; in the prediction process, the present delay time until the result of the speech recognition is obtained from the speech recognition server is predicted based on the past delay times expressed by the time information acquired in the acquiring process. - View Dependent Claims (10, 11, 12, 13)
-
-
14. A speech recognition method in a computer system that carries out speech recognition, the speech recognition method comprising:
-
performing an acquisition process including acquiring a speech command spoken by a user; performing a request process including requesting a speech recognition server that carries out speech recognition of the speech command acquired in the acquisition process; performing a prediction process including predicting a present delay time until a result of the speech recognition requested in the request process is obtained from the speech recognition server; performing a determination process including determining a filler word with a time length in accordance with the present delay time predicted in the prediction process; performing a filler speaking process including outputting the filler word, which is determined in the determination process, as speech information during a waiting time until the result of the speech recognition requested in the request process is obtained from the speech recognition server; when the result of the speech recognition is acquired from the speech recognition server, performing a response process including responding to the user based on the acquired result of the speech recognition; and performing an acquiring process including acquiring time information stored in an external device as information expressing past delay times when the speech recognition server has communicated with a remote speech recognition terminal device in past, wherein; in the prediction process, the present delay time until the result of the speech recognition is obtained from the speech recognition server is predicted based on the past delay times expressed by the time information acquired from the external device.
-
-
15. A speech recognition method in a computer system that carries out speech recognition, the speech recognition method comprising:
-
performing an acquisition process including acquiring a speech command spoken by a user; performing a request process including requesting a speech recognition server that carries out speech recognition of the speech command acquired in the acquisition process; performing a prediction process including predicting a present delay time until a result of the speech recognition requested in the request process is obtained from the speech recognition server; performing a determination process including determining a filler word with a time length in accordance with the present delay time predicted in the prediction process; performing a filler speaking process including outputting the filler word, which is determined in the determination process, as speech information during a waiting time until the result of the speech recognition requested in the request process is obtained from the speech recognition server; when the result of the speech recognition is acquired from the speech recognition server, performing a response process including responding to the user based on the acquired result of the speech recognition; performing an extension determination process including, when the result of the speech recognition is not acquired from the speech recognition server at a time when the predicted present delay time elapses, predicting an extension time when the result of the speech recognition will be obtained; performing a determining process including determining an extension filler word with a time length in accordance with the extension time predicted in the extension determination process; and performing an extension filler speaking process including outputting the determined extension filler word as speech information during a waiting time until the result of the speech recognition is obtained from the speech recognition server.
-
Specification