Method, apparatus, and computing device for revision of speech recognition results
Abstract
The present disclosure provides an information input method, an information input device, and a computing apparatus. The information input method comprises receiving a voice input of a user, acquiring a recognition result for the received voice input, and enabling editing of the acquired recognition result in a text format. With this information input mechanism, a user can choose to revise an automatic speech recognition result by editing it as text, which is particularly useful when the recognized content contains only a few errors. As a result, the user does not have to repeat the entire voice input, speech recognition efficiency is increased, and the user experience is improved.
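The text-format editing described in the abstract can be pictured with a small sketch. The following is only an illustration under stated assumptions: the `Revision` record and the `revise_result` helper are hypothetical names that do not appear in the disclosure, and the sample strings are invented for the example.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Revision:
    """Hypothetical record of one edit: the recognized text and the user's fix."""

    original: str
    revised: str


def revise_result(
    results: list[str], index: int, new_text: str
) -> tuple[list[str], Revision | None]:
    """Return a copy of `results` with entry `index` replaced by `new_text`.

    The second element is None when the edit leaves the text unchanged,
    so a caller can tell whether there is anything worth reporting.
    """
    original = results[index]
    if new_text == original:
        return list(results), None
    updated = list(results)
    updated[index] = new_text
    return updated, Revision(original=original, revised=new_text)


# Invented sample data: only one word of the second result is wrong.
results = ["recognize speech", "wreck a nice beach"]
updated, revision = revise_result(results, 1, "wreck a nice peach")
print(updated)
print(revision)
```

The point of the sketch is that a single-word fix touches one entry of the list, rather than requiring the whole utterance to be spoken again.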
30 Citations
16 Claims
1. An information input method, comprising:
receiving a voice input of a user;
sending the received voice input to a server;
displaying an animation having a continuous refresh rate while the server determines, based on a speech recognition model, recognition results on the received voice input;
acquiring, from the server, a list of the recognition results;
in response to the list of the recognition results being acquired, hiding the animation and replacing the animation with a display of the list of the recognition results;
providing an interface to revise a recognition result in the list of the acquired recognition results, the providing the interface comprising:
in response to the user single clicking, double clicking, or hovering over an editing mode logo for the recognition result, sliding, for a duration distinct from the continuous refresh rate of the animation, the interface from a bottom of a screen of a display device in an accelerating and subsequently decelerating manner; and
in response to revising the recognition result, feeding the revised recognition result to the server to train the speech recognition model.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
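The order of the steps recited in claim 1 can be sketched as a short client-side routine. This is a minimal sketch, not the patented implementation: the `RecognitionServer` and `InputUI` interfaces, the `run_input_flow` function, and the fake classes are all hypothetical, and the blocking `recognize` call stands in for both sending the voice input and waiting for the server.

```python
from __future__ import annotations

from typing import Protocol


class RecognitionServer(Protocol):
    """Hypothetical server interface; the claims do not define an API."""

    def recognize(self, audio: bytes) -> list[str]: ...
    def submit_revision(self, original: str, revised: str) -> None: ...


class InputUI(Protocol):
    """Hypothetical display interface."""

    def show_animation(self) -> None: ...
    def hide_animation(self) -> None: ...
    def show_results(self, results: list[str]) -> None: ...
    def edit(self, text: str) -> str: ...


def run_input_flow(
    audio: bytes, server: RecognitionServer, ui: InputUI, chosen: int = 0
) -> list[str]:
    ui.show_animation()  # animation stays up while the server works
    try:
        results = server.recognize(audio)  # send the input, wait for the list
    finally:
        ui.hide_animation()  # hidden once the wait ends
    ui.show_results(results)  # the list replaces the animation
    original = results[chosen]
    revised = ui.edit(original)  # editing interface for one result
    if revised != original:
        server.submit_revision(original, revised)  # feedback for training
        results = results[:chosen] + [revised] + results[chosen + 1:]
    return results


class FakeServer:
    """Stand-in server that records calls in a shared log."""

    def __init__(self, log: list[str]) -> None:
        self.log = log

    def recognize(self, audio: bytes) -> list[str]:
        self.log.append("recognize")
        return ["send the file", "sand the file"]

    def submit_revision(self, original: str, revised: str) -> None:
        self.log.append(f"train:{original}->{revised}")


class FakeUI:
    """Stand-in UI that records calls and returns a fixed correction."""

    def __init__(self, log: list[str], correction: str) -> None:
        self.log = log
        self.correction = correction

    def show_animation(self) -> None:
        self.log.append("animation on")

    def hide_animation(self) -> None:
        self.log.append("animation off")

    def show_results(self, results: list[str]) -> None:
        self.log.append("results shown")

    def edit(self, text: str) -> str:
        self.log.append("edit")
        return self.correction


log: list[str] = []
final = run_input_flow(b"\x00\x01", FakeServer(log), FakeUI(log, "send the files"))
print(log)
print(final)
```

The interfaces are passed in so the ordering can be checked with the fakes. A real client would presumably wait asynchronously so that the animation keeps refreshing during the request.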
9. An information input device, comprising:
one or more processors; and
a memory storing instructions that, when executed by the one or more processors, cause the device to perform:
receiving a voice input of a user;
sending the received voice input to a server;
displaying an animation having a continuous refresh rate while a server determines, based on a speech recognition model, recognition results on the received voice input;
acquiring, from the server, a list of the recognition results;
in response to the list of the recognition results being acquired, hiding the animation and replacing the animation with a display of the list of the recognition results;
providing an interface to revise a recognition result in the list of the acquired recognition results, the providing the interface comprising:
in response to the user single clicking, double clicking, or hovering over an editing mode logo for the recognition result, sliding, for a duration distinct from the continuous refresh rate of the animation, the interface from a bottom of a screen of a display device, in an accelerating and subsequently decelerating manner; and
in response to revising the recognition result, feeding the revised recognition result to the server to train the speech recognition model.
Dependent claims: 10, 11, 12
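The sliding step recited above, in which the interface moves up from the bottom of the screen "in an accelerating and subsequently decelerating manner", corresponds to what animation toolkits usually call an ease-in-out curve. The sketch below uses the smoothstep polynomial as one possible such curve. The claims do not name a curve, and the function names, the 300 ms duration, and the pixel sizes are assumptions chosen for illustration.

```python
def ease_in_out(progress: float) -> float:
    """Smoothstep curve: speed rises until the midpoint, then falls.

    `progress` is the elapsed fraction of the slide, clamped to [0, 1].
    """
    p = min(max(progress, 0.0), 1.0)
    return p * p * (3.0 - 2.0 * p)


def panel_top(
    elapsed_ms: float, duration_ms: float, screen_height: float, panel_height: float
) -> float:
    """Y coordinate of the panel's top edge, measured from the top of the screen.

    The panel starts fully below the bottom edge and ends fully visible.
    `duration_ms` must be positive; it is a separate parameter so that the
    slide timing stays independent of any loading animation.
    """
    travelled = ease_in_out(elapsed_ms / duration_ms) * panel_height
    return screen_height - travelled


# Assumed values: a 200 px panel sliding up an 800 px screen over 300 ms.
for t in (0, 75, 150, 225, 300):
    print(t, panel_top(t, 300, 800, 200))
```

The derivative of smoothstep is zero at both ends and largest at the midpoint, which is what produces the accelerate-then-decelerate motion.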
13. A computing apparatus, comprising:
a network interface, configured to enable the computing apparatus to conduct data communications with a server via one or more networks;
a memory configured to store data communicated via the network interface; and
one or more processors in connection with the network interface and the memory, the one or more processors being configured to execute the following operations:
receiving a voice input of a user;
sending the received voice input to a server;
displaying an animation having a continuous refresh rate while a server determines, based on a speech recognition model, recognition results on the received voice input;
acquiring, from the server, a list of the recognition results;
in response to the list of the recognition results being acquired, hiding the animation and replacing the animation with a display of the list of the recognition results;
providing an interface to revise a recognition result in the list of the recognition results, the providing the interface comprising:
in response to the user single clicking, double clicking, or hovering over an editing mode logo for the recognition result, sliding, for a duration distinct from the continuous refresh rate of the animation, the interface from a bottom of a screen of a display device in an accelerating and subsequently decelerating manner; and
in response to revising the recognition result, feeding the revised recognition result to the server to train the speech recognition model.
Dependent claims: 14, 15, 16
Specification