Speech recognition using electronic device and server
First Claim
Patent Images
1. An electronic device comprising:
- a processor configured to perform automatic speech recognition (ASR) on a speech input by using a speech recognition model that is stored in a memory; and
a communication module configured to provide the speech input to a server and receive a speech instruction, which corresponds to the speech input, from the server,wherein the processor is further configured to;
perform an operation corresponding to a result of the ASR if a confidence score of the result of the ASR is higher than a first threshold value,perform the speech instruction, which is received from the server, if the confidence score is between the first threshold value and a second threshold value, anddecrease the first threshold value if the result of the ASR corresponds to the speech instruction that is received from the server.
1 Assignment
0 Petitions
Accused Products
Abstract
An electronic device is provided. The electronic device includes a processor configured to perform automatic speech recognition (ASR) on a speech input by using a speech recognition model that is stored in a memory and a communication module configured to provide the speech input to a server and receive a speech instruction, which corresponds to the speech input, from the server. The electronic device may perform different operations according to a confidence score of a result of the ASR. Besides, it may be permissible to prepare other various embodiments speculated through the specification.
-
Citations
20 Claims
-
1. An electronic device comprising:
-
a processor configured to perform automatic speech recognition (ASR) on a speech input by using a speech recognition model that is stored in a memory; and a communication module configured to provide the speech input to a server and receive a speech instruction, which corresponds to the speech input, from the server, wherein the processor is further configured to; perform an operation corresponding to a result of the ASR if a confidence score of the result of the ASR is higher than a first threshold value, perform the speech instruction, which is received from the server, if the confidence score is between the first threshold value and a second threshold value, and decrease the first threshold value if the result of the ASR corresponds to the speech instruction that is received from the server. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A method of executing speech recognition in an electronic device, the method comprising:
-
obtaining a speech input from a user; generating a speech signal corresponding to the obtained speech; performing first speech recognition on at least a part of the speech signal; acquiring first operation information and a first confidence score; transmitting at least a part of the speech signal to a server for second speech recognition; receiving second operation information, which corresponds to the transmitted signal, from the server; performing a function corresponding to the first operation information if the first confidence score is higher than a first threshold value; performing a function corresponding to the second operation information if the first confidence score is between the first threshold value and a second threshold value; and decreasing the first threshold value if the function corresponding to the first operation information is identical to the function corresponding to the second operation information. - View Dependent Claims (14, 15, 16, 17, 18, 19)
-
-
20. A non-transitory computer readable recording medium having instructions recorded thereon, the instructions implement a method of executing speech recognition in an electronic device, the method comprising:
-
obtaining a speech input from a user; generating a speech signal corresponding to the obtained speech; performing first speech recognition on at least a part of the speech signal; acquiring first operation information and a first confidence score; transmitting at least a part of the speech signal to a server for second speech recognition; receiving second operation information, which corresponds to the transmitted signal, from the server; performing a function corresponding to the first operation information if the first confidence score is higher than a first threshold value; performing a function corresponding to the second operation information if the first confidence score is between the first threshold value and a second threshold value; and decreasing the first threshold value if the function corresponding to the first operation information is identical to the function corresponding to the second operation information.
-
Specification