Voice activation method, apparatus, electronic device, and storage medium
First Claim
1. A voice activation method applied to an electronic device in a vehicle, comprising:
- capturing a voice signal in a vehicle, the voice signal including a first voice signal captured in a first period from a first timestamp to a second timestamp and a second voice signal captured in a second period from a third timestamp to the first timestamp;
calculating an acoustic score of an activation keyword extracted from the first voice signal, the acoustic score being used for indicating authenticity of the activation keyword, the authenticity of the activation keyword being a probability that the activation keyword is used for waking up the electronic device to perform a voice activation operation, and a magnitude of the acoustic score being positively correlated to a magnitude of the probability;
calculating a noise level in the vehicle by using the second voice signal;
determining a voice activation threshold according to the noise level in the vehicle, a magnitude of the voice activation threshold being negatively correlated to a magnitude of the noise;
comparing the acoustic score of the first voice signal with the voice activation threshold determined according to the noise level calculated from the second voice signal; and
performing the voice activation operation when the acoustic score is greater than the voice activation threshold.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments of the present disclosure disclose a voice activation method, an apparatus, an electronic device, and a storage medium thereof. The voice activation method includes: capturing a voice signal in a vehicle; calculating an acoustic score of an activation keyword extracted from the voice signal, the acoustic score being used for indicating authenticity of the activation keyword, the authenticity of the activation keyword being a probability that the activation keyword is used for waking up the electronic device to perform a voice activation operation, and a magnitude of the acoustic score being positively correlated to a magnitude of the probability; determining a voice activation threshold according to a noise level in the vehicle, a magnitude of the voice activation threshold being negatively correlated to a magnitude of the noise; and performing the voice activation operation when the acoustic score is greater than the voice activation threshold.
19 Citations
19 Claims
-
1. A voice activation method applied to an electronic device in a vehicle, comprising:
-
capturing a voice signal in a vehicle, the voice signal including a first voice signal captured in a first period from a first timestamp to a second timestamp and a second voice signal captured in a second period from a third timestamp to the first timestamp; calculating an acoustic score of an activation keyword extracted from the first voice signal, the acoustic score being used for indicating authenticity of the activation keyword, the authenticity of the activation keyword being a probability that the activation keyword is used for waking up the electronic device to perform a voice activation operation, and a magnitude of the acoustic score being positively correlated to a magnitude of the probability; calculating a noise level in the vehicle by using the second voice signal; determining a voice activation threshold according to the noise level in the vehicle, a magnitude of the voice activation threshold being negatively correlated to a magnitude of the noise; comparing the acoustic score of the first voice signal with the voice activation threshold determined according to the noise level calculated from the second voice signal; and performing the voice activation operation when the acoustic score is greater than the voice activation threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 17, 18, 19)
-
-
8. An electronic device, the electronic device comprising:
-
one or more processors; and a memory, the memory storing one or more programs, the one or more programs being configured to be executed by the one or more processors, and the one or more programs comprising instructions for performing the following operations; capturing a voice signal in a vehicle, the voice signal including a first voice signal captured in a first period from a first timestamp to a second timestamp and a second voice signal captured in a second period from a third timestamp to the first timestamp; calculating an acoustic score of an activation keyword extracted from the first voice signal, the acoustic score being used for indicating authenticity of the activation keyword, the authenticity of the activation keyword being a probability that the activation keyword is used for waking up the electronic device to perform a voice activation operation, and a magnitude of the acoustic score being positively correlated to a magnitude of the probability; calculating a noise level in the vehicle by using the second voice signal; determining a voice activation threshold according to the noise level in the vehicle, a magnitude of the voice activation threshold being negatively correlated to a magnitude of the noise; comparing the acoustic score of the first voice signal with the voice activation threshold determined according to the noise level calculated from the second voice signal; and performing the voice activation operation when the acoustic score is greater than the voice activation threshold. - View Dependent Claims (9, 10, 11, 12, 13)
-
-
14. A non-transitory computer-readable storage medium storing computer program instructions executable by at least one processor to perform:
-
capturing a voice signal in a vehicle, the voice signal including a first voice signal captured in a first period from a first timestamp to a second timestamp and a second voice signal captured in a second period from a third timestamp to the first timestamp; calculating an acoustic score of an activation keyword extracted from the voice signal, the acoustic score being used for indicating authenticity of the activation keyword, the authenticity of the activation keyword being a probability that the activation keyword is used for waking up the electronic device to perform a voice activation operation, and a magnitude of the acoustic score being positively correlated to a magnitude of the probability; calculating a noise level in the vehicle by using the second voice signal; determining a voice activation threshold according to the noise level in the vehicle, a magnitude of the voice activation threshold being negatively correlated to a magnitude of the noise; comparing the acoustic score of the first voice signal with the voice activation threshold determined according to the noise level calculated from the second voice signal; and performing the voice activation operation when the acoustic score is greater than the voice activation threshold. - View Dependent Claims (15, 16)
-
Specification