SPEAKER ADAPTION METHOD AND APPARATUS, AND STORAGE MEDIUM
First Claim
Patent Images
1. A speaker adaption method, comprising:
- acquiring first speech data of a target speaker;
inputting the first speech data to a pre-trained batch normalization (BN) network to be subjected to an adaptive training to acquire a speech recognition model comprising a speech parameter of the target speaker.
1 Assignment
0 Petitions
Accused Products
Abstract
A speaker adaption method and a speaker adaption apparatus, a device and a storage medium are provided. The method includes: acquiring first speech data of a target speaker; inputting the first speech data to a pre-trained batch normalization (BN) network to be subjected to an adaptive training to acquire a speech recognition model including a speech parameter of the target speaker.
11 Citations
18 Claims
-
1. A speaker adaption method, comprising:
-
acquiring first speech data of a target speaker; inputting the first speech data to a pre-trained batch normalization (BN) network to be subjected to an adaptive training to acquire a speech recognition model comprising a speech parameter of the target speaker. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A speaker adaption apparatus, comprising:
-
one or more processors; a memory; one or more software modules stored in the memory and executable by the one or more processors, and comprising; a speech data acquiring module configured to acquire first speech data of a target speaker; a model training module configured to input the first speech data to a pre-trained batch normalization (BN) network to be subjected to an adaptive training to acquire a speech recognition model comprising a speech parameter of the target speaker. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer-readable storage medium having stored therein computer programs that, when executed by a processor of a terminal, cause the terminal to perform a speaker adaption method, the method comprising:
-
acquiring first speech data of a target speaker; inputting the first speech data to a pre-trained batch normalization (BN) network to be subjected to an adaptive training to acquire a speech recognition model comprising a speech parameter of the target speaker. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification