Method for updating voiceprint feature model and terminal
First Claim
1. A method for updating a voiceprint feature model, comprising:
obtaining an original audio stream comprising at least one speaker;
obtaining a respective audio stream of each speaker of the at least one speaker in the original audio stream according to a preset speaker segmentation and clustering algorithm;
separately matching the respective audio stream of each speaker of the at least one speaker with an original voiceprint feature model to obtain a successfully matched audio stream;
using the successfully matched audio stream as an additional audio stream training sample for generating the original voiceprint feature model; and
updating the original voiceprint feature model to improve a voice recognition capability of a computing device that uses the original voiceprint feature model to identify the at least one speaker.
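The matching and updating steps of the claim can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: each speaker's audio stream is reduced to a single feature vector, the voiceprint feature model is the mean of its training vectors, matching uses cosine similarity, and a stream counts as "successfully matched" only if it is the highest-scoring stream and its score reaches a hypothetical `threshold`. The names `VoiceprintModel` and `update_model` are likewise hypothetical.

```python
import math
from dataclasses import dataclass, field


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


@dataclass
class VoiceprintModel:
    """Toy model (assumption): the mean of all training feature vectors."""
    samples: list = field(default_factory=list)

    def centroid(self):
        n = len(self.samples)
        return [sum(col) / n for col in zip(*self.samples)]

    def score(self, features):
        return cosine(self.centroid(), features)


def update_model(model, speaker_streams, threshold=0.9):
    """Match each speaker's stream against the model and retrain on the match.

    speaker_streams maps a speaker label to one feature vector, standing in
    for the output of the segmentation and clustering step.
    Returns the label of the matched speaker, or None if nothing matched.
    """
    scores = {label: model.score(vec) for label, vec in speaker_streams.items()}
    best = max(scores, key=scores.get, default=None)
    if best is None or scores[best] < threshold:
        return None  # no stream matched; the model is left unchanged
    # Add the matched stream to the training samples; the centroid, and so
    # the model, is recomputed from the enlarged sample set on the next use.
    model.samples.append(speaker_streams[best])
    return best


model = VoiceprintModel(samples=[[1.0, 0.0], [0.9, 0.1]])
streams = {"A": [0.95, 0.05], "B": [0.0, 1.0]}
matched = update_model(model, streams)
```

In this toy run, speaker "A" lies close to the model's existing samples and is added as a new training sample, while speaker "B" is ignored. A real system would use a trained statistical or neural speaker model in place of the mean vector.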
Abstract
A method for updating a voiceprint feature model and a terminal are provided that are applicable to the field of voice recognition technologies. The method includes: obtaining an original audio stream including at least one speaker; obtaining a respective audio stream of each speaker of the at least one speaker in the original audio stream according to a preset speaker segmentation and clustering algorithm; separately matching the respective audio stream of each speaker of the at least one speaker with an original voiceprint feature model, to obtain a successfully matched audio stream; and using the successfully matched audio stream as an additional audio stream training sample for generating the original voiceprint feature model, and updating the original voiceprint feature model.
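The abstract does not say which speaker segmentation and clustering algorithm is preset. As a rough sketch of the idea only, the example below assumes the audio is already converted to per-frame feature vectors, cuts it into fixed-length segments (a simplification; practical systems usually detect speaker change points), and greedily groups segments whose mean vectors are similar. The function name `split_by_speaker` and the parameters `segment_len` and `same_speaker` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def split_by_speaker(frames, segment_len=2, same_speaker=0.8):
    """Return one list of frames per detected speaker.

    Segmentation: cut the frame sequence into fixed-length segments.
    Clustering: assign each segment to the first existing cluster whose mean
    is similar enough; otherwise start a new cluster (a new speaker).
    """
    segments = [frames[i:i + segment_len]
                for i in range(0, len(frames), segment_len)]
    clusters = []  # each cluster holds all frames attributed to one speaker
    for seg in segments:
        seg_mean = mean(seg)
        for cluster in clusters:
            if cosine(mean(cluster), seg_mean) >= same_speaker:
                cluster.extend(seg)
                break
        else:
            clusters.append(list(seg))
    return clusters


# Two speakers taking turns: speaker 1, speaker 2, then speaker 1 again.
frames = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9], [1.0, 0.1], [0.9, 0.0]]
speakers = split_by_speaker(frames)
```

Here the first and third segments end up in the same cluster, so the result contains two per-speaker streams, each of which could then be matched against the voiceprint feature model.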
20 Claims
1. A method for updating a voiceprint feature model, comprising:
obtaining an original audio stream comprising at least one speaker;
obtaining a respective audio stream of each speaker of the at least one speaker in the original audio stream according to a preset speaker segmentation and clustering algorithm;
separately matching the respective audio stream of each speaker of the at least one speaker with an original voiceprint feature model to obtain a successfully matched audio stream;
using the successfully matched audio stream as an additional audio stream training sample for generating the original voiceprint feature model; and
updating the original voiceprint feature model to improve a voice recognition capability of a computing device that uses the original voiceprint feature model to identify the at least one speaker.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A terminal, comprising:
a non-transitory computer readable medium having instructions stored thereon; and
a computer processor coupled to the non-transitory computer readable medium and configured to execute the instructions to:
obtain an original audio stream comprising at least one speaker;
obtain a respective audio stream of each speaker of the at least one speaker in the original audio stream according to a preset speaker segmentation and clustering algorithm;
separately match the respective audio stream of each speaker of the at least one speaker with an original voiceprint feature model, to obtain a successfully matched audio stream;
use the successfully matched audio stream as an additional audio stream training sample for generating the original voiceprint feature model; and
update the original voiceprint feature model to improve a voice recognition capability of a computing device that uses the original voiceprint feature model to identify the at least one speaker.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
Specification