Method for Updating Voiceprint Feature Model and Terminal
First Claim
1. A method for updating a voiceprint feature model, comprising:
- obtaining an original audio stream comprising at least one speaker;
obtaining a respective audio stream of each speaker of the at least one speaker in the original audio stream according to a preset speaker segmentation and clustering algorithm;
separately matching the respective audio stream of each speaker of the at least one speaker with an original voiceprint feature model to obtain a successfully matched audio stream;
using the successfully matched audio stream as an additional audio stream training sample for generating the original voiceprint feature model; and
updating the original voiceprint feature model.
3 Assignments
0 Petitions
Accused Products
Abstract
A method for updating a voiceprint feature model and a terminal are provided that are applicable to the field of voice recognition technologies. The method includes: obtaining an original audio stream including at least one speaker; obtaining a respective audio stream of each speaker of the at least one speaker in the original audio stream according to a preset speaker segmentation and clustering algorithm; separately matching the respective audio stream of each speaker of the at least one speaker with an original voiceprint feature model, to obtain a successfully matched audio stream; and using the successfully matched audio stream as an additional audio stream training sample for generating the original voiceprint feature model, and updating the original voiceprint feature model.
34 Citations
18 Claims
-
1. A method for updating a voiceprint feature model, comprising:
-
obtaining an original audio stream comprising at least one speaker; obtaining a respective audio stream of each speaker of the at least one speaker in the original audio stream according to a preset speaker segmentation and clustering algorithm; separately matching the respective audio stream of each speaker of the at least one speaker with an original voiceprint feature model to obtain a successfully matched audio stream; using the successfully matched audio stream as an additional audio stream training sample for generating the original voiceprint feature model; and updating the original voiceprint feature model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A terminal, comprising:
-
an original audio stream obtaining unit; a segmentation and clustering unit; a matching unit; and a model updating unit, wherein the original audio stream obtaining unit is configured to obtain an original audio stream comprising at least one speaker, and send the original audio stream to the segmentation and clustering unit, wherein the segmentation and clustering unit is configured to receive the original audio stream sent by the original audio stream obtaining unit, obtain a respective audio stream of each speaker of the at least one speaker in the original audio stream according to a preset speaker segmentation and clustering algorithm, and send the respective audio stream of each speaker of the at least one speaker to the matching unit, wherein the matching unit is configured to receive the respective audio stream of each speaker of the at least one speaker sent by the segmentation and clustering unit, separately match the respective audio stream of each speaker of the at least one speaker with an original voiceprint feature model, to obtain a successfully matched audio stream, and send the successfully matched audio stream to the model updating unit, and wherein the model updating unit is configured to receive the successfully matched audio stream sent by the matching unit, use the successfully matched audio stream as an additional audio stream training sample for generating the original voiceprint feature model, and update the original voiceprint feature model. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification