DATA SHREDDING FOR SPEECH RECOGNITION ACOUSTIC MODEL TRAINING UNDER DATA RETENTION RESTRICTIONS
First Claim
1. A method of enabling training of an acoustic model, the method comprising:
- dynamically shredding a speech corpus to produce text segments and depersonalized audio features corresponding to the text segments; and
enabling a system to train an acoustic model using the text segments and the depersonalized audio features.
2 Assignments
0 Petitions
Accused Products
Abstract
Training speech recognizers, e.g., their language or acoustic models, using actual user data is useful, but retaining personally identifiable information may be restricted in certain environments due to regulations. Accordingly, a method or system is provided for enabling training of an acoustic model which includes dynamically shredding a speech corpus to produce text segments and depersonalized audio features corresponding to the text segments. The method further includes enabling a system to train an acoustic model using the text segments and the depersonalized audio features. Because the data is depersonalized, actual data may be used, enabling speech recognizers to keep up-to-date with user trends in speech and usage, among other benefits.
-
Citations
20 Claims
-
1. A method of enabling training of an acoustic model, the method comprising:
-
dynamically shredding a speech corpus to produce text segments and depersonalized audio features corresponding to the text segments; and enabling a system to train an acoustic model using the text segments and the depersonalized audio features. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system for enabling training of an acoustic model, the system comprising:
-
a shredding module configured to shred a speech corpus dynamically to produce text segments and depersonalized audio features corresponding to the text segments; and an enabling module configured to enable a system to train an acoustic model using the text segments and the depersonalized audio features. - View Dependent Claims (15, 16, 17, 18, 19)
-
-
20. A computer program product comprising a non-transitory computer-readable medium storing instructions for performing a method of enabling training of a acoustic model, the instructions, when loaded and executed by a processor, cause the processor to:
-
dynamically shred a speech corpus to produce text segments and depersonalized audio features corresponding to the text segments; and enable a system to train an acoustic model using the text segments and the depersonalized audio features.
-
Specification