System and method for optimizing speech recognition and natural language parameters with user feedback
First Claim
Patent Images
1. A method comprising:
- weighting a first automatic speech recognition model, to yield a weighted first automatic speech recognition model;
weighting a second automatic speech recognition model, to yield a weighted second automatic speech recognition model;
converting, via a processor, a speech document to text using the weighted first automatic speech recognition model, to yield a first transcript;
converting, via the processor, the speech document to text using the weighted second automatic speech recognition model, to yield a second transcript;
receiving, from a user, a judgment of perceived accuracy of the first transcript and the second transcript; and
updating, via the processor, the weighted first automatic speech recognition model and the weighted second automatic speech recognition model based on the judgment.
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.
-
Citations
20 Claims
-
1. A method comprising:
-
weighting a first automatic speech recognition model, to yield a weighted first automatic speech recognition model; weighting a second automatic speech recognition model, to yield a weighted second automatic speech recognition model; converting, via a processor, a speech document to text using the weighted first automatic speech recognition model, to yield a first transcript; converting, via the processor, the speech document to text using the weighted second automatic speech recognition model, to yield a second transcript; receiving, from a user, a judgment of perceived accuracy of the first transcript and the second transcript; and updating, via the processor, the weighted first automatic speech recognition model and the weighted second automatic speech recognition model based on the judgment. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprising:
-
a processor; and a computer-readable storage device having instructions stored which, when executed by the processor, cause the processor to perform operations comprising; weighting a first automatic speech recognition model, to yield a weighted first automatic speech recognition model; weighting a second automatic speech recognition model, to yield a weighted second automatic speech recognition model; converting a speech document to text using the weighted first automatic speech recognition model, to yield a first transcript; converting the speech document to text using the weighted second automatic speech recognition model, to yield a second transcript; receiving, from a user, a judgment of perceived accuracy of the first transcript and the second transcript; and updating the weighted first automatic speech recognition model and the weighted second automatic speech recognition model based on the judgment. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising:
-
weighting a first automatic speech recognition model, to yield a weighted first automatic speech recognition model; weighting a second automatic speech recognition model, to yield a weighted second automatic speech recognition model; converting a speech document to text using the weighted first automatic speech recognition model, to yield a first transcript; converting the speech document to text using the weighted second automatic speech recognition model, to yield a second transcript; receiving, from a user, a judgment of perceived accuracy of the first transcript and the second transcript; and updating the weighted first automatic speech recognition model and the weighted second automatic speech recognition model based on the judgment. - View Dependent Claims (20)
-
Specification