SYSTEM AND METHOD FOR OPTIMIZING SPEECH RECOGNITION AND NATURAL LANGUAGE PARAMETERS WITH USER FEEDBACK
First Claim
Patent Images
1. A method comprising:
- receiving from a sender, via a processor, a speech document;
capturing a context of the speech document;
weighting an automatic speech recognition model based at least in part on the context of the speech document, yielding a weighted automatic speech recognition model;
converting the speech document to text using the weighted automatic speech recognition model, yielding a transcript;
receiving from a user a judgment of perceived accuracy of the transcript;
updating the weighted automatic speech recognition model based on the judgment.
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving from a sender, via a processor, a speech document; capturing a context of the speech document; weighting an automatic speech recognition model based at least in part on the context of the speech document, yielding a weighted automatic speech recognition model; converting the speech document to text using the weighted automatic speech recognition model, yielding a transcript; receiving from a user a judgment of perceived accuracy of the transcript; updating the weighted automatic speech recognition model based on the judgment. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system, comprising:
-
a processor; a first module configured to control the processor to receive, from a sender, a speech document; a second module configured to control the processor to capture a context of the speech document; a third module configured to filter an automatic speech recognition model based at least in part on the context of the speech document and word frequency, yielding a filtered automatic speech recognition model; a fourth module configured to convert the speech document to text applying the filtered automatic speech recognition model, yielding a transcript; a fifth module configured to receive from a user a judgment of perceived accuracy of the transcript; a sixth module configured to update the filtered automatic speech recognition model based on the judgment. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable storage medium storing instructions which, when executed by a computing device, cause the computing device to improve an automatic speech recognition model, the instructions comprising:
-
receiving from a sender a speech document; capturing, via a processor, a context of a speech document; weighting the automatic speech recognition model based at least in part on the context of the speech document, yielding a weighted automatic speech recognition model; converting the speech document to text using the weighted automatic speech recognition model, yielding a transcript; receiving from a user a judgment of perceived accuracy of the transcript; updating the weighted automatic speech recognition model based on the judgment. - View Dependent Claims (18, 19, 20)
-
Specification