SYSTEM AND METHOD FOR IMPROVING SPEECH RECOGNITION ACCURACY USING TEXTUAL CONTEXT
First Claim
1. A method for improving speech recognition accuracy using textual context, the method causing a computing device to perform steps comprising:
- retrieving a recorded utterance;
retrieving text captured from a device display associated with the spoken dialog and viewed by one party to the recorded utterance;
identifying words in the captured text that are relevant to the recorded utterance;
adding the identified words to a dynamic language model; and
recognizing the recorded utterance using the dynamic language model.
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are systems, methods, and computer-readable storage media for improving speech recognition accuracy using textual context. The method includes retrieving a recorded utterance, capturing text from a device display associated with the spoken dialog and viewed by one party to the recorded utterance, and identifying words in the captured text that are relevant to the recorded utterance. The method further includes adding the identified words to a dynamic language model, and recognizing the recorded utterance using the dynamic language model. The recorded utterance can be a spoken dialog. A time stamp can be assigned to each identified word. The method can include adding identified words to and/or removing identified words from the dynamic language model based on their respective time stamps. A screen scraper can capture text from the device display associated with the recorded utterance. The device display can contain customer service data.
-
Citations
20 Claims
-
1. A method for improving speech recognition accuracy using textual context, the method causing a computing device to perform steps comprising:
-
retrieving a recorded utterance; retrieving text captured from a device display associated with the spoken dialog and viewed by one party to the recorded utterance; identifying words in the captured text that are relevant to the recorded utterance; adding the identified words to a dynamic language model; and recognizing the recorded utterance using the dynamic language model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system for improving speech recognition accuracy using textual context, the system comprising:
-
a processor; a module configured to control the processor to retrieve a recorded utterance; a module configured to control the processor to capture text from a device display associated with the spoken dialog and viewed by one party to the recorded utterance; a module configured to control the processor to identify words in the captured text that are relevant to the recorded utterance; a module configured to control the processor to add the identified words to a dynamic language model; and a module configured to control the processor to recognize the recorded utterance using the dynamic language model. - View Dependent Claims (15, 16, 17)
-
-
18. A computer-readable storage medium storing instructions for improving speech recognition accuracy using textual context which, when executed by a computing device, cause the computing device to perform steps comprising:
-
retrieving a recorded utterance; capturing text from a device display associated with the spoken dialog and viewed by one party to the recorded utterance; identifying words in the captured text that are relevant to the recorded utterance; adding the identified words to a dynamic language model; and recognizing the recorded utterance using the dynamic language model. - View Dependent Claims (19, 20)
-
Specification