SYSTEM AND METHOD FOR GENERATING USER MODELS FROM TRANSCRIBED DIALOGS
First Claim
1. A computer-implemented method of generating personalized user models, the method comprising:
- receiving automatic speech recognition (ASR) output of a plurality of speech interactions with a user;
receiving an ASR transcription error model characterizing how ASR transcription errors are made;
generating guesses of a true transcription and a user model via an expectation maximization (EM) algorithm based on the error model and the respective ASR output; and
generating a personalized user model based on the guesses.
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for generating personalized user models. The method includes receiving automatic speech recognition (ASR) output of speech interactions with a user, receiving an ASR transcription error model characterizing how ASR transcription errors are made, generating guesses of a true transcription and a user model via an expectation maximization (EM) algorithm based on the error model and the respective ASR output where the guesses will converge to a personalized user model which maximizes the likelihood of the ASR output. The ASR output can be unlabeled. The method can include casting speech interactions as a dynamic Bayesian network with four variables: (s), (u), (r), (m), and encoding relationships between (s), (u), (r), (m) as conditional probability tables. At each dialog turn (r) and (m) are known and (s) and (u) are hidden.
12 Citations
20 Claims
-
1. A computer-implemented method of generating personalized user models, the method comprising:
-
receiving automatic speech recognition (ASR) output of a plurality of speech interactions with a user; receiving an ASR transcription error model characterizing how ASR transcription errors are made; generating guesses of a true transcription and a user model via an expectation maximization (EM) algorithm based on the error model and the respective ASR output; and generating a personalized user model based on the guesses. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A system for recognizing speech using personalized speech models, the system comprising:
-
a processor; a module configured to control the processor to receive automatic speech recognition (ASR) output of a plurality of speech interactions with a user; a module configured to control the processor to receive an ASR transcription error model characterizing how ASR transcription errors are made; a module configured to control the processor to generate guesses of a true transcription and a user model via an expectation maximization (EM) algorithm based on the error model and the respective ASR output; and a module configured to control the processor to generate a personalized user model based on the guesses. - View Dependent Claims (17, 18)
-
-
19. A computer-readable storage medium storing a computer program having instructions for controlling a processor to generate personalized user models, the instructions comprising:
-
receiving a user model personalized for a specific user generated by steps comprising; receiving automatic speech recognition (ASR) output of a plurality of speech interactions with the specific user; receiving an ASR transcription error model characterizing how ASR transcription errors are made; generating guesses of a true transcription and a user model via an expectation maximization (EM) algorithm based on the error model and the respective ASR output; generating a personalized user model for the specific user based on the guesses; and building a personalized dialog system for the specific user based on the received personalized user model. - View Dependent Claims (20)
-
Specification