Voice-enabled dialog system
First Claim
Patent Images
1. A voice-enabled spoken dialog service comprising:
- an automatic speech recognition module comprising a general-purpose acoustic model and a domain-specific model, wherein the general-purpose model is used for bootstrapping at initial deployment of the spoken dialog service and the domain-specific model is used to adapt the automatic speech recognition module after deployment, wherein a language corpus for the domain-specific acoustic model is drawn from at least one domain-specific website;
a spoken language understanding module that performs text normalization, entity extraction and semantic classification using a boosting approach that balances human-crafted rules with available data;
a dialog management module that comprises an interpreter, finite state machine engine and an action template;
a text-to-speech synthesis module for synthesizing speech; and
a frequently asked questions module, wherein the spoken dialog service communicates with the user in a first voice, and wherein the frequently asked questions module performs;
receiving an utterance from the user;
determining whether the utterance is a frequently asked question;
if the utterance is a frequently asked question, answering the frequently asked question in a second voice; and
after answering the frequently asked question in the second voice, providing a prompt in the second voice to return the user to the first voice.
5 Assignments
0 Petitions
Accused Products
Abstract
A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer the frequently asked question.
294 Citations
16 Claims
-
1. A voice-enabled spoken dialog service comprising:
-
an automatic speech recognition module comprising a general-purpose acoustic model and a domain-specific model, wherein the general-purpose model is used for bootstrapping at initial deployment of the spoken dialog service and the domain-specific model is used to adapt the automatic speech recognition module after deployment, wherein a language corpus for the domain-specific acoustic model is drawn from at least one domain-specific website; a spoken language understanding module that performs text normalization, entity extraction and semantic classification using a boosting approach that balances human-crafted rules with available data; a dialog management module that comprises an interpreter, finite state machine engine and an action template; a text-to-speech synthesis module for synthesizing speech; and a frequently asked questions module, wherein the spoken dialog service communicates with the user in a first voice, and wherein the frequently asked questions module performs; receiving an utterance from the user; determining whether the utterance is a frequently asked question; if the utterance is a frequently asked question, answering the frequently asked question in a second voice; and after answering the frequently asked question in the second voice, providing a prompt in the second voice to return the user to the first voice. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
Specification