Dynamically extending the speech prompts of a multimodal application
First Claim
Patent Images
1. A method of dynamically extending the speech prompts ofa multimodal application, the method comprising:
- receiving, by a prompt generation engine, a media file having a metadata container, wherein the prompt generation engine operates on one or more voice servers;
retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application, wherein the speech prompt is an audio phrase played by the multimodal application, wherein retrieving a speech prompt includes retrieving a speech artifact having a grammar rule or a pronunciation rule and wherein retrieving a speech prompt includes retrieving a speech artifact having an XML document; and
modifying, by the prompt generation engine, the multimodal application to include the speech prompt.
2 Assignments
0 Petitions
Accused Products
Abstract
A prompt generation engine operates to dynamically extend prompts of a multimodal application. The prompt generation engine receives a media file having a metadata container. The prompt generation engine operates on a multimodal device that supports a voice mode and a non-voice mode for interacting with the multimodal device. The prompt generation engine retrieves from the metadata container a speech prompt related to content stored in the media file for inclusion in the multimodal application. The prompt generation engine modifies the multimodal application to include the speech prompt.
92 Citations
14 Claims
-
1. A method of dynamically extending the speech prompts of
a multimodal application, the method comprising: -
receiving, by a prompt generation engine, a media file having a metadata container, wherein the prompt generation engine operates on one or more voice servers; retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application, wherein the speech prompt is an audio phrase played by the multimodal application, wherein retrieving a speech prompt includes retrieving a speech artifact having a grammar rule or a pronunciation rule and wherein retrieving a speech prompt includes retrieving a speech artifact having an XML document; and modifying, by the prompt generation engine, the multimodal application to include the speech prompt. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A voice server that supports multiple modes for interacting with a multimodal device, the voice server comprising:
-
a computer processor; a computer memory operatively coupled to the computer processor, the computer memory having disposed within it computer program instructions configured to; receive a media file having a metadata container; retrieve, from the metadata container, a speech prompt related to content stored in the media file for inclusion in a multimodal application, wherein the speech prompt is an audio phrase played by the multimodal application; modify the grammar of the speech engine to include at least one of the grammar rule and the pronunciation rule; retrieve a speech artifact having an XML, document; and modify the multimodal application to include the speech prompt. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
Specification