Dynamically extending the speech prompts of a multimodal application
First Claim
1. A method of dynamically extending the speech prompts of a multimodal application, the method implemented with a prompt generation engine, a module of automated computing machinery operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes, wherein the voice mode includes accepting speech input from a user, digitizing the speech, and providing digitized speech to a speech engine, and wherein the non-voice mode includes accepting input from a user through physical user interaction with a user input device for the multimodal device;
- wherein the multimodal device comprises a module of automated computing machinery for executing the multimodal application and supports execution of a media file player, a module of automated computing machinery for playing media files;
the method comprising;
receiving, by the prompt generation engine, a media file having a metadata container;
retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application; and
modifying, by the prompt generation engine, the multimodal application to include the speech prompt.
3 Assignments
0 Petitions
Accused Products
Abstract
Dynamically extending the speech prompts of a multimodal application including receiving, by the prompt generation engine, a media file having a metadata container; retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application; and modifying, by the prompt generation engine, the multimodal application to include the speech prompt.
-
Citations
18 Claims
-
1. A method of dynamically extending the speech prompts of a multimodal application, the method implemented with a prompt generation engine, a module of automated computing machinery operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes, wherein the voice mode includes accepting speech input from a user, digitizing the speech, and providing digitized speech to a speech engine, and wherein the non-voice mode includes accepting input from a user through physical user interaction with a user input device for the multimodal device;
- wherein the multimodal device comprises a module of automated computing machinery for executing the multimodal application and supports execution of a media file player, a module of automated computing machinery for playing media files;
the method comprising; receiving, by the prompt generation engine, a media file having a metadata container; retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application; and modifying, by the prompt generation engine, the multimodal application to include the speech prompt. - View Dependent Claims (2, 3, 4, 5, 6)
- wherein the multimodal device comprises a module of automated computing machinery for executing the multimodal application and supports execution of a media file player, a module of automated computing machinery for playing media files;
-
7. An apparatus for dynamically extending the speech prompts of a multimodal application, the apparatus including a prompt generation engine and a multimodal application operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes, the apparatus comprising a computer processor and a computer memory operatively coupled to the computer processor, the computer memory having disposed within it computer program instructions for:
-
receiving, by the prompt generation engine, a media file having a metadata container; retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application; and modifying, by the prompt generation engine, the multimodal application to include the speech prompt. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer program product for dynamically extending the speech prompts of a multimodal application, the computer program product including a prompt generation engine for operating on a multimodal device supporting multiple modes of user interaction with the multimodal application, the modes of user interaction including a voice mode and one or more non-voice modes, the computer program product disposed upon a computer-readable, recording medium, the computer program product comprising computer program instructions for:
-
receiving, by the prompt generation engine, a media file having a metadata container; retrieving, by the prompt generation engine from the metadata container, a speech prompt related to content stored in the media file for inclusion in the multimodal application; and modifying, by the prompt generation engine, the multimodal application to include the speech prompt. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification