ENHANCING MEDIA PLAYBACK WITH SPEECH RECOGNITION
First Claim
1. A method for enhancing a media file to enable speech-recognition of spoken navigation commands, comprising:
- receiving a plurality of textual items based on subject matter of the media file;
generating a grammar for each textual item, thereby generating a plurality of grammars for use by a speech recognition engine;
associating a time stamp with each grammar, wherein a time stamp indicates a location in the media file of a textual item corresponding with a grammar; and
associating the plurality of grammars with the media file, such that speech recognized by the speech recognition engine is associated with a corresponding location in the media file.
3 Assignments
0 Petitions
Accused Products
Abstract
A method for enhancing a media file to enable speech-recognition of spoken navigation commands can be provided. The method can include receiving a plurality of textual items based on subject matter of the media file and generating a grammar for each textual item, thereby generating a plurality of grammars for use by a speech recognition engine. The method can further include associating a time stamp with each grammar, wherein a time stamp indicates a location in the media file of a textual item corresponding with a grammar. The method can further include associating the plurality of grammars with the media file, such that speech recognized by the speech recognition engine is associated with a corresponding location in the media file.
-
Citations
18 Claims
-
1. A method for enhancing a media file to enable speech-recognition of spoken navigation commands, comprising:
-
receiving a plurality of textual items based on subject matter of the media file; generating a grammar for each textual item, thereby generating a plurality of grammars for use by a speech recognition engine; associating a time stamp with each grammar, wherein a time stamp indicates a location in the media file of a textual item corresponding with a grammar; and associating the plurality of grammars with the media file, such that speech recognized by the speech recognition engine is associated with a corresponding location in the media file. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer program product comprising a computer usable medium embodying computer usable program code for enhancing a media file to enable speech-recognition of spoken navigation commands, comprising:
-
computer usable program code for receiving a plurality of textual items based on subject matter of the media file; computer usable program code for generating a grammar for each textual item, thereby generating a plurality of grammars for use by a speech recognition engine; computer usable program code for associating a time stamp with each grammar, wherein a time stamp indicates a location in the media file of a textual item corresponding with a grammar; and computer usable program code for associating the plurality of grammars with the media file, such that speech recognized by the speech recognition engine is associated with a corresponding location in the media file. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer system for enhancing a media file to enable speech-recognition of spoken navigation commands, comprising:
-
a processor configured for; receiving a plurality of textual items based on subject matter of the media file; and generating a grammar for each textual item, thereby generating a plurality of grammars for use by a speech recognition engine; and a repository for storing; a grammar file including the plurality of grammars, wherein a time stamp is associated with each grammar, and wherein a time stamp indicates a location in the media file of a textual item corresponding with a grammar; and a link for associating the grammar file with the media file, such that speech recognized by the speech recognition engine is associated with a corresponding location in the media file. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification