Enhancing media playback with speech recognition

  • US 8,478,592 B2
  • Filed: 07/28/2008
  • Issued: 07/02/2013
  • Est. Priority Date: 07/08/2008
  • Status: Active Grant
First Claim

1. A method for enhancing a media file to enable speech-recognition of spoken navigation commands, comprising:

  • receiving a plurality of textual items relating to the subject matter of the media file;

    generating at least one grammar comprising one or more grammar entries, wherein the one or more grammar entries comprise grammar entries that are generated for at least some of the plurality of the textual items, and comprise a word or word sequence recognizable by a speech recognition engine;

    for each of the grammar entries corresponding to content in the media file, determining one or more time stamps for the grammar entry, each time stamp indicating a location in the media file of content corresponding to the grammar entry; and

    via a computer processor, locating content in the media file during playback of the media file by (a) receiving speech input from a user, (b) recognizing the speech input using the speech recognition engine and the at least one grammar to produce a speech recognition result corresponding at least in part to a recognized grammar entry of the at least one grammar, and (c) identifying one or more locations in the media file by identifying the one or more time stamps determined for the recognized grammar entry and the current time position of the media file at playback when the user input is received, wherein upon identifying the location in the media file a media controller navigates to the time stamp identified and presents the media file to the user at the identified timestamp location.
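
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `GrammarEntry`, `build_grammar`, `recognize`, and `locate` are hypothetical, the speech recognition engine is replaced by a plain substring match over an already-transcribed utterance, and the rule for combining the time stamps with the current playback position (jump to the next occurrence, wrapping to the first) is an assumption, since the claim does not fix one.

```python
from bisect import bisect_right
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class GrammarEntry:
    """A word sequence the recognizer may match, plus where it occurs."""
    phrase: str
    timestamps: List[float] = field(default_factory=list)  # seconds, sorted


def normalize(text: str) -> str:
    """Lower-case and collapse whitespace so phrases compare consistently."""
    return " ".join(text.lower().split())


def build_grammar(textual_items, occurrences) -> Dict[str, GrammarEntry]:
    """Create one grammar entry per textual item and attach its time stamps.

    `occurrences` maps a textual item to the positions (in seconds) where
    the corresponding content appears; how those positions are obtained is
    outside this sketch.
    """
    grammar = {}
    for item in textual_items:
        phrase = normalize(item)
        grammar[phrase] = GrammarEntry(phrase, sorted(occurrences.get(item, [])))
    return grammar


def recognize(transcript: str, grammar) -> Optional[GrammarEntry]:
    """Stand-in for a speech recognition engine (assumption).

    Returns the longest grammar phrase found as whole words in the
    transcript, or None when nothing matches.
    """
    padded = f" {normalize(transcript)} "
    matches = [e for p, e in grammar.items() if f" {p} " in padded]
    return max(matches, key=lambda e: len(e.phrase), default=None)


def locate(entry: GrammarEntry, current_position: float) -> Optional[float]:
    """Pick a time stamp using the entry's stamps and the playback position.

    Assumed policy: the first occurrence after the current position,
    wrapping around to the earliest occurrence if none remains.
    """
    if not entry.timestamps:
        return None
    idx = bisect_right(entry.timestamps, current_position)
    return entry.timestamps[idx % len(entry.timestamps)]
```

In this sketch a media controller would seek to the value returned by `locate`. For example, with a grammar built from the items "Opening Goal" and "Penalty Kick", the utterance "go to the penalty kick" matches the second entry, and the chosen time stamp depends on where playback stands when the command is spoken.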
