×

System and method for semantic video segmentation based on joint audiovisual and text analysis

  • US 8,121,432 B2
  • Filed: 03/25/2008
  • Issued: 02/21/2012
  • Est. Priority Date: 08/24/2005
  • Status: Expired due to Fees
First Claim
Patent Images

1. A method for partitioning a video sequence, comprising:

  • dividing a video sequence into a plurality of segments;

    generating a transcript of speech content of the video sequence, wherein the transcript comprises a plurality of words and identifies temporal locations of the words in the video sequence;

    selecting a plurality of keywords from the plurality of words in the transcript;

    selecting a set of keywords from the plurality of keywords, wherein the keywords in the set of keywords are related to each other by meanings of the keywords;

    determining a distribution of occurrences across the plurality of segments of the keywords in the set of keywords;

    selecting a group of segments from the plurality of segments using the distribution, wherein the segments in the group of segments are temporally adjacent and the group of segments corresponds to a peak of the occurrences across the plurality of segments of the keywords in the set of keywords; and

    forming a partition of the video sequence from the group of segments;

    wherein generating the transcript of speech content of the video sequence comprisesgenerating the transcript from audio content of the video sequence using automatic speech recognition,determining whether the transcript generated from the audio content is satisfactory,responsive to a determination that the transcript generated from the audio content is not satisfactory, determining whether the video sequence has closed caption, andresponsive to a determination that the video sequence has closed caption, generating the transcript from the closed caption.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×