Multimodal access of meeting recordings
First Claim
Patent Images
1. A method for retrieving content comprising multimedia information, the method comprising:
- receiving a request to retrieve a portion of said multimedia information;
in response to said request, producing a first portion of said multimedia information and presenting said first portion of said multimedia information;
producing a term-frequency inverse document frequency (TF-IDF) score for each word in a plurality words contained in a transcript of said multimedia information;
producing a plurality of keywords by taking words from said plurality of words based on said TF-IDF scores; and
for each keyword, producing a document occurrence score for a plurality of segments of said transcript and associating a segment having a highest document occurrence score with said each keyword, said segments corresponding to portions of said multimedia information,wherein producing said first portion includes incorporating portions of said multimedia information corresponding to one or more of said segments of said transcript,said producing a first portion of said multimedia information comprising;
selecting a frame of video from a plurality of video frames, said selecting based on a visual significance score, said visual significance score indicating a responsiveness of said selected frame to said request;
selecting a segment of audio of said multimedia information, said selecting based on an audio significance score, said audio significance score indicating a responsiveness of said selected segment of audio to said request, said segment of audio having a corresponding video clip;
generating a video skim from said selected frame of video and said video clip; and
displaying said generated video skim.
1 Assignment
0 Petitions
Accused Products
Abstract
A meeting recorder captures multimodal information of a meeting. Subsequent analysis of the information produces scores indicative of visually and aurally significant events that can help identify significant segments of the meeting recording. Textual analysis can enhance searching for significant meeting segments and otherwise enhance the presentation of the meeting segments.
191 Citations
17 Claims
-
1. A method for retrieving content comprising multimedia information, the method comprising:
-
receiving a request to retrieve a portion of said multimedia information; in response to said request, producing a first portion of said multimedia information and presenting said first portion of said multimedia information; producing a term-frequency inverse document frequency (TF-IDF) score for each word in a plurality words contained in a transcript of said multimedia information; producing a plurality of keywords by taking words from said plurality of words based on said TF-IDF scores; and for each keyword, producing a document occurrence score for a plurality of segments of said transcript and associating a segment having a highest document occurrence score with said each keyword, said segments corresponding to portions of said multimedia information, wherein producing said first portion includes incorporating portions of said multimedia information corresponding to one or more of said segments of said transcript, said producing a first portion of said multimedia information comprising; selecting a frame of video from a plurality of video frames, said selecting based on a visual significance score, said visual significance score indicating a responsiveness of said selected frame to said request; selecting a segment of audio of said multimedia information, said selecting based on an audio significance score, said audio significance score indicating a responsiveness of said selected segment of audio to said request, said segment of audio having a corresponding video clip; generating a video skim from said selected frame of video and said video clip; and displaying said generated video skim. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system for accessing content from multimedia data, the system comprising:
-
a data component configured to provide said multimedia data; a scoring component operative to produce scores corresponding to segments of said multimedia data, wherein said scores include visual significance scores and audio significance scores; and a selection component operative to select segments of said multimedia data based on their corresponding scores, wherein said selection is performed in response to a request for a portion of said multimedia data, wherein each of said scores indicates a responsiveness of the corresponding segment to the request, wherein said scoring component is further operative to produce a video skim incorporating portions of said multimedia data corresponding to one or more of said selected segments, wherein said scoring component comprises a computer program configured to perform operations comprising; producing a term-frequency inverse document frequency (TF-IDF) score for each word in a plurality words contained in a transcript of said multimedia data; producing a plurality of keywords by taking words from said plurality of words based on said TF-IDF scores; and for each keyword, producing a document occurrence score for a plurality of segments of said transcript and associating a segment having a highest document occurrence score with said each keyword, said segments corresponding to portions of said multimedia data. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17)
-
Specification