Automatic Video Event Detection and Indexing
First Claim
1. A method for use in indexing video footage, the video footage comprising an image signal and a corresponding audio signal relating to the image signals, the method comprising:
extracting audio features from the audio signal of segments of the video footage and visual features from the image signal of the segments of the video footage, each segment comprising a plurality of frames;
comparing the extracted audio and visual features with predetermined audio and visual features associated with predetermined audio and visual keywords;
identifying the audio and visual keywords associated with the video footage based on the comparison of the extracted audio and visual features with the predetermined audio and visual features associated with the predetermined audio and visual keywords; and
determining the presence of events in the video footage based on the identified audio and visual keywords associated with the video footage.
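The comparing, identifying, and determining steps of the claim can be illustrated with a minimal sketch. It is not the patented implementation: it assumes that "comparison" means nearest-prototype matching by Euclidean distance and that an event is declared when a fixed set of keywords is identified in a segment. The keyword names, prototype vectors, and event rules below are all hypothetical and chosen only for illustration; the claim does not specify them.

```python
import math

# Hypothetical predetermined features, one prototype vector per keyword.
AUDIO_KEYWORDS = {
    "whistle": [0.9, 0.1],
    "cheering": [0.2, 0.8],
}
VISUAL_KEYWORDS = {
    "goal_area": [0.7, 0.3, 0.1],
    "midfield": [0.1, 0.2, 0.9],
}

# Hypothetical event rules: an event is present in a segment when every
# keyword in its required set has been identified for that segment.
EVENT_RULES = {
    "goal": {"cheering", "goal_area"},
    "foul": {"whistle", "midfield"},
}


def nearest_keyword(features, prototypes):
    """Compare extracted features with each prototype; return the closest keyword."""
    return min(prototypes, key=lambda name: math.dist(features, prototypes[name]))


def identify_keywords(segment):
    """Identify one audio and one visual keyword for a segment's extracted features."""
    return {
        nearest_keyword(segment["audio"], AUDIO_KEYWORDS),
        nearest_keyword(segment["visual"], VISUAL_KEYWORDS),
    }


def detect_events(segments):
    """Return (segment_index, event) pairs that can serve as index entries."""
    index = []
    for i, segment in enumerate(segments):
        keywords = identify_keywords(segment)
        for event, required in EVENT_RULES.items():
            if required <= keywords:  # all required keywords were identified
                index.append((i, event))
    return index


# Features assumed to have been extracted already, one entry per segment.
segments = [
    {"audio": [0.25, 0.75], "visual": [0.6, 0.3, 0.2]},
    {"audio": [0.85, 0.2], "visual": [0.2, 0.2, 0.8]},
]
events = detect_events(segments)
```

A nearest-prototype match is only one way to read "comparing"; a trained classifier per keyword would fit the claim language equally well.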
Abstract
A method for use in indexing video footage, the video footage comprising an image signal and a corresponding audio signal relating to the image signals, the method comprising extracting audio features from the audio signal of the video footage and visual features from the image signal of the video footage; comparing the extracted audio and visual features with predetermined audio and visual keywords; identifying the audio and visual keywords associated with the video footage based on the comparison of the extracted audio and visual features with the predetermined audio and visual keywords; and determining the presence of events in the video footage based on the audio and visual keywords associated with the video footage.
156 Citations
27 Claims
1. A method for use in indexing video footage, the video footage comprising an image signal and a corresponding audio signal relating to the image signals, the method comprising:
extracting audio features from the audio signal of segments of the video footage and visual features from the image signal of the segments of the video footage, each segment comprising a plurality of frames;
comparing the extracted audio and visual features with predetermined audio and visual features associated with predetermined audio and visual keywords;
identifying the audio and visual keywords associated with the video footage based on the comparison of the extracted audio and visual features with the predetermined audio and visual features associated with the predetermined audio and visual keywords; and
determining the presence of events in the video footage based on the identified audio and visual keywords associated with the video footage.
Dependent claims: 2 to 26.
27. A system for indexing video footage, the video footage comprising an image signal and a corresponding audio signal relating to the image signals, the system comprising:
means for extracting audio features from the audio signal of segments of the video footage and visual features from the image signal of the segments of the video footage, each segment comprising a plurality of frames;
means for comparing the extracted audio and visual features with predetermined audio and visual features associated with predetermined audio and visual keywords;
means for identifying the audio and visual keywords associated with the video footage based on the comparison of the extracted audio and visual features with the predetermined audio and visual features associated with the predetermined audio and visual keywords; and
means for determining the presence of events in the video footage based on the identified audio and visual keywords associated with the video footage.
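The extracting means operates on segments that each comprise a plurality of frames. The sketch below shows one possible way to form such segments and reduce each to a feature vector. It is an assumption-laden illustration, not the patent's extractor: the fixed segment length, the frame layout (a dict with `audio` samples and `image` pixel intensities), and the two features (mean audio energy and mean brightness) are all hypothetical.

```python
from statistics import fmean


def split_into_segments(frames, frames_per_segment):
    """Group consecutive frames into fixed-length segments.

    Trailing frames that do not fill a whole segment are dropped
    (a simplifying assumption of this sketch).
    """
    if frames_per_segment < 2:
        raise ValueError("each segment must comprise a plurality of frames")
    last_start = len(frames) - frames_per_segment
    return [
        frames[start:start + frames_per_segment]
        for start in range(0, last_start + 1, frames_per_segment)
    ]


def segment_features(segment):
    """Extract one audio and one visual feature vector for a segment."""
    # Mean squared sample value per frame, averaged over the segment.
    audio_energy = fmean(fmean(s * s for s in frame["audio"]) for frame in segment)
    # Mean pixel intensity per frame, averaged over the segment.
    brightness = fmean(fmean(frame["image"]) for frame in segment)
    return {"audio": [audio_energy], "visual": [brightness]}


# Five identical toy frames: two audio samples and two pixel intensities each.
frames = [{"audio": [1.0, -1.0], "image": [0.0, 1.0]} for _ in range(5)]
segments = split_into_segments(frames, frames_per_segment=2)
features = [segment_features(segment) for segment in segments]
```

Averaging over the frames of a segment is the simplest aggregation; the per-segment vectors it produces are the kind of input the comparing means would then match against the predetermined features.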
Specification