Audio-based chapter detection in multimedia stream
First Claim
1. In a multimedia processing device, a method comprising:
- identifying an interval in a multimedia stream as potentially representing a chapter break in a program represented by the multimedia stream; and
characterizing the interval as representing an actual chapter break responsive to at least one variance in audio data of the multimedia stream between a point at a start of the interval and a point at an end of the interval, the at least one variance comprising at least one of;
a variance in average bitrate of the audio data; and
a variance in average highest frequency of the audio data.
3 Assignments
0 Petitions
Accused Products
Abstract
A multimedia processing system identifies chapter breaks in a program represented by multimedia data through an analysis of audio content of a portion of the multimedia data so as to identify an interval that potentially represents a chapter break. This audio analysis can include an analysis to identify changes in high frequency edges in the audio content, an analysis to identify changes in the total energy in a central frequency band of the audio content, an analysis to identify patterns of sequentially repeating values in the audio content, an analysis to identify changes in bitrate, or some combination thereof. One or more variances in the audio information at (e.g., before) the start of the interval and the audio information at (e.g., after) the end of the interval then may be used to identify or otherwise characterize the interval as representing an actual chapter break. Further, a chapter represented by two consecutive chapter breaks can be identified as an advertisement based on the duration between the two consecutive chapter breaks and thus the multimedia processing device can implement a “commercial skip function” by omitting playback of the portion of the multimedia data representing the chapter responsive to the chapter being identified as an advertisement.
-
Citations
20 Claims
-
1. In a multimedia processing device, a method comprising:
-
identifying an interval in a multimedia stream as potentially representing a chapter break in a program represented by the multimedia stream; and characterizing the interval as representing an actual chapter break responsive to at least one variance in audio data of the multimedia stream between a point at a start of the interval and a point at an end of the interval, the at least one variance comprising at least one of;
a variance in average bitrate of the audio data; and
a variance in average highest frequency of the audio data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A multimedia processing device comprising:
a chapter detection module to identify an interval in a multimedia stream as potentially representing a chapter break in a program represented by the multimedia stream and to characterize the interval as representing an actual chapter break responsive to at least one variance in audio data of the multimedia stream between a point at a start of the interval and a point at the end of the interval, the at least one variance comprising at least one of;
a variance in average bitrate of the audio data; and
a variance in average highest frequency of the audio data.- View Dependent Claims (13, 14, 15, 16, 17)
-
18. In a multimedia processing device, a method comprising:
-
identifying a first interval of a multimedia stream as potentially representing a first chapter break based on an analysis of audio data of the multimedia stream; identifying a second interval of the multimedia stream as potentially representing a second chapter break based on the analysis of audio data, the second interval following the first interval in a playback sequence of the multimedia stream; characterizing the first interval as a first advertisement chapter break based on a duration of the first interval, a variance in an average bit rate of audio data of the multimedia stream at a start of the first interval and an average bit rate of audio data of the multimedia stream at an end of the first interval, a variance in an average high frequency edge of audio data at the start of the first interval and an average high frequency edge of audio data at the end of the first interval, and a number of sequentially repeating values in the audio data for the first interval; and characterizing the second interval as a second advertisement chapter break based on a duration of the second interval, a variance in an average bit rate of audio data at a start of the second interval and an average bit rate of audio data at an end of the second interval, a variance in an average high frequency edge of audio data at the start of the second interval and an average high frequency edge of audio data at the end of the second interval, and a number of sequentially repeating values in the audio data for the second interval. - View Dependent Claims (19, 20)
-
Specification