System and method for generating an audio thumbnail of an audio track
First Claim
1. A method for generating an audio thumbnail of an audio track, comprising:
- detecting a first content feature within an audio track;
extracting a first portion of the audio track corresponding to the detected first content feature;
detecting an occurrence of an increase in energy within the audio track;
extracting a second portion of the audio track corresponding to the detected increase in energy; and
combining the extracted first and second portions of the audio track into an audio thumbnail of the audio track.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and system for generating an audio thumbnail of an audio track in which a first content feature, such as singing, is detected as a characteristic of an audio track. A predetermined length of the detected portion of the audio track corresponding to the first content feature is extracted from the audio track. A highlight of the audio track, such as a portion of the audio track having a sudden increase in temporal energy within the audio track, is detected; and a portion of the audio track corresponding to the highlight is extracted from the audio track. The two extracted portions of the audio track are combined as a thumbnail of the audio track.
127 Citations
21 Claims
-
1. A method for generating an audio thumbnail of an audio track, comprising:
-
detecting a first content feature within an audio track;
extracting a first portion of the audio track corresponding to the detected first content feature;
detecting an occurrence of an increase in energy within the audio track;
extracting a second portion of the audio track corresponding to the detected increase in energy; and
combining the extracted first and second portions of the audio track into an audio thumbnail of the audio track. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A method for generating an audio thumbnail of an audio track, comprising:
-
detecting a first content feature within an audio track;
mapping a pointer to the detected first content feature within the audio track;
setting a first duration of time;
detecting an occurrence of an increase in energy within the audio track;
mapping a pointer to the detected occurrence of an increase in energy within the audio track;
setting a second duration of time; and
storing the pointer to the detected first content feature, the first duration of time, the pointer to the detected occurrence of an increase in energy, and the second duration of time as an audio thumbnail of the audio track. - View Dependent Claims (18)
-
-
19. A method for detecting a highlight on an audio track, comprising:
-
determining a location of human sound on an audio track;
computing a first temporal energy envelope of a first segment of the audio track;
computing a second temporal energy envelope of a second segment of the audio track;
comparing the computed first and second temporal energy envelopes; and
if the second segment corresponds to a location of human sound on the audio track and if the computed temporal energy of the second segment exceeds the computed temporal energy of the first segment by a predetermined threshold, selecting a location on the audio track corresponding to the location of the second segment as a highlight on the audio track.
-
-
20. A computer-based system for generating a thumbnail of an audio track, comprising:
-
a recorder configured to record an audio track comprised of singing and instrumental music; and
a processor configured to;
detect a first characteristic within an audio track;
extract a first portion of the audio track corresponding to the detected first characteristic;
detect an occurrence of an increase in energy within the audio track;
extract a second portion of the audio track corresponding to the detected increase in energy; and
combine the extracted first and second portions of the audio track into an audio thumbnail of the audio track.
-
-
21. A computer readable medium encoded with software for generating an audio thumbnail of an audio track, by detecting a first characteristic within an audio track;
- extracting a first portion of the audio track corresponding to the detected first characteristic;
detecting an occurrence of an increase in energy within the audio track;
extracting a second portion of the audio track corresponding to the detected increase in energy; and
combining the extracted first and second portions of the audio track into an audio thumbnail of the audio track.
- extracting a first portion of the audio track corresponding to the detected first characteristic;
Specification