Fast playback in media files with reduced impact to speech quality
Abstract
The present invention is a system and method for increasing the playback speed of audio waves. The system analyzes an audio wave to detect a first silent section that has a length greater than a minimum short pause length required to distinguish between words. The system then calculates a new playback speed for the first silent section so that its total playback time is less than or equal to the minimum short pause length, and controls an audio playback device to play the audio wave so that the first silent section is played back at the new playback speed. In another embodiment, the system analyzes spoken words phoneme by phoneme and increases the spoken word playback speed by dynamically reducing the length of each phoneme and of the inter-syllable silent pauses. Thus, the system functions equally well for all languages and accents.
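The silent-section step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the amplitude threshold, the function names `find_silent_sections` and `silence_speed`, and the representation of the audio wave as a list of floats in [-1, 1] are all assumptions made for illustration.

```python
def find_silent_sections(samples, threshold=0.01):
    """Return (start_index, end_index) for each run of samples whose
    absolute amplitude is below `threshold` (assumed silence criterion)."""
    sections = []
    run_start = None
    for i, s in enumerate(samples):
        if abs(s) < threshold:
            if run_start is None:
                run_start = i          # a silent run begins here
        elif run_start is not None:
            sections.append((run_start, i))
            run_start = None
    if run_start is not None:          # the wave ends inside a silent run
        sections.append((run_start, len(samples)))
    return sections


def silence_speed(length_s, min_short_pause_s):
    """Speed factor so a silent section plays in min_short_pause_s.
    Sections already at or below the minimum are left at normal speed."""
    if length_s <= min_short_pause_s:
        return 1.0
    return length_s / min_short_pause_s
```

Given a sample rate, the length of a section in seconds is `(end - start) / sample_rate`; passing that length and the chosen minimum short pause to `silence_speed` gives the factor by which that section would be sped up.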
18 Citations
13 Claims
1. A computer processing system for increasing the playback speed of audio files, comprising:
- a computer processor having a non-transitory memory containing program code for;
analyzing an audio wave;
calculating a first silent section in the audio wave;
wherein the first silent section has a length greater than a minimum short pause length required to distinguish between words;
calculating a new playback speed of the first silent section so that a total playback time for the first silent section is equal to the minimum short pause length;
detecting a second silent section after one or more first silent sections;
wherein the second silent section has a length greater than a minimum long pause length required to distinguish between sentences;
calculating a new playback speed of the second silent section so that a total playback time for the second silent section is equal to the minimum long pause length; and
adjusting an original playback speed of the second silent sections with respect to the first silent sections, such that the second silent sections are double the length of the first silent sections.
Dependent claims: 2, 3.
4. A method for increasing the playback speed of audio files, comprising the steps of:
analyzing an audio wave;
isolating a word in an audio wave;
isolating each phoneme in the word;
analyzing frequencies of the phoneme;
retaining a high frequency section of the phoneme; and
reducing the duration of a low frequency repeating pattern section of the phoneme.
Dependent claims: 5, 6, 7, 8.
9. A computer system increasing the playback speed of audio files, comprising:
- a computer processor;
- non-transitory memory coupled to the computer processor, the non-transitory memory containing program code for;
analyzing an audio wave;
isolating a word in an audio wave;
isolating each phoneme in the word;
analyzing frequencies of the phoneme;
retaining a high frequency section of the phoneme; and
reducing the duration of a low frequency repeating pattern section of the phoneme.
Dependent claims: 10, 11, 12, 13.
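As an illustration of the two pause tiers recited in claim 1, the following Python sketch classifies silent sections as word-level or sentence-level pauses and computes a speed factor for each. It reflects one reading of the claim, in which the long-pause target is fixed at twice the short-pause target; the function name `pause_plan` and the labels "keep", "short", and "long" are hypothetical.

```python
def pause_plan(lengths_s, min_short_s):
    """For each silent-section length (seconds), return a tuple
    (kind, target_s, speed).

    Assumption: the sentence-level target is 2 * min_short_s, so that
    long pauses play back at double the length of short pauses."""
    min_long_s = 2 * min_short_s
    plan = []
    for length in lengths_s:
        if length > min_long_s:
            kind, target = "long", min_long_s     # sentence-level pause
        elif length > min_short_s:
            kind, target = "short", min_short_s   # word-level pause
        else:
            kind, target = "keep", length         # already short enough
        speed = length / target if target > 0 else 1.0
        plan.append((kind, target, speed))
    return plan
```

A section of 1.5 s with a 0.25 s short-pause minimum would be classified as "long" and given a 0.5 s target. A fuller treatment would also need the ordering condition in the claim (a second silent section detected after one or more first silent sections), which this sketch does not model.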
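The phoneme-shortening steps of claims 4 and 9 can be sketched in Python under strong simplifying assumptions: the phoneme is already isolated as a list of samples, its high frequency section is assumed to sit at the start with a known length, and the low frequency repeating pattern is assumed to be periodic. The period is estimated with a plain autocorrelation search, which is a stand-in technique and not necessarily what the patent uses; `estimate_period` and `shorten_phoneme` are hypothetical names.

```python
def estimate_period(samples, min_lag, max_lag):
    """Return the lag (in samples) with the highest raw autocorrelation,
    used here as a simple estimate of the repeating-pattern period."""
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        score = sum(samples[i] * samples[i + lag]
                    for i in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def shorten_phoneme(samples, onset_len, period, keep_every=2):
    """Keep the first `onset_len` samples untouched (assumed high
    frequency section) and keep only every `keep_every`-th period of
    the remaining repeating section."""
    onset = samples[:onset_len]
    body = samples[onset_len:]
    n_periods = len(body) // period
    kept = []
    for k in range(n_periods):
        if k % keep_every == 0:
            kept.extend(body[k * period:(k + 1) * period])
    kept.extend(body[n_periods * period:])  # trailing partial period
    return onset + kept
```

With `keep_every=2` the repeating section is roughly halved while the onset is retained in full. Dropping whole periods keeps the joins aligned only for a perfectly periodic pattern; real speech would likely need cross-fading at the joins.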
Specification