System for automatically morphing audio information
First Claim
1. A method for morphing from a first sound to a second sound, comprising the steps of:
- analyzing each of said first and second sounds to obtain a dense representation for each sound;
determining correspondence between the respective representations of said sounds;
modifying the representations of said sounds, based on said correspondence, to form a new representation; and
inverting the new representation and generating a morphed sound from the inverted representation.
2 Assignments
0 Petitions
Accused Products
Abstract
In the first step of a sound morphing process, each sound which forms the basis for the morph is converted into one or more quantitative representations, such as spectrograms. After the representations have been obtained, the temporal axes of the two sounds are matched, so that similar components of the two sounds, such as onsets, harmonic regions and inharmonic regions, are aligned with one another. Other characteristics of the sounds, such as pitch, formant frequencies, or the like, are then matched. Once the energy in each of the sounds has been accounted for and matched to that of the other sound, the two sounds are cross-faded, to produce a representation of a new sound. This representation is then inverted, to generate the morphed sound.
119 Citations
47 Claims
-
1. A method for morphing from a first sound to a second sound, comprising the steps of:
-
analyzing each of said first and second sounds to obtain a dense representation for each sound; determining correspondence between the respective representations of said sounds; modifying the representations of said sounds, based on said correspondence, to form a new representation; and inverting the new representation and generating a morphed sound from the inverted representation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A method for morphing from a first sound to a second sound, comprising the steps of:
-
factoring each of said two sounds into a plurality of representations which respectively relate to different acoustic features of the sounds; independently modifying said plural representations to produce a plurality of new representations; combining said new representations to produce a representation for a morphed sound; and inverting the representation and generating the morphed sound from the inverted representation. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A method for morphing from a first sound to a second sound, comprising the steps of:
-
analyzing each of said first and second sounds to obtain at least one representation of each sound; automatically matching common regions of said representations so that they are temporally aligned with one another; modifying predetermined portions of corresponding temporally aligned features of said first and second sounds; and inverting the modified sound representation and generating a sound having acoustic characteristics between those of said first and second sounds. - View Dependent Claims (25, 26, 27, 28, 29, 30, 31, 32, 33, 34)
-
-
35. A method for generating a sound based upon a dense spectral representation of a sound, comprising the steps of:
-
generating a first spectrogram of a sound; determining the mel-frequency cepstral coefficients for the sound from said first spectrogram; inverting the mel-frequency cepstral coefficients to obtain a spectrogram of the formants of the sound; and subsequently generating a sound which is based upon information contained in the formant spectrogram. - View Dependent Claims (36)
-
-
37. A method for producing a morph comprising a transition from one spoken word to another spoken word, comprising the steps of:
-
generating a dense spectral representation of each spoken word; generating a plurality of modified representations, each of which comprises a different respective interpolation of corresponding values in the representation of said two sounds; and sequentially inverting each of said modified representation and generating a series of discrete sounds which transition from one of said spoken words to the other of said spoken words, and which include characteristics of each of said spoken words.
-
-
38. A method for transforming from a one-dimensional signal representing a physical phenomenon to a second one-dimensional signal representing another physical phenomenon, comprising the steps of:
-
automatically defining points of correspondence between the respective signals; determining a desired point in a morphed signal, and selecting a pair of corresponding points in the original signals that are related to the determined point; and warping and interpolating the original signals, based on said pair of corresponding points, to form a morphed signal, and generating a sensory perceptible physical phenomenon corresponding to said morphed signal. - View Dependent Claims (39, 40, 41, 42, 43, 44, 45)
-
-
46. A method for generating an output sound which includes characteristic features of each of two input sounds, comprising the steps of:
-
factoring each of said two input sounds into representations which include at least a pitch spectrogram for a first one of said two input sounds and at least a formant spectrogram for a second one of said two input sounds; combining information obtained from said pitch spectrogram for said first input sound with information obtained from said formant spectrogram for said second input sound to form a new representation for a morphed sound; and inverting said new representation and generating an output sound.
-
-
47. A method for generating a morphed sound from first and second input sounds, comprising the steps of:
-
factoring each of said two input sounds into a plurality of representations which respectively relate to different acoustic features of the sounds; combining information obtained from a representation of the first input sound which relates to a first acoustic feature with information obtained from a representation of the second input sound that relates to a second, different acoustic feature, to produce a representation for a morphed sound; and inverting the representation for the morphed sound and generating the morphed sound from the inverted representation.
-
Specification