Composite audio waveforms with precision alignment guides
First Claim
1. A method comprising the steps of:
- generating a first visual representation of a first media clip;
generating a second visual representation of a second media clip;
automatically analyzing the first media clip to identify a first point within the first media clip, wherein the first point is an intra-clip point of interest (POI) that is neither the beginning of the first media clip nor the end of the first media clip;
after identifying the first point, generating a visual indication of the first point;
receiving input that drags one of the first visual representation and the second visual representation relative to the other of the first visual representation and the second visual representation;
while receiving said input, monitoring the distance between the first point of the first visual representation and a second point of the second visual representation;
in response to detecting that the distance falls below a threshold while one of the first visual representation and the second visual representation is being dragged, automatically shifting one of the first or second visual representations to cause the first point and the second point to be aligned;
wherein the method is performed by one or more computer systems.
2 Assignments
0 Petitions
Accused Products
Abstract
A technique for aligning a plurality of media clips is provided. One or more intra-clip points of interest (POIs) are identified in at least a first media clip. When aligning a first point in the first media clip with a second point in a second media clip, the first point may be snapped to the second point, wherein at least one of the first point and second point is an intra-clip POI. When a snap occurs, at least one of a visual or audible indication is generated, such as a “pop” sound, a snap line, or automatically aligning the first point with the second point when the first point is within a specified number of pixels of the second point. Techniques for representing multiple channels of an audio clip as a single waveform and caching waveforms are also provided.
-
Citations
36 Claims
-
1. A method comprising the steps of:
-
generating a first visual representation of a first media clip; generating a second visual representation of a second media clip; automatically analyzing the first media clip to identify a first point within the first media clip, wherein the first point is an intra-clip point of interest (POI) that is neither the beginning of the first media clip nor the end of the first media clip; after identifying the first point, generating a visual indication of the first point; receiving input that drags one of the first visual representation and the second visual representation relative to the other of the first visual representation and the second visual representation; while receiving said input, monitoring the distance between the first point of the first visual representation and a second point of the second visual representation; in response to detecting that the distance falls below a threshold while one of the first visual representation and the second visual representation is being dragged, automatically shifting one of the first or second visual representations to cause the first point and the second point to be aligned; wherein the method is performed by one or more computer systems. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising the steps of:
-
at a first point in time, a media editing application displaying an audio clip waveform by; reading audio data from storage; calculating, based on the audio data, an audio clip waveform, wherein the audio clip waveform represents one or more characteristics of the audio data; causing the audio clip waveform to be displayed on a screen; caching the audio clip waveform for subsequent use by the media editing application by; generating, by the media editing application, a digital image that depicts the audio clip waveform as the audio clip waveform appeared at a particular point in time; durably storing, by the media editing application and separate from the audio data, the digital image to a persistent storage; at a second point in time, the media editing application displaying the audio clip waveform by; in response to receiving input that requires redisplay of the audio clip waveform; reading the digital image from the persistent storage without calculating the audio clip waveform based on the audio data, and displaying the audio clip waveform as the audio waveform appeared at the particular point in time by causing the digital image to be displayed via a user interface; after the audio clip waveform is redisplayed based on the digital image, the media editing application allowing edit operations, to be performed on the audio data, using the redisplayed audio clip waveform; wherein the method is performed by one or more computer systems. - View Dependent Claims (10, 11, 34)
-
-
12. One or more storage media storing instructions which, when executed by one or more processors, cause the performance of:
-
generating a first visual representation of a first media clip; generating a second visual representation of a second media clip; analyzing the first media clip to identify a first point within the first media clip, wherein the first point is an intra-clip point of interest (POI) that is neither the beginning of the first media clip nor the end of the first media clip; after identifying the first point, generating a visual indication of the first point; receiving input that moves one of the first visual representation and the second visual representation relative to the other of the first visual representation and the second visual representation; while receiving said input, when the first point of the first visual representation is within a particular distance from a second point of the second visual representation, automatically shifting one of the first or second visual representations to cause the first point and the second point to be aligned. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19)
-
-
20. One or more storage media storing instructions which, when
executed by one or more processors, cause the performance of: -
at a first point in time, a media editing application displaying an audio clip waveform by; reading audio data from storage; calculating, based on the audio data, an audio clip waveform, wherein the audio clip waveform represents one or more characteristics of the audio data; causing the audio clip waveform to be displayed on a screen; caching the audio clip waveform for subsequent use by the media editing application by; generating, by the media editing application, a digital image that depicts the audio clip waveform; durably storing, by the media editing application and separate from the audio data, the digital image to a persistent storage; at a second point in time, the media editing application displaying the audio clip waveform by;
in response receiving input that requires redisplay of the audio clip waveform;reading the digital image from the persistent storage without calculating the audio clip waveform based on the audio data, and displaying the audio clip waveform as the audio waveform appeared at the particular point in time by causing the digital image to be displayed via a user interface; after the audio clip waveform is redisplayed based on the digital image, the media editing application allowing edit operations, to be performed on the audio data, using the redisplayed audio clip waveform. - View Dependent Claims (21, 22, 35)
-
-
23. An apparatus comprising:
-
one or more processors; one or more storage media storing instructions which, when executed by the one or more processors, cause the performance of; generating a first visual representation of a first media clip; generating a second visual representation of a second media clip; analyzing the first media clip to identify a first point within the first media clip, wherein the first point is an intra-clip point of interest (POI) that is neither the beginning of the first media clip nor the end of the first media clip; after identifying the first point, generating a visual indication of the first point; receiving input that moves one of the first visual representation and the second visual representation relative to the other of the first visual representation and the second visual representation; while receiving said input, when the first point of the first visual representation is within a particular distance from a second point of the second visual representation, automatically shifting one of the first or second visual representations to cause the first point and the second point to be aligned. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30)
-
-
31. An apparatus comprising:
-
one or more processors; one or more storage media storing instructions which, when executed by the one or more processors, cause performance of; at a first point in time, a media editing application displaying an audio clip waveform by;
reading audio data from storage;calculating, based on the audio data, an audio clip waveform, wherein the audio clip waveform represents one or more characteristics of the audio data; causing the audio clip waveform to be displayed on a screen; caching the audio clip waveform for subsequent use by the media editing application by;
generating, by the media editing application, a digital image that depicts the audio clip waveform;durably storing, by the media editing application and separate from the audio data, the digital image to a persistent storage; at a second point in time, the media editing application displaying the audio clip waveform by; in response receiving input that requires redisplay of the audio clip waveform; reading the digital image from the persistent storage without calculating the audio clip waveform based on the audio data, and displaying the audio clip waveform as the audio waveform appeared at the particular point in time by causing the digital image to be displayed via a user interface; after the audio clip waveform is redisplayed based on the digital image, the media editing application allowing edit operations, to be performed on the audio data, using the redisplayed audio clip waveform. - View Dependent Claims (32, 33, 36)
in response to receiving third input, initiating a second session of the media editing application; receiving fourth input to load the audio waveform; reading, from the persistent storage, the digital image that represents the audio clip waveform; and displaying the digital image via the user interface.
-
-
36. The apparatus of claim 31, wherein the instructions, when executed by the one or more processors, further cause, while the digital image is displayed, re-calculating the audio clip waveform based on the audio data.
Specification