Image with audio conversation system and method
Abstract
A system and method are presented to allow audio communication between users concerning an image. The originator of the communication uses a mobile device app to select an image and record an audio commentary. The image, audio commentary, and metadata are submitted to a cloud server for storage. The app uses the server to analyze a recipient address to determine the preferred mode of delivery. If the recipient is a known user of the app, the file is delivered without combining the image, audio commentary, and metadata into a standard movie file. Otherwise, the originator's app delivers the file through MMS or e-mail for the recipient as a movie file for viewing using a standard video player.
17 Claims
1. A computerized method comprising:
a) at a first mobile device, presenting a plurality of still visual content items on a touchscreen display of the first mobile device;
b) at the first mobile device and while presenting the visual content, recording an audio commentary through a microphone on the first mobile device;
c) at the first mobile device and while recording the audio commentary;
   i) identifying user touch input on the touchscreen display at a particular time relative to the audio commentary wherein the user touch input comprises a plurality of user requests to transition to a new one of the plurality of still visual content items, and
   ii) presenting on the touchscreen display, in response to the user touch input, a visual augmentation of the visual content wherein the visual augmentation comprises visual transitions between the plurality of still images;
d) at the first mobile device, encoding the visual content, the audio commentary and the visual augmentation into a video file;
e) at the first mobile device, generating metadata defining the visual augmentation and the timing of the visual augmentation with respect to the audio commentary, wherein the visual augmentation is defined in metadata so as to allow the visual augmentation to be recreated solely through the metadata;
f) determining that a second mobile device is not operating a custom app capable of rendering the visual content, the audio commentary, and the visual augmentation outside of the video file and determining that a third mobile device is operating the custom app;
g) at the first mobile device and based on the determining step, transmitting the video file to the second mobile device and separately transmitting the visual content, the audio commentary, and the metadata without the video file to the third mobile device, wherein metadata is sufficient to allow the third mobile device to present the visual transitions at particular times in the audio commentary corresponding to the user touch input during recording of the audio commentary.
Dependent claims: 12, 13
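Steps f) and g) of claim 1 describe a per-recipient choice between two delivery forms. The following is a minimal sketch of that choice, not the patented implementation: the `Recording` class, its field names, and the `build_payload` function are hypothetical names introduced here for illustration, and the claim does not specify how the app check is performed, so it is modeled as a plain boolean.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Recording:
    """Hypothetical container for what the first device has produced."""
    images: List[str]   # still visual content items (file names, assumed)
    audio: str          # recorded audio commentary (file name, assumed)
    metadata: dict      # transition timing relative to the audio
    video_file: str     # images + audio + transitions encoded as one video


def build_payload(recording: Recording, recipient_has_app: bool) -> dict:
    """Pick what to transmit to one recipient.

    Recipients running the custom app receive the separate components and
    the metadata, without the video file. All others receive the video file.
    """
    if recipient_has_app:
        return {
            "kind": "components",
            "images": recording.images,
            "audio": recording.audio,
            "metadata": recording.metadata,
        }
    return {"kind": "video", "video_file": recording.video_file}


rec = Recording(
    images=["a.jpg", "b.jpg"],
    audio="talk.m4a",
    metadata={"transitions": [{"t": 2.5, "to": 1}]},
    video_file="talk.mp4",
)
for_second_device = build_payload(rec, recipient_has_app=False)
for_third_device = build_payload(rec, recipient_has_app=True)
```

In this sketch the second device (no app) gets only the video file, while the third device (app present) gets the images, audio, and metadata and never the video file, mirroring the split in step g).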
2. A computerized method comprising:
a) at a first mobile device, presenting a plurality of still visual content items on a touchscreen display of the first mobile device;
b) at the first mobile device and while presenting the visual content, recording an audio commentary through a microphone on the first mobile device;
c) at the first mobile device and while recording the audio commentary;
   i) identifying user touch input on the touchscreen display at a particular time relative to the audio commentary, wherein the user touch input comprises a plurality of user requests to transition to a new one of the plurality of still visual content items, and
   ii) presenting on the touchscreen display, in response to the user touch input, a visual augmentation of the visual content wherein the visual augmentation comprises visual transitions between the plurality of still images;
d) at the first mobile device, generating metadata defining the visual augmentation and the timing of the visual augmentation with respect to the audio commentary, wherein the visual augmentation is defined in metadata so as to allow the visual augmentation to be recreated solely through the metadata;
e) at the first mobile device, transmitting the visual content, the audio commentary, and the metadata to a second mobile device,
f) at the second mobile device, using the metadata to recreate the visual augmentation at the particular time relative to the audio commentary when presenting the visual content and the audio commentary, wherein the second mobile device presents the visual transitions at particular times in the audio commentary corresponding to the user touch input during recording of the audio commentary.
Dependent claims: 3, 4, 5, 6, 7, 8, 9, 10, 11
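Claim 2 turns on metadata that records when each transition happened relative to the audio commentary, so that a receiving device can replay the transitions from the metadata alone. A minimal sketch of that idea follows. The metadata layout (a `transitions` list of `t`/`to` entries) and the function names `record_transitions` and `image_at` are assumptions made for illustration; the claim does not prescribe a format.

```python
from bisect import bisect_right


def record_transitions(touch_events):
    """Build metadata from touch events captured during recording.

    touch_events: iterable of (seconds_into_audio, target_image_index).
    Events are stored in time order so playback can search them.
    """
    return {
        "transitions": [{"t": t, "to": idx} for t, idx in sorted(touch_events)]
    }


def image_at(metadata, playback_time, initial_index=0):
    """Return the index of the image to show at playback_time (seconds).

    Uses only the metadata: finds the latest transition whose timestamp is
    at or before the playback position. Before the first transition, the
    initial image is shown.
    """
    times = [tr["t"] for tr in metadata["transitions"]]
    pos = bisect_right(times, playback_time)
    if pos == 0:
        return initial_index
    return metadata["transitions"][pos - 1]["to"]


# Sender side: the user swiped to image 1 at 2.5 s and to image 2 at 6.0 s.
metadata = record_transitions([(6.0, 2), (2.5, 1)])

# Receiver side: which image accompanies each point in the audio?
shown = [image_at(metadata, t) for t in (0.0, 2.5, 5.9, 7.0)]
```

Because the receiver derives the displayed image from the audio playback position and the metadata, the transitions stay aligned with the commentary without any pre-rendered video.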
14. A system for transmitting audio commentaries on video content comprising:
a) a mobile device having
   i) a microphone,
   ii) a touchscreen display,
   iii) a processor,
   iv) a network interface,
   v) non-transitory, physical memory, and
   vi) a cellular interface for communicating cellular messages with a remote mobile device;
b) cellular messaging programming on the non-transitory, physical memory providing instructions that program the processor to transmit and receive the cellular messages via the cellular interface, and to maintain a list of incoming cellular messages, the cellular messaging programming instructions including an application programming interface to receive content and commands from other programming on the mobile device and to submit content and the messages to other programming on the mobile device;
c) app programming on the non-transitory, physical memory comprising instructions that program the processor to;
   i) present a plurality of still visual content items on the touchscreen display;
   ii) while presenting the visual content, record an audio commentary through the microphone;
   iii) while recording the audio commentary, identifying a plurality of user touch input requests to transition to a new one of the plurality of still visual content items;
   iv) generating metadata defining the timing of the transitions between the still visual content items with respect to the audio commentary, wherein the metadata allows the transitions to be recreated in sync with the audio commentary solely through the metadata;
   iv) submit messaging data including the metadata to the application programming interface for transmission to the remote mobile device through the cellular interface to allow the remote mobile device to play the transitions between the plurality of still visual content items at particular times in the audio commentary corresponding to the user touch input requests made during recording of the audio commentary.
Dependent claims: 15, 16, 17
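The final step of claim 14 has the app hand messaging data, including the metadata, to the messaging programming's application programming interface. The sketch below illustrates one way such a hand-off could look, under stated assumptions: the JSON layout, the function names, and the `send` callable standing in for the messaging API are all hypothetical, since the claim names no concrete interface or wire format.

```python
import json


def build_messaging_data(image_refs, audio_ref, transitions):
    """Serialize references to the content plus the transition metadata.

    transitions: list of (seconds_into_audio, target_image_index) pairs.
    JSON is an assumed encoding chosen only for this illustration.
    """
    return json.dumps(
        {
            "images": image_refs,
            "audio": audio_ref,
            "transitions": [{"t": t, "to": i} for t, i in transitions],
        },
        sort_keys=True,
    )


def submit_to_messaging_api(send, recipient, messaging_data):
    """Pass the data to the messaging layer.

    `send` is a stand-in for the messaging API: any callable that accepts
    (recipient, data). A real device API would differ.
    """
    send(recipient, messaging_data)


# Stand-in messaging layer that simply queues outgoing messages.
outbox = []
data = build_messaging_data(["a.jpg", "b.jpg"], "talk.m4a", [(2.5, 1)])
submit_to_messaging_api(lambda to, body: outbox.append((to, body)),
                        "+15550100", data)

# The remote device can recover the transition timing from the message.
received = json.loads(outbox[0][1])
```

Keeping the app's output as plain data passed through a narrow callable reflects the separation the claim draws between the app programming and the messaging programming.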
Specification