Eyes free entertainment
Abstract
Disclosed herein are systems and methods for converting audio-video content into audio-only content. Audio-video content is readily accessible, but for various reasons users often cannot consume content visually. In those circumstances, for example when a user is interrupted during a movie to drive to pick up a spouse or child, the user may not want to forgo consuming the audio-video content. The audio-video content can be converted into audio-only content for the user to consume aurally, allowing the user to consume the content despite interruptions or other circumstances in which it cannot be consumed visually.
20 Claims
1. A method for converting audio-video content into audio-only content, the method comprising:
decomposing, by a computer system, the audio-video content into a plurality of frames and a sound component;
creating, by the computer system, an object layer, the creating the object layer comprising:
    for each frame in the plurality of frames:
        decomposing, by the computer system, the frame into one or more visual objects in the frame, and
        generating, by the computer system, a description of each of the one or more visual objects in the frame to create a plurality of object descriptions;
    generating, by the computer system, an object layer audio component based on the plurality of object descriptions;
creating, by the computer system, a sound layer, the creating the sound layer comprising generating a sound layer audio component from the sound component;
creating, by the computer system, a motion layer, the creating the motion layer comprising:
    analyzing, by the computer system, each frame in the plurality of frames to identify motion between consecutive frames, and
    generating, by the computer system, a motion layer audio component based on a description of the motion between consecutive frames;
generating, by the computer system, an audio only output of the audio-video content based on the object layer audio component, the sound layer audio component, and the motion layer audio component; and
transmitting, by the computer system, the audio only output to a device of a user.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 20
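The flow of claim 1 can be pictured as three layers merged into one timeline. The following is only a rough sketch under stated assumptions, not the patented implementation: `Segment`, `object_layer`, `motion_layer`, `sound_layer`, and `convert` are hypothetical names, narration text stands in for synthesized audio, and the two describer callables are placeholders for whatever object recognition and motion analysis a real system would use. Narrating objects only when the set of descriptions changes is also an assumption made here to keep the output short.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Optional, Sequence


@dataclass
class Segment:
    """One timed piece of the audio-only output (hypothetical structure)."""
    time: float  # start time in seconds
    layer: str   # "object", "sound", or "motion"
    text: str    # narration text, or a label for the original sound


def object_layer(frames: Sequence[Any], fps: float,
                 describe_objects: Callable[[Any], List[str]]) -> List[Segment]:
    """Describe the visual objects in every frame.

    Assumption: a narration segment is emitted only when the descriptions
    differ from those of the previous frame.
    """
    segments, previous = [], None
    for i, frame in enumerate(frames):
        descriptions = describe_objects(frame)
        if descriptions != previous:
            segments.append(Segment(i / fps, "object", "; ".join(descriptions)))
            previous = descriptions
    return segments


def motion_layer(frames: Sequence[Any], fps: float,
                 describe_motion: Callable[[Any, Any], Optional[str]]) -> List[Segment]:
    """Describe motion between each pair of consecutive frames."""
    segments = []
    for i in range(1, len(frames)):
        description = describe_motion(frames[i - 1], frames[i])
        if description is not None:
            segments.append(Segment(i / fps, "motion", description))
    return segments


def sound_layer(sound: str) -> List[Segment]:
    """Pass the original sound component through as a single segment."""
    return [Segment(0.0, "sound", sound)]


def convert(frames, sound, fps, describe_objects, describe_motion) -> List[Segment]:
    """Combine the three layers into one time-ordered output."""
    merged = (sound_layer(sound)
              + object_layer(frames, fps, describe_objects)
              + motion_layer(frames, fps, describe_motion))
    # sorted() is stable, so segments with equal times keep layer order.
    return sorted(merged, key=lambda segment: segment.time)
```

With toy frames represented as dictionaries, `convert(frames, "soundtrack", 1.0, lambda f: f["objects"], my_motion_fn)` returns the sound segment first, followed by object and motion narration in time order. In a fuller system each `text` field would be passed to a speech synthesizer and mixed with the original sound before transmission to the user's device.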
13. A system for converting audio-video content into audio-only content, the system comprising:
a processor; and
a memory having stored thereon instructions that, when executed by the processor, cause the processor to:
    decompose the audio-video content into a plurality of frames and a sound component;
    create an object layer, the create the object layer comprising:
        for each frame in the plurality of frames:
            decompose the frame into one or more visual objects in the frame, and
            generate a description of each of the one or more visual objects in the frame to create a plurality of object descriptions;
        generate an object layer audio component based on the plurality of object descriptions;
    create a sound layer, the create the sound layer comprising generating a sound layer audio component from the sound component;
    create a motion layer, the create the motion layer comprising:
        analyze each frame in the plurality of frames to identify motion between consecutive frames, and
        generate a motion layer audio component based on a description of the motion between consecutive frames;
    generate an audio only output of the audio-video content based on the object layer audio component, the sound layer audio component, and the motion layer audio component; and
    transmit the audio only output to a device of a user.
Dependent claims: 14, 15, 16, 17, 18, 19
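The claims do not say how motion between consecutive frames is identified. As one illustration only, the sketch below swaps in a very simple technique, comparing the centroid of above-threshold pixels in two grayscale frames, and turns the shift into a short textual description. The frame representation (rows of 0 to 255 intensity values), the function names `centroid` and `describe_motion`, and the `threshold` and `min_shift` parameters are all assumptions made for this example.

```python
from typing import List, Optional, Tuple

# Assumed representation: a frame is a list of pixel rows, values 0-255.
Frame = List[List[int]]


def centroid(frame: Frame, threshold: int = 128) -> Optional[Tuple[float, float]]:
    """Return the (x, y) centre of all pixels brighter than threshold, or None."""
    xs, ys = [], []
    for y, row in enumerate(frame):
        for x, value in enumerate(row):
            if value > threshold:
                xs.append(x)
                ys.append(y)
    if not xs:
        return None
    return sum(xs) / len(xs), sum(ys) / len(ys)


def describe_motion(prev: Frame, cur: Frame, min_shift: float = 0.5) -> Optional[str]:
    """Describe the dominant direction the bright region moved, if any."""
    a, b = centroid(prev), centroid(cur)
    if a is None or b is None:
        return None  # nothing to track in one of the frames
    dx, dy = b[0] - a[0], b[1] - a[1]
    if abs(dx) < min_shift and abs(dy) < min_shift:
        return None  # shift too small to report
    if abs(dx) >= abs(dy):
        return "moves right" if dx > 0 else "moves left"
    # Image rows are numbered top to bottom, so a positive dy means downward.
    return "moves down" if dy > 0 else "moves up"
```

A practical system would more likely track individual recognized objects rather than a single bright region; this sketch only shows how a per-pair comparison can yield the kind of motion description the claim refers to.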
Specification