AUTOMATIC GENERATION OF VIDEO AND DIRECTIONAL AUDIO FROM SPHERICAL CONTENT

US 20190005987A1
Filed: 08/20/2018
Published: 01/03/2019
Est. Priority Date: 07/03/2014
Status: Active Grant

First Claim

Patent Images

1. A method for generating a video with corresponding audio, the method performing by a computing system including one or more processors, the method comprising:

receiving, by the computing system, a video, the video comprising frames including a target, the video having a field of view;

receiving, by the computing system, directional audio signals captured concurrently with the video;

determining, by the computing system, a time-varying path of the target within the video based on an analysis of content of the video or information associated with the video;

identifying, by the computing system, sub-frames from the frames based on the time-varying path of the target, the sub-frames having a reduced field of view relative to the field of view of the video, the sub-frames including the target;

generating, by the computing system, an audio stream from the directional audio signals based on the time-varying path of the target, the audio stream including portions of one or more of the directional audio signals corresponding to a direction of the target; and

outputting, by the computing system, the sub-frames and the audio stream.

View all claims

4 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A spherical content capture system captures spherical video and audio content. In one embodiment, captured metadata or video/audio processing is used to identify content relevant to a particular user based on time and location information. The platform can then generate an output video from one or more shared spherical content files relevant to the user. The output video may include a non-spherical reduced field of view such as those commonly associated with conventional camera systems. Particularly, relevant sub-frames having a reduced field of view may be extracted from each frame of spherical video to generate an output video that tracks a particular individual or object of interest. For each sub-frame, a corresponding portion of an audio track is generated that includes a directional audio signal having a directionality based on the selected sub-frame.

Citations

20 Claims

1. A method for generating a video with corresponding audio, the method performing by a computing system including one or more processors, the method comprising:
- receiving, by the computing system, a video, the video comprising frames including a target, the video having a field of view;
  
  receiving, by the computing system, directional audio signals captured concurrently with the video;
  
  determining, by the computing system, a time-varying path of the target within the video based on an analysis of content of the video or information associated with the video;
  
  identifying, by the computing system, sub-frames from the frames based on the time-varying path of the target, the sub-frames having a reduced field of view relative to the field of view of the video, the sub-frames including the target;
  
  generating, by the computing system, an audio stream from the directional audio signals based on the time-varying path of the target, the audio stream including portions of one or more of the directional audio signals corresponding to a direction of the target; and
  
  outputting, by the computing system, the sub-frames and the audio stream.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
- - 2. The method of claim 1, wherein determining the audio stream further comprises selecting the portions based on matching directionalities of the directional audio signals with the direction of the target.
  - 3. The method of claim 1, wherein the directional audio signals correspond to directions perpendicular to faces of at least one member of a group consisting of:
    - a tetrahedron, a cube, or a pyramid.
  - 4. The method of claim 1, wherein the information associated with the video includes location information generated by a tracking device carried by the target.
  - 5. The method of claim 1, wherein the analysis of the content of the video includes visual recognition of the target within the video.
  - 6. The method of claim 5, wherein the visual recognition of the target includes facial recognition, object recognition, motion recognition, or gesture recognition.
  - 7. The method of claim 1, wherein the information associated with the video includes a directionality of an audio source of one or more of the directional audio signals.
  - 8. The method of claim 1, wherein a scene motion analysis is performed based on the directional audio signals.

9. A non-transitory computer-readable storage medium storing instructions for generating a video with corresponding audio, the instructions when executed by one or more processors causing the one or more processors to perform steps including:
- receiving a video, the video comprising frames including a target, the video having a field of view;
  
  receiving directional audio signals captured concurrently with the video;
  
  determining a time-varying path of the target within the video based on an analysis of content of the video or information associated with the video;
  
  identifying sub-frames from the frames based on the time-varying path of the target, the sub-frames having a reduced field of view relative to the field of view of the video, the sub-frames including the target;
  
  generating an audio stream from the directional audio signals based on the time-varying path of the target, the audio stream including portions of one or more of the directional audio signals corresponding to a direction of the target; and
  
  outputting the sub-frames and the audio stream.
- View Dependent Claims (10, 11, 12, 13, 14)
- - 10. The non-transitory computer-readable storage medium of claim 9, wherein determining the audio stream further comprises selecting the portions based on matching directionalities of the directional audio signals with the direction of the target.
  - 11. The non-transitory computer-readable storage medium of claim 9, wherein the directional audio signals correspond to directions perpendicular to the faces of at least one member of a group consisting of:
    - a tetrahedron, a cube, or a pyramid.
  - 12. The non-transitory computer-readable storage medium of claim 9, wherein the information associated with the video includes location information generated by a tracking device carried by the target.
  - 13. The non-transitory computer-readable storage medium of claim 9, wherein the analysis of the content of the video includes visual recognition of the target within the video.
  - 14. The non-transitory computer-readable storage medium of claim 13, wherein the visual recognition of the target includes facial recognition, object recognition, motion recognition, or gesture recognition.

15. A system for generating a video with corresponding audio, the system comprising:
- one or more processors; and
  
  a non-transitory computer-readable storage medium storing instructions that when executed by the one or more processors causes the one or more processors to perform steps including;
  
  receiving a video, the video comprising frames including a target, the video having a field of view;
  
  receiving directional audio signals captured concurrently with the video;
  
  determining a time-varying path of the target within the video based on an analysis of content of the video or information associated with the video;
  
  identifying sub-frames from the frames based on the time-varying path of the target, the sub-frames having a reduced field of view relative to the field of view of the video, the sub-frames including the target;
  
  generating an audio stream from the directional audio signals based on the time-varying path of the target, the audio stream including portions of one or more of the directional audio signals corresponding to a direction of the target; and
  
  outputting the sub-frames and the audio stream.
- View Dependent Claims (16, 17, 18, 19, 20)
- - 16. The system of claim 15, wherein determining the audio stream further comprises selecting the portions based on matching directionalities of the directional audio signals with the direction of the target.
  - 17. The system of claim 15, wherein the directional audio signals correspond to directions perpendicular to faces of at least one member of a group consisting of:
    - a tetrahedron, a cube, or a pyramid.
  - 18. The system of claim 15, wherein the information associated with the video includes location information generated by a tracking device carried by the target.
  - 19. The system of claim 15, wherein the analysis of the content of the video includes visual recognition of the target within the video.
  - 20. The system of claim 15, wherein the information associated with the video includes a directionality of an audio source of one or more of the directional audio signals.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
GoPro, Inc.
Original Assignee
GoPro, Inc.
Inventors
Campbell, Scott Patrick, Jing, Zhinian, Macmillan, Timothy, Newman, David A., Adsumilli, Balineedu Chowdary

Granted Patent

US 10,410,680 B2
Time in Patent Office

Days
Field of Search
US Class Current
CPC Class Codes

G11B 27/3081   used signal is a video-fram...

H04N 23/698   for achieving an enlarged f...

H04N 5/77   between a recording apparat...

H04N 9/806   with processing of the soun...

H04N 9/8205   involving the multiplexing ...

H04N 9/8211   the additional signal being...

AUTOMATIC GENERATION OF VIDEO AND DIRECTIONAL AUDIO FROM SPHERICAL CONTENT

First Claim

4 Assignments

0 Petitions

Accused Products

Abstract

Citations

20 Claims

Specification

Solutions

Use Cases

Quick Links

AUTOMATIC GENERATION OF VIDEO AND DIRECTIONAL AUDIO FROM SPHERICAL CONTENT

First Claim

4 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

20 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links