INTERACTIVE CONTENT CREATION
First Claim
1. A method for audio content creation, comprising:
defining a plurality of three-dimensional collision volumes, the plurality of three-dimensional collision volumes being at least partially different three-dimensional spaces;
assigning a base track of music to each of the plurality of three-dimensional collision volumes;
receiving a depth image including depth data representing distances from an origin to objects within a scene;
processing the depth data to determine an instance where the depth data indicates a human user in the scene;
tracking movement of the human user in the instance where the depth data indicates a human user in the scene to determine interaction of the human user with one or more of the plurality of collision volumes; and
automatically changing audio content that is played upon interaction of the user with the one or more of the plurality of collision volumes.
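The claimed method steps can be sketched as a per-frame loop over tracked joint positions. This is a minimal illustration with hypothetical names (`CollisionVolume`, `tracks_hit`, axis-aligned boxes); the claim itself only requires that the volumes be at least partially different three-dimensional spaces and does not fix any implementation.

```python
from dataclasses import dataclass

@dataclass
class CollisionVolume:
    # Axis-aligned 3-D box used for illustration only; the claim permits
    # any partially distinct three-dimensional spaces.
    name: str
    min_xyz: tuple  # lower (x, y, z) corner, distances from the sensor origin
    max_xyz: tuple  # upper (x, y, z) corner
    base_track: str  # base track of music assigned to this volume

    def contains(self, point):
        return all(lo <= p <= hi
                   for p, lo, hi in zip(point, self.min_xyz, self.max_xyz))

def tracks_hit(volumes, user_joints):
    # Determine which collision volumes the tracked user intersects this
    # frame, and return the base tracks whose audio should change.
    hits = set()
    for joint in user_joints:
        for vol in volumes:
            if vol.contains(joint):
                hits.add(vol.base_track)
    return hits

drums = CollisionVolume("drums", (0.0, 0.0, 1.0), (1.0, 1.0, 2.0), "drum_loop")
bass = CollisionVolume("bass", (-1.0, 0.0, 1.0), (0.0, 1.0, 2.0), "bass_loop")
# A tracked hand at (0.5, 0.5, 1.5) falls inside the "drums" volume only.
print(tracks_hit([drums, bass], [(0.5, 0.5, 1.5)]))  # -> {'drum_loop'}
```

In a real system the joint positions would come from skeletal tracking of the depth image; here they are passed in directly to keep the sketch self-contained.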
Abstract
An audio/visual system (e.g., an entertainment console or other computing device) plays a base audio track, such as a portion of a pre-recorded song or notes from one or more instruments. Using a depth camera or other sensor, the system automatically detects that a user (or a portion of the user) enters a first collision volume of a plurality of collision volumes. Each collision volume of the plurality of collision volumes is associated with a different audio stem. In one example, an audio stem is a sound from a subset of instruments playing a song, a portion of a vocal track for a song, or notes from one or more instruments. In response to automatically detecting that the user (or a portion of the user) entered the first collision volume, the appropriate audio stem associated with the first collision volume is added to the base audio track or removed from the base audio track.
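The add-or-remove behaviour described in the abstract can be sketched as a toggle over a set of currently playing stems. The function and track names below are hypothetical; the abstract does not fix an API.

```python
def on_enter_volume(playing_stems, volume_stem):
    # Toggle the audio stem associated with the entered collision volume:
    # add it to the mix if absent, remove it if already playing
    # (the abstract allows either outcome).
    updated = set(playing_stems)
    if volume_stem in updated:
        updated.discard(volume_stem)
    else:
        updated.add(volume_stem)
    return updated

mix = {"base_track"}
mix = on_enter_volume(mix, "guitar_stem")  # user steps into the guitar volume
print(sorted(mix))  # -> ['base_track', 'guitar_stem']
mix = on_enter_volume(mix, "guitar_stem")  # entering again removes the stem
print(sorted(mix))  # -> ['base_track']
```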
20 Claims
1. A method for audio content creation, comprising:
defining a plurality of three-dimensional collision volumes, the plurality of three-dimensional collision volumes being at least partially different three-dimensional spaces;
assigning a base track of music to each of the plurality of three-dimensional collision volumes;
receiving a depth image including depth data representing distances from an origin to objects within a scene;
processing the depth data to determine an instance where the depth data indicates a human user in the scene;
tracking movement of the human user in the instance where the depth data indicates a human user in the scene to determine interaction of the human user with one or more of the plurality of collision volumes; and
automatically changing audio content that is played upon interaction of the user with the one or more of the plurality of collision volumes.
View Dependent Claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 16, 17
14. An apparatus that creates audio content, comprising:
a depth camera for producing depth data indicative of distances to objects in a scene;
a display interface;
an audio interface; and
a processor in communication with the depth camera, display interface and audio interface, the processor configured to process the depth data from the depth camera to determine the presence and movement of a human user in the scene, the processor further configured to play a first base audio track and detect a first predefined movement of the user from a plurality of predefined movements based on data from the depth camera, each predefined movement being associated with a different audio stem, the processor adding a first audio stem to the base track in response to detecting that the user performed the first predefined movement, the first audio stem corresponding to the first predefined movement.
View Dependent Claims: 15
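The apparatus claim maps each predefined movement to a different audio stem. That mapping can be sketched as a lookup table; the gesture labels and stem names below are hypothetical, and a real apparatus would classify movements from skeletal tracking of the depth data.

```python
# Hypothetical gesture labels and stems for illustration only.
STEM_FOR_MOVEMENT = {
    "raise_left_arm": "bass_stem",
    "raise_right_arm": "vocal_stem",
}

def on_movement(mix, movement):
    # Add the audio stem corresponding to a detected predefined movement
    # to the set of audio currently playing over the base track.
    stem = STEM_FOR_MOVEMENT.get(movement)
    if stem is not None:
        mix = mix | {stem}
    return mix

mix = {"base_audio_track"}
mix = on_movement(mix, "raise_left_arm")
print(sorted(mix))  # -> ['base_audio_track', 'bass_stem']
```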
18. One or more processor readable storage devices storing processor readable code thereon, the processor readable code for programming one or more processors to perform a method comprising:
obtaining depth data representing distances to points in a scene;
detecting a human user in the scene from analysis of the depth data;
defining a plurality of three-dimensional movement zones and a plurality of collision volumes in the scene;
defining one or more base tracks for the plurality of movement zones;
detecting when a body part of the user enters one or more of the movement zones from the depth data indicating a three-dimensional position of the body part;
detecting when a body part of the user enters one or more of the collision volumes from the depth data indicating a three-dimensional position of the body part;
identifying audio stems for a set of collision volumes for each movement zone; and
creating code based on the defined one or more base tracks for the plurality of movement zones and the identified audio stems for the set of collision volumes for each zone, the code capable of configuring a computing device to play the one or more base tracks depending on which movement zone the user is positioned in, the code capable of configuring the computing device to add or subtract audio stems based on the user intersecting corresponding collision volumes.
View Dependent Claims: 19, 20
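One plausible reading of the "creating code" limitation is emitting a declarative configuration that a playback device loads: a base track per movement zone, plus the stem each collision volume in that zone adds or subtracts. A sketch with hypothetical field and zone names:

```python
import json

def build_player_config(zone_base_tracks, zone_collision_stems):
    # zone_base_tracks: {zone_name: base_track}
    # zone_collision_stems: {zone_name: {collision_volume: stem}}
    # Returns JSON a hypothetical playback device could consume.
    return json.dumps({
        "zones": [
            {
                "zone": zone,
                "base_track": zone_base_tracks[zone],
                "collision_stems": zone_collision_stems.get(zone, {}),
            }
            for zone in zone_base_tracks
        ]
    }, indent=2)

config = build_player_config(
    {"left": "drum_loop", "right": "synth_loop"},
    {"left": {"vol_a": "bass_stem"}, "right": {"vol_b": "vocal_stem"}},
)
print(json.loads(config)["zones"][0]["base_track"])  # -> drum_loop
```

The claim does not specify whether the created "code" is executable or declarative; a configuration file is only one of the forms it could take.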
Specification