Coordinating and mixing audiovisual content captured from geographically distributed performers
First Claim
1. A method of preparing coordinated audiovisual performances from geographically distributed performer contributions, the method comprising:
receiving via a communication network, a first audiovisual encoding of a first performer, including first performer vocals captured at a first remote device and first performer video;
receiving via the communication network, a second audiovisual encoding of a second performer, including second performer vocals captured at a second remote device and second performer video;
determining, from the first performer vocals, at least one time-varying, computationally-defined audio feature;
determining, from the second performer vocals, at least one time-varying, computationally-defined audio feature; and
based on comparison of the computationally-defined audio feature determined from the first and second performer vocals, dynamically varying relative visual prominence of first and second performer video throughout a combined audiovisual performance mix of the captured first and second performer vocals with a backing track and the first and second performer video; and
supplying the first and second remote devices with corresponding, but differing, versions of the combined audiovisual performance mix,
wherein the combined audiovisual performance mix supplied to the first remote device features the first performer video and first performer vocals more prominently than the second performer video and second performer vocals, and
wherein the combined performance mix supplied to the second remote device features the second performer video and second performer vocals more prominently than the first performer video and first performer vocals.
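The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation. It assumes short-time RMS energy as the time-varying audio feature (the claim does not name a specific feature), and every function name, the frame length, and the `bias` value used to produce the two differing mixes are hypothetical choices made for illustration only.

```python
import math


def rms_per_frame(samples, frame_len):
    """Short-time RMS energy: one value per non-overlapping frame.

    Assumption: RMS energy stands in for the claimed
    "time-varying, computationally-defined audio feature".
    """
    frames = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        frames.append(math.sqrt(sum(x * x for x in frame) / frame_len))
    return frames


def prominence_share(feat_a, feat_b):
    """Compare the two feature tracks frame by frame.

    Returns performer A's share of visual prominence in [0, 1];
    performer B's share is 1 minus that value. Silent frames split evenly.
    """
    shares = []
    for a, b in zip(feat_a, feat_b):
        total = a + b
        shares.append(0.5 if total == 0 else a / total)
    return shares


def bias_for_recipient(shares, recipient_is_a, bias=0.2):
    """Derive a recipient-specific version of the mix (hypothetical scheme).

    Shifts prominence toward the recipient's own video and clamps to [0, 1],
    so the two devices receive corresponding but differing versions.
    """
    shift = bias if recipient_is_a else -bias
    return [min(1.0, max(0.0, s + shift)) for s in shares]


# Toy vocals: A sings in the first frame, B in the second.
vocals_a = [0.5] * 4 + [0.0] * 4
vocals_b = [0.0] * 4 + [0.5] * 4

shares = prominence_share(rms_per_frame(vocals_a, 4), rms_per_frame(vocals_b, 4))
mix_for_a = bias_for_recipient(shares, recipient_is_a=True)
mix_for_b = bias_for_recipient(shares, recipient_is_a=False)
```

In this sketch the same comparison drives both versions of the mix; only the bias toward the receiving performer differs. A real system would apply a comparable weighting to the vocal levels as well as to the video layout.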
Abstract
Audiovisual performances, including vocal music, are captured and coordinated with those of other users in ways that create compelling user experiences. In some cases, the vocal performances of individual users are captured (together with performance synchronized video) on mobile devices, television-type display and/or set-top box equipment in the context of karaoke-style presentations of lyrics in correspondence with audible renderings of a backing track. Contributions of multiple vocalists are coordinated and mixed in a manner that selects for visually prominent presentation performance synchronized video of one or more of the contributors. Prominence of particular performance synchronized video may be based, at least in part, on computationally-defined audio features extracted from (or computed over) captured vocal audio. Over the course of a coordinated audiovisual performance timeline, these computationally-defined audio features are selective for performance synchronized video of one or more of the contributing vocalists.
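One way to picture how audio features can be "selective" for a vocalist's video over the performance timeline is the sketch below. It is an illustration under stated assumptions, not the method described in the specification: the per-frame feature values are taken as given, the function name `prominent_segments` is hypothetical, and the `min_hold` rule (a challenger must lead for several consecutive frames before the featured video switches) is an added assumption to avoid rapid flicker.

```python
def prominent_segments(features, frame_sec, min_hold=2):
    """Turn per-frame feature values into timeline segments.

    features:  dict mapping vocalist name -> list of per-frame feature values
    frame_sec: duration of one frame in seconds
    min_hold:  consecutive frames a challenger must lead before a switch
               (hypothetical smoothing rule, not taken from the source)

    Returns a list of (start_sec, end_sec, vocalist) tuples.
    """
    names = list(features)
    n_frames = min(len(v) for v in features.values())
    segments = []
    current = None               # vocalist currently featured
    seg_start = 0                # frame index where the current segment began
    challenger, streak = None, 0

    for i in range(n_frames):
        # Vocalist with the highest feature value in this frame.
        leader = max(names, key=lambda n: features[n][i])
        if current is None:
            current = leader
        elif leader != current:
            streak = streak + 1 if leader == challenger else 1
            challenger = leader
            if streak >= min_hold:
                # Back-date the switch to where the challenger's run began.
                switch_at = i - streak + 1
                segments.append((seg_start * frame_sec, switch_at * frame_sec, current))
                current, seg_start = leader, switch_at
                challenger, streak = None, 0
        else:
            challenger, streak = None, 0

    if current is not None:
        segments.append((seg_start * frame_sec, n_frames * frame_sec, current))
    return segments


# Toy feature tracks for two vocalists, 0.5 s per frame.
toy = {
    "alice": [0.9, 0.8, 0.1, 0.7, 0.1, 0.1],
    "bob":   [0.1, 0.2, 0.5, 0.2, 0.6, 0.7],
}
timeline = prominent_segments(toy, frame_sec=0.5)
```

With these toy values, bob's single-frame lead at the third frame is ignored, and the featured video switches only once bob leads for two consecutive frames.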
91 Citations
26 Claims
1. Independent claim, recited in full under First Claim above.
Claims 2 through 26 depend from claim 1.
Specification