Optimized video snapshot

US 9,609,272 B2
Filed: 05/02/2013
Issued: 03/28/2017
Est. Priority Date: 05/02/2013
Status: Active Grant

First Claim

Patent Images

1. A method for presenting an aesthetic image, said method comprising:

receiving at an audio analysis tool, a set of audio and video streams for at least one of a plurality of users in a video conference, said audio and video streams being synchronized with each other;

analyzing said audio track of said one of a plurality of users in said video conference to determine when said user is an active speaker;

when said user is an active speaker, analyzing a speech signal of the audio track to identify aesthetic phonemes of said active speaker, wherein the aesthetic phonemes comprise phonemes that, when spoken by said active speaker, produce aesthetically pleasing face expressions; and

extracting an optimal image from said video stream of said active speaker corresponding to one of said aesthetic phonemes, said optimal image comprising a frame from said video stream.

View all claims

18 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Methods, media and devices for generating an optimized image snapshot from a captured sequence of persons participating in a meeting are provided. In some embodiments, methods media and devices for utilizing a captured image as a representative image of a person as a replacement of a video stream; as a representation of a person in offline archiving systems; or as a representation of a person in a system participant roster.

5 Citations

16 Claims

1. A method for presenting an aesthetic image, said method comprising:
- receiving at an audio analysis tool, a set of audio and video streams for at least one of a plurality of users in a video conference, said audio and video streams being synchronized with each other;
  
  analyzing said audio track of said one of a plurality of users in said video conference to determine when said user is an active speaker;
  
  when said user is an active speaker, analyzing a speech signal of the audio track to identify aesthetic phonemes of said active speaker, wherein the aesthetic phonemes comprise phonemes that, when spoken by said active speaker, produce aesthetically pleasing face expressions; and
  
  extracting an optimal image from said video stream of said active speaker corresponding to one of said aesthetic phonemes, said optimal image comprising a frame from said video stream.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
- - 2. The method of claim 1, wherein said process of analyzing the speech signal of the audio track comprises classifying phonemes of the speech signal into at least two sets of phonemes.
  - 3. The method of claim 2, wherein one of said at least two sets of phonemes is aesthetic phonemes.
  - 4. The method of claim 1, said method further comprising refining said determined aesthetic image utilizing audio parameters of the audio track.
  - 5. The method of claim 4, wherein said audio parameters comprise an estimation of the audio direction.
  - 6. The method of claim 4, wherein said audio parameters comprise a background noise estimation.
  - 7. The method of claim 4, wherein said audio parameters comprise cross talk detection.
  - 8. The method of claim 1, further comprising:
    - replacing said video stream on the video conference with said optimal image.

9. A system for presenting an aesthetic image, said system comprising:
- an audio analysis tool enabled to receive a set of audio and video streams for at least one of a plurality of user in a video conference, said audio and video streams being synchronized with each other, analyze said audio track of said one of a plurality of users in said video conference to determine when said one of a plurality of users is an active speaker, analyze a speech signal of the audio track to identify aesthetic phonemes of said active speaker, wherein the aesthetic phonemes comprise phonemes that, when spoken by said active speaker, produce aesthetically pleasing face expressions, and extract an optimal image from said video stream of said active speaker corresponding to one of said aesthetic phonemes, said optimal image comprising a frame from said video stream.
- View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
- - 10. The system of claim 9, wherein said analysis of the speech signal comprises classification of phonemes of the speech signal into at least two sets of phonemes.
  - 11. The system of claim 10, wherein one of said at least two sets of phonemes is aesthetic phonemes.
  - 12. The system of claim 9, wherein said tool is further enabled to refine said determination of the aesthetic image by utilizing audio parameters of the audio track.
  - 13. The system of claim 12, wherein said audio parameters comprise an estimation of the audio direction.
  - 14. The system of claim 12, wherein said audio parameters comprise a background noise estimation.
  - 15. The system of claim 12, wherein said audio parameters comprise cross talk detection.
  - 16. The system of claim 9, wherein the audio analysis tool is further enabled to:
    - replace said video stream on the video conference with said optimal image.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Arlington Technologies, LLC (Dominion Harbor Enterprises, LLC)
Original Assignee
Avaya Incorporated
Inventors
Wiener, Yair, Modai, Ori
Primary Examiner(s)
ZENATI, AMAL S

Application Number

US13/875,390
Publication Number

US 20140327730A1
Time in Patent Office

1,426 Days
Field of Search

348/14.07
US Class Current

1/1
CPC Class Codes

G10L 25/57 for processing of video sig...

H04N 7/15 Conference systems

Optimized video snapshot

First Claim

18 Assignments

0 Petitions

Accused Products

Abstract

5 Citations

16 Claims

Specification

Solutions

Use Cases

Quick Links

Optimized video snapshot

First Claim

18 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

5 Citations

16 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links