Systems and methods for manipulating electronic content based on speech recognition

  • US 9,311,395 B2
  • Filed: 06/09/2011
  • Issued: 04/12/2016
  • Est. Priority Date: 06/10/2010
  • Status: Active Grant
First Claim

1. A computer-implemented method for manipulating electronic multimedia content, the method comprising:

    generating, using a processor, a speech model, a non-speech model, at least one speaker model of an individual speaker, and a non-speaker speech model;

    receiving electronic media content over a network;

    extracting an audio track from the electronic media content;

    detecting speech segments within the extracted audio track based on the speech model and the non-speech model, the speech segments containing speech from at least one of a plurality of speakers;

    detecting a speaker segment within the detected speech segments based on the speaker model and the non-speaker speech model, the speaker segment containing speech from the individual speaker;

    calculating a first probability of the detected speaker segment involving the individual speaker based on the at least one speaker speech model and the non-speaker speech model;

    determining a ranking or filtration of the electronic media content relative to other electronic media content based on the first probability of the detected speaker segment;

    detecting a face within a part of the electronic media content corresponding to the detected speaker segment and calculating a second probability of the detected face being a face of the individual speaker; and

    adjusting the ranking or filtration of the electronic media content based on the second probability.
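
The last four steps of claim 1 (first probability, ranking, second probability, adjusted ranking) can be illustrated with a minimal sketch. The claim does not say how either probability is computed or how the two are combined, so everything specific here is an assumption made for illustration: the first probability is taken as the sigmoid of the log-likelihood ratio between the speaker model and the non-speaker speech model (equal priors), the adjustment is a weighted average controlled by a hypothetical `face_weight`, and the names `MediaItem`, `speaker_probability`, `initial_ranking`, and `adjusted_ranking` are invented for this example.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class MediaItem:
    """Hypothetical per-item scores; field names are illustrative only."""
    item_id: str
    loglik_speaker: float      # log-likelihood of the segment under the speaker model
    loglik_non_speaker: float  # log-likelihood under the non-speaker speech model
    face_probability: float    # second probability: detected face is the speaker's


def speaker_probability(loglik_speaker: float, loglik_non_speaker: float) -> float:
    """First probability, assumed here to be a sigmoid of the log-likelihood ratio."""
    llr = loglik_speaker - loglik_non_speaker
    # Numerically stable sigmoid: avoids overflow for large |llr|.
    if llr >= 0:
        return 1.0 / (1.0 + math.exp(-llr))
    e = math.exp(llr)
    return e / (1.0 + e)


def initial_ranking(items: List[MediaItem]) -> List[MediaItem]:
    """Rank items by the first (audio-based) probability only."""
    return sorted(
        items,
        key=lambda it: speaker_probability(it.loglik_speaker, it.loglik_non_speaker),
        reverse=True,
    )


def adjusted_ranking(items: List[MediaItem], face_weight: float = 0.5) -> List[MediaItem]:
    """Re-rank using a weighted average of both probabilities (an assumed rule)."""
    def score(it: MediaItem) -> float:
        p1 = speaker_probability(it.loglik_speaker, it.loglik_non_speaker)
        return (1.0 - face_weight) * p1 + face_weight * it.face_probability

    return sorted(items, key=score, reverse=True)


if __name__ == "__main__":
    catalog = [
        MediaItem("a", -10.0, -12.0, 0.1),
        MediaItem("b", -10.0, -11.0, 0.9),
        MediaItem("c", -10.0, -9.0, 0.5),
    ]
    print([it.item_id for it in initial_ranking(catalog)])
    print([it.item_id for it in adjusted_ranking(catalog)])
```

In this sketch an item with strong audio evidence but a weak face match can drop below an item whose face match is strong, which is the kind of adjustment the final step of the claim describes. Filtration rather than ranking could be sketched the same way by thresholding the combined score.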

  • 6 Assignments