Systems And Methods for Manipulating Electronic Content Based On Speech Recognition
First Claim
1. A computer-implemented method for manipulating electronic multimedia content, the method comprising:
- generating, using a processor, a speech model and at least one speaker model of an individual speaker;
receiving electronic media content over a network;
extracting an audio track from the electronic media content;
detecting speech segments within the electronic media content based on the speech model; and
detecting a speaker segment within the electronic media content and calculating a probability of the detected speaker segment involving the individual speaker based on the at least one speaker model.
6 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods are disclosed for displaying electronic multimedia content to a user. One computer-implemented method for manipulating electronic multimedia content includes generating, using a processor, a speech model and at least one speaker model of an individual speaker. The method further includes receiving electronic media content over a network; extracting an audio track from the electronic media content; and detecting speech segments within the electronic media content based on the speech model. The method further includes detecting a speaker segment within the electronic media content and calculating a probability of the detected speaker segment involving the individual speaker based on the at least one speaker model.
60 Citations
20 Claims
-
1. A computer-implemented method for manipulating electronic multimedia content, the method comprising:
-
generating, using a processor, a speech model and at least one speaker model of an individual speaker; receiving electronic media content over a network; extracting an audio track from the electronic media content; detecting speech segments within the electronic media content based on the speech model; and detecting a speaker segment within the electronic media content and calculating a probability of the detected speaker segment involving the individual speaker based on the at least one speaker model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A system for manipulating electronic multimedia content, the system comprising:
-
a data storage device storing instructions for manipulating electronic multimedia content; and a processor configured to execute the instructions stored in the data storage device for; generating a speech model and at least one speaker model of an individual speaker; receiving electronic media content over a network; extracting an audio track from the electronic media content; detecting speech segments within the electronic media content based on the speech model; and detecting a speaker segment within the electronic media content and calculating a probability of the detected speaker segment involving the individual speaker based on the at least one speaker model. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification