Method and system for processing multimedia content to dynamically generate text transcript
First Claim
1. A method for dynamically generating a text transcript, comprising:
segmenting, by an application server comprising a processor, each identified set of text frames to determine one or more spatial regions, wherein the one or more spatial regions comprise at least one or more portions of text content;
extracting, by an application server comprising a processor, one or more keywords from the one or more spatial regions, wherein the one or more keywords are extracted from one or more available portions of the text content in the one or more spatial regions;
determining, by an application server comprising a processor, a first set of keywords from the one or more extracted keywords by filtering one or more off-topic keywords from the one or more extracted keywords;
extracting, by an application server comprising a processor, a second set of keywords similar to or related with the first set of keywords, wherein the second set of keywords is retrieved from one or more knowledge databases;
generating, by an application server comprising a processor, a graph from a semantic relationship between the first set of keywords and the second set of keywords, wherein the graph comprises one or more first nodes and one or more second nodes with each node in the one or more first nodes corresponding with a keyword in the first set of keywords and each node in the one or more second nodes corresponding with a keyword in the second set of keywords; and
generating, by an application server comprising a processor, the text transcript of the audio content in the multimedia content using the generated graph, wherein the generating of the text transcript comprises utilizing at least one of an updated language module and an updated dictionary module to generate the text transcript of the audio content in the multimedia content.
Abstract
The disclosed embodiments illustrate a method and a system for processing multimedia content to generate a text transcript. The method includes segmenting each of a set of text frames to determine spatial regions. The method further includes extracting one or more keywords from each of the determined spatial regions. The method further includes determining a first set of keywords by filtering one or more off-topic keywords from the extracted keywords. The method further includes extracting a second set of keywords based on the determined first set of keywords. The method further includes generating a graph that links each keyword in the first set to one or more keywords in the second set. The method further includes dynamically generating the text transcript of audio content in the multimedia content based on the generated graph.
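The keyword steps summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the off-topic list, the in-memory knowledge database, and the function names `first_keywords`, `build_graph`, and `dictionary_terms` are all hypothetical stand-ins, and the sketch assumes keywords have already been extracted from the spatial regions. It shows only how a filtered first set and a retrieved second set might be linked in a graph whose nodes could then feed a recognizer's dictionary.

```python
from collections import defaultdict

# Hypothetical stand-ins: the source does not specify how off-topic keywords
# are detected or which knowledge databases are queried.
OFF_TOPIC = {"welcome", "thanks", "slide"}
KNOWLEDGE_DB = {
    "gradient": ["derivative", "slope"],
    "descent": ["optimization"],
}


def first_keywords(extracted):
    """Drop off-topic keywords, keeping first-seen order without duplicates."""
    seen, kept = set(), []
    for word in (w.lower() for w in extracted):
        if word not in OFF_TOPIC and word not in seen:
            seen.add(word)
            kept.append(word)
    return kept


def build_graph(first, knowledge_db):
    """Link each first-set keyword (first node) to related keywords (second nodes)."""
    graph = defaultdict(set)
    for keyword in first:
        graph[keyword].update(knowledge_db.get(keyword, []))
    return dict(graph)


def dictionary_terms(graph):
    """Collect every node so a recognizer's dictionary could be extended with it."""
    terms = set(graph)
    for related in graph.values():
        terms |= related
    return sorted(terms)


# Keywords as they might come out of the text regions of a lecture video.
extracted = ["Welcome", "Gradient", "descent", "slide", "gradient"]
first = first_keywords(extracted)
graph = build_graph(first, KNOWLEDGE_DB)
vocabulary = dictionary_terms(graph)
```

In this sketch the graph is a plain adjacency mapping from first-set keywords to second-set keywords; a real system would presumably weight the edges by semantic similarity and pass the resulting terms to the language and dictionary modules of a speech recognizer.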
11 Claims
1. A method for dynamically generating a text transcript, as recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
Specification