Cognitive print speaker modeler
First Claim
1. A computer-implemented method for subtitling of streaming video with audio, comprising executing on a computer processor:
- identifying a speaker in a streaming video with audio according to words spoken by the speaker matched to a cognitive print, wherein the cognitive print comprises a plurality of traits classified according to a hierarchical long short term memory (LSTM) model, wherein the hierarchical LSTM model comprises a plurality of layers of LSTMs and each layer corresponds to the classification of one trait of the plurality of traits;
- annotating a subtitle of the words spoken by the speaker, which decorates the subtitle with a label representative of the identified speaker; and
- adding the decorated subtitle to the streaming video with audio.
Abstract
Aspects of the present invention provide devices that subtitle streaming video with audio and identify a speaker in a streaming video with audio according to words spoken by the speaker matched to a cognitive print. The cognitive print includes traits classified according to a hierarchical long short term memory (LSTM) model. The hierarchical LSTM model includes layers of LSTMs, and each layer corresponds to the classification of one trait. A processor annotates a subtitle of the words spoken by the speaker, which decorates the subtitle with a label representative of the identified speaker, and streams the decorated subtitle with the streaming video with audio.
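The one-trait-per-layer structure described in the abstract can be sketched structurally in Python. This is only an illustration under stated assumptions: the trait names (`vocabulary`, `verbosity`), the thresholds, and the exact-match rule in `match_print` are hypothetical, and each "layer" is a trivial hand-written function standing in for an LSTM layer rather than a trained network.

```python
from typing import Callable, Optional

# A "layer" is any callable mapping (words, traits found so far) to one trait
# label. In the patent each layer is an LSTM; these are hypothetical stand-ins.
TraitLayer = Callable[[list[str], dict[str, str]], str]


def vocabulary_layer(words: list[str], traits: dict[str, str]) -> str:
    """Stand-in for the first layer: label vocabulary as formal or casual."""
    formal = {"therefore", "furthermore", "regarding"}
    return "formal" if any(w in formal for w in words) else "casual"


def verbosity_layer(words: list[str], traits: dict[str, str]) -> str:
    """Stand-in for the second layer; it conditions on the earlier trait."""
    threshold = 8 if traits["vocabulary"] == "formal" else 5
    return "verbose" if len(words) > threshold else "terse"


# Ordered hierarchy: each entry classifies exactly one trait.
LAYERS: list[tuple[str, TraitLayer]] = [
    ("vocabulary", vocabulary_layer),
    ("verbosity", verbosity_layer),
]


def classify_traits(utterance: str) -> dict[str, str]:
    """Run the utterance through every layer, collecting one trait per layer."""
    words = utterance.lower().replace(",", " ").replace(".", " ").split()
    traits: dict[str, str] = {}
    for name, layer in LAYERS:
        traits[name] = layer(words, traits)
    return traits


def match_print(traits: dict[str, str],
                prints: dict[str, dict[str, str]]) -> Optional[str]:
    """Return the speaker whose stored print agrees on every trait, else None."""
    for speaker, stored in prints.items():
        if stored == traits:
            return speaker
    return None


# Hypothetical stored cognitive prints, one trait dictionary per known speaker.
PRINTS = {
    "Narrator": {"vocabulary": "formal", "verbosity": "terse"},
    "Guest": {"vocabulary": "casual", "verbosity": "terse"},
}
```

Calling `match_print(classify_traits("Therefore, we proceed."), PRINTS)` walks the utterance through both layers and compares the resulting trait dictionary with each stored print. A real system would presumably score similarity rather than require exact agreement.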
20 Claims
1. A computer-implemented method for subtitling of streaming video with audio, comprising executing on a computer processor:
- identifying a speaker in a streaming video with audio according to words spoken by the speaker matched to a cognitive print, wherein the cognitive print comprises a plurality of traits classified according to a hierarchical long short term memory (LSTM) model, wherein the hierarchical LSTM model comprises a plurality of layers of LSTMs and each layer corresponds to the classification of one trait of the plurality of traits;
- annotating a subtitle of the words spoken by the speaker, which decorates the subtitle with a label representative of the identified speaker; and
- adding the decorated subtitle to the streaming video with audio.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. A system for subtitling streaming video with audio, comprising:
- a processor;
- a computer readable memory in circuit communication with the processor; and
- a computer readable storage medium in circuit communication with the processor;
wherein the processor executes program instructions stored on the computer-readable storage medium via the computer readable memory and thereby:
- identify a speaker in a streaming video with audio according to words spoken by the speaker matched to a cognitive print, wherein the cognitive print comprises a plurality of traits classified according to a hierarchical long short term memory (LSTM) model, wherein the hierarchical LSTM model comprises a plurality of layers of LSTMs and each layer of LSTMs corresponds to the classification of one trait of the plurality of traits;
- annotate a subtitle of the words spoken by the speaker, which decorates the subtitle with a label representative of the identified speaker; and
- stream the decorated subtitle with the streaming video with audio.
Dependent claims: 10, 11, 12, 13, 14
15. A computer program product for subtitling streaming video with audio, the computer program product comprising:
a computer readable storage medium having computer readable program code embodied therewith, wherein the computer readable storage medium is not a transitory signal per se, the computer readable program code comprising instructions for execution by a processor that causes the processor to:
- identify a speaker in a streaming video with audio according to words spoken by the speaker matched to a cognitive print, wherein the cognitive print comprises a plurality of traits classified according to a hierarchical long short term memory (LSTM) model, wherein the hierarchical LSTM model comprises a plurality of layers of LSTMs and each layer corresponds to the classification of one trait of the plurality of traits;
- annotate a subtitle of the words spoken by the speaker, which decorates the subtitle with a label representative of the identified speaker; and
- stream the decorated subtitle with the streaming video with audio.
Dependent claims: 16, 17, 18, 19, 20
Specification