Systems and methods for transcribing videos using speaker identification
First Claim
Patent Images
1. A method for transcribing video, comprising:
- receiving, by a computer-based system from a network, a video feed having video data, audio data, and closed-captioning data, the closed-captioning data indicative of speech defined by the audio data;
identifying a speech segment within the closed-captioning data;
automatically defining, by the computer-based system, a transcript of the speech based on the closed-captioning data in the video feed received from the network;
automatically analyzing, by the computer-based system, the video data, audio data, and closed-captioning data by the computer-based system;
automatically identifying, by the computer-based system, a speaker for the speech segment within the transcript based on the analyzing;
automatically marking, by the computer-based system, the speech segment in the transcript with an identifier of the speaker thereby attributing the speech segment to the speaker;
summarizing the transcript, wherein the summarizing comprises determining an overall percentage of speech attributable to the speaker for the closed-captioning data and selecting portions of the transcript for removal based on the determined overall percentage; and
storing the transcript in memory.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and method for summarizing video feeds correlated to identified speakers. A transcriber system includes multiple types of reasoning logic for identifying specific speakers contained in the video feed. Each type of reasoning logic is stored in memory and may be combined and configurable to provide an aggregated speaker identification result useful for full or summarized transcription before transmission across a network for display on a network-accessible device.
-
Citations
24 Claims
-
1. A method for transcribing video, comprising:
-
receiving, by a computer-based system from a network, a video feed having video data, audio data, and closed-captioning data, the closed-captioning data indicative of speech defined by the audio data; identifying a speech segment within the closed-captioning data; automatically defining, by the computer-based system, a transcript of the speech based on the closed-captioning data in the video feed received from the network; automatically analyzing, by the computer-based system, the video data, audio data, and closed-captioning data by the computer-based system; automatically identifying, by the computer-based system, a speaker for the speech segment within the transcript based on the analyzing; automatically marking, by the computer-based system, the speech segment in the transcript with an identifier of the speaker thereby attributing the speech segment to the speaker; summarizing the transcript, wherein the summarizing comprises determining an overall percentage of speech attributable to the speaker for the closed-captioning data and selecting portions of the transcript for removal based on the determined overall percentage; and storing the transcript in memory. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. A transcriber system, comprising:
-
a network interface for receiving a video feed from a network, the video feed having video data, audio data, and closed-captioning data, wherein the closed-captioning data is indicative of speech defined by the audio data; memory; and logic configured to define a transcript of the speech based on the closed-captioning data in the video feed received by the network interface, the logic configured to identify a speech segment within the closed-captioning data, the logic further configured to analyze the video feed and to identify a speaker for the speech segment within the transcript, wherein the logic is configured to mark the speech segment in the transcript with an identifier of the speaker thereby attributing the speech segment to the speaker, wherein the logic is configured to determine an overall percentage of speech attributable to the speaker for the closed-captioning data and to summarize the transcript by selecting portions of the transcript for removal based on the determined overall percentage, and wherein the logic is configured to store the transcript in the memory. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
Specification