Visual similarity for video content
First Claim
Patent Images
1. A method comprising:
- receiving a stream of video content;
generating speech to text for the received video content;
generating passage level annotations from the generated text using natural language processing (NLP);
associating the passage level annotations with a timeline; and
associating imagery with the text to generate thumbnails at periodic time intervals resulting in a database of annotations to imagery and imagery to annotations.
5 Assignments
0 Petitions
Accused Products
Abstract
Methods and apparatus, including computer program products, for visual similarity. A method includes receiving a stream of video content, generating interpretations of the received video content using speech/natural language processing (NLP), associating the interpretations of the received video content with images extracted from video content based on timeline, and using the interpretations to obtain interpretations of other images or other video content.
14 Citations
19 Claims
-
1. A method comprising:
-
receiving a stream of video content; generating speech to text for the received video content; generating passage level annotations from the generated text using natural language processing (NLP); associating the passage level annotations with a timeline; and associating imagery with the text to generate thumbnails at periodic time intervals resulting in a database of annotations to imagery and imagery to annotations. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An apparatus comprising:
-
a local computing system linked to a network of interconnected computer systems, the local computing system comprising a processor, a memory and a storage device; the memory comprising an operating system and a visual similarity process, the visual similarity process comprising; receiving a stream of video content; generating speech to text for the received video content; generating passage level annotations from the generated text using natural language processing (NLP); associating the passage level annotations with the text time aligned to result in text, annotations and a time stamp; and associating imagery with the annotated text to generate thumbnails at periodic time intervals resulting in a database of annotations to imagery and imagery to annotations. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
Specification