Visual similarity
First Claim
1. A method comprising:
receiving a stream of video content;
generating interpretations of the received video content using speech/natural language processing (NLP);
associating the interpretations of the received video content with images extracted from video content based on timeline; and
using the interpretations in the database to obtain interpretations of other images or other video content.
Abstract
Methods and apparatus, including computer program products, for visual similarity. A method includes receiving a stream of video content, generating interpretations of the received video content using speech/natural language processing (NLP), associating the interpretations of the received video content with images extracted from video content based on timeline, and using the interpretations to obtain interpretations of other images or other video content.
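As a rough illustration of the last step in the abstract, the sketch below obtains an interpretation for a new image by finding the most visually similar stored image. Everything in it is an assumption made for illustration: the abstract does not say how images are compared, so the feature vectors, the cosine-similarity measure, the sample data, and the names `InterpretedImage` and `interpret_by_similarity` are all hypothetical.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class InterpretedImage:
    """A stored video frame with the interpretation attached to it."""
    timestamp_s: float      # position of the frame on the video timeline
    features: list[float]   # hypothetical image feature vector (assumption)
    interpretation: str     # text produced by speech/NLP for that time


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def interpret_by_similarity(query_features, database):
    """Return the interpretation of the stored image most similar to the query."""
    if not database:
        return None
    best = max(
        database,
        key=lambda img: cosine_similarity(query_features, img.features),
    )
    return best.interpretation


# Made-up entries standing in for frames already associated with interpretations.
database = [
    InterpretedImage(12.0, [0.9, 0.1, 0.0], "weather forecast"),
    InterpretedImage(47.5, [0.1, 0.8, 0.3], "football highlights"),
]

result = interpret_by_similarity([0.2, 0.7, 0.4], database)
print(result)
```

A nearest-neighbour lookup is only one plausible reading of "using the interpretations to obtain interpretations of other images"; the record itself does not commit to a particular matching technique.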
28 Claims
1. A method comprising:
receiving a stream of video content;
generating interpretations of the received video content using speech/natural language processing (NLP);
associating the interpretations of the received video content with images extracted from video content based on timeline; and
using the interpretations in the database to obtain interpretations of other images or other video content.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
-
10. A method comprising:
receiving a stream of video content;
generating speech to text for the received video content;
generating passage level annotations from generated text using natural language processing (NLP);
associating the passage level annotations with the text from the speech time aligned to result in text, annotations and a time stamp; and
associating imagery with the annotated text to generate thumbnails at periodic time intervals resulting in a database of annotations to imagery and imagery to annotations.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19
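The steps of claim 10 can be sketched as a small two-way index. This is a minimal illustration under stated assumptions, not the claimed implementation: the transcript passages stand in for real speech-to-text output, the keyword table `KEYWORDS` stands in for passage-level NLP annotation, and each thumbnail is represented only by its timestamp. The names `annotate` and `build_index` are hypothetical.

```python
from collections import defaultdict

# Hypothetical speech-to-text output: (start_s, end_s, text) passages.
passages = [
    (0.0, 10.0, "Heavy rain is expected across the coast"),
    (10.0, 25.0, "The home team scored twice in the final"),
]

# Stand-in for passage-level NLP annotation: a fixed keyword-to-label table.
KEYWORDS = {
    "rain": "weather",
    "coast": "geography",
    "team": "sports",
    "scored": "sports",
}


def annotate(text):
    """Return the sorted labels whose keyword occurs in the passage."""
    words = text.lower().split()
    return sorted({label for word, label in KEYWORDS.items() if word in words})


def build_index(passages, duration_s, interval_s):
    """Build imagery-to-annotations and annotations-to-imagery mappings.

    A thumbnail is taken every interval_s seconds and linked to the
    annotations of the passage whose time span contains that instant.
    """
    imagery_to_annotations = {}
    annotations_to_imagery = defaultdict(list)
    index = 0
    while index * interval_s < duration_s:
        t = index * interval_s  # timestamp of this periodic thumbnail
        for start, end, text in passages:
            if start <= t < end:
                labels = annotate(text)
                imagery_to_annotations[t] = labels
                for label in labels:
                    annotations_to_imagery[label].append(t)
        index += 1
    return imagery_to_annotations, dict(annotations_to_imagery)


by_image, by_annotation = build_index(passages, duration_s=25.0, interval_s=5.0)
print(by_image)
print(by_annotation)
```

Keeping both mappings mirrors the claim's "annotations to imagery and imagery to annotations": one answers "what is this thumbnail about?", the other "which thumbnails show this topic?".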
-
20. An apparatus comprising:
a local computing system linked to a network of interconnected computer systems, the local computing system comprising a processor 18, a memory and a storage device;
the memory comprising an operating system and a visual similarity process, the visual similarity process comprising;
receiving a stream of video content;
generating speech to text for the received video content;
generating passage level annotations from generated text using natural language processing (NLP);
associating the passage level annotations with the text from the speech time aligned to result in text, annotations and a time stamp; and
associating imagery with the annotated text to generate thumbnails at periodic time intervals resulting in a database of annotations to imagery and imagery to annotations.
Dependent claims: 21, 22, 23, 24, 25, 26, 27, 28
Specification