Data recognition in content
First Claim
1. A method, comprising:
- identifying a set of entities in video content;
for a scene in the video content, identifying a first confidence value vector that is representative of features of the scene and that is a result of a video recognition process;
for the scene, identifying a second confidence value vector that is representative of features of the scene and that is a result of an audio recognition process; and
based on the first confidence value vector and the second confidence value vector, determining, by a computing device, at least one identifier that defines whether an entity in the set of entities is present in the scene.
1 Assignment
0 Petitions
Accused Products
Abstract
The disclosure relates to recognizing data such as items or entities in content. In some aspects, content may be received and feature information, such as face recognition data and voice recognition data may be generated. Scene segmentation may also be performed on the content, grouping the various shots of the video content into one or more shot collections, such as scenes. For example, a decision lattice representative of possible scene segmentations may be determined and the most probable path through the decision lattice may be selected as the scene segmentation. Upon generating the feature information and performing the scene segmentation, one or more items or entities that are present in the scene may be identified.
34 Citations
20 Claims
-
1. A method, comprising:
-
identifying a set of entities in video content; for a scene in the video content, identifying a first confidence value vector that is representative of features of the scene and that is a result of a video recognition process; for the scene, identifying a second confidence value vector that is representative of features of the scene and that is a result of an audio recognition process; and based on the first confidence value vector and the second confidence value vector, determining, by a computing device, at least one identifier that defines whether an entity in the set of entities is present in the scene. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An apparatus, comprising:
-
one or more processors; memory storing executable instructions configured to, with the one or more processors, cause the apparatus to; identify a set of entities in video content; for a scene in the video content, identify a first confidence value vector that is representative of features of the scene and that is a result of a video recognition process; for the scene, identify a second confidence value vector that is representative of features of the scene and that is a result of an audio recognition process; and based on the first confidence value vector and the second confidence value vector, determine at least one identifier that defines whether an entity in the set of entities is present in the scene. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A method comprising:
-
performing feature recognition on video content using a at least a video recognition technique and an audio recognition technique, which results in feature information for the video content; determining, based on a selection of a path from a plurality of possible paths through a node lattice that comprises at least one of a scene boundary node or a non-scene boundary node for each shot in the video content, defining boundaries of a scene in the video content; identify, from the feature information, a set of confidence value vectors for the scene that comprises a first confidence value vector for the video recognition technique and a second confidence value vector for the audio recognition technique; and identify one or more items present in the scene based on the set of confidence value vectors. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification