DYNAMIC HYBRID MODELS FOR MULTIMODAL ANALYSIS
First Claim
Patent Images
1. A multimodal data analyzer comprising instructions embodied in one or more non-transitory machine accessible storage media, the multimodal data analyzer configured to cause a computing system comprising one or more computing devices to:
- access a set of time-varying instances of multimodal data having at least two different modalities, each instance of the multimodal data having a temporal component; and
algorithmically learn a feature representation of the temporal component of the multimodal data using a deep learning architecture.
1 Assignment
0 Petitions
Accused Products
Abstract
Technologies for analyzing temporal components of multimodal data to detect short-term multimodal events, determine relationships between short-term multimodal events, and recognize long-term multimodal events, using a deep learning architecture, are disclosed.
-
Citations
20 Claims
-
1. A multimodal data analyzer comprising instructions embodied in one or more non-transitory machine accessible storage media, the multimodal data analyzer configured to cause a computing system comprising one or more computing devices to:
-
access a set of time-varying instances of multimodal data having at least two different modalities, each instance of the multimodal data having a temporal component; and algorithmically learn a feature representation of the temporal component of the multimodal data using a deep learning architecture. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A method for classifying multimodal data, the multimodal data comprising data having at least two different modalities, the method comprising, with a computing system comprising one or more computing devices:
-
accessing a set of time-varying instances of multimodal data, each instance of the multimodal data having a temporal component; and algorithmically classifying the set of time-varying instances of multimodal data using a discriminative temporal model, the discriminative temporal model trained using a feature representation generated by a deep temporal generative model based on the temporal component of the multimodal data. - View Dependent Claims (13, 14, 15)
-
-
16. A system for algorithmically recognizing a multimodal event in data, the system comprising:
-
a data access module to access a set of time-varying instances of multimodal data, each instance of the multimodal data having a temporal component; a classifier module to classify different instances in the set of time-varying instances of multimodal data as indicative of different short-term events; and an event recognizer module to (i) recognize a longer-term multimodal event based on a plurality of multimodal short-term events identified by the classifier module and (ii) generate a semantic label for the recognized multimodal event. - View Dependent Claims (17, 18, 19, 20)
-
Specification