Video Understanding
  Action
    Temporal Action Detection
    Spatial Temporal Action Detection
    Trimmed Action Recognition
    Action Anticipation
  Video
    Video Editing
    Video Retrieval
    Video Style Transfer
    Video Segmentation
  Text
    Video Caption
    Localization by Language
    Video QA
  Audio
    Audio-Visual Separation/Localization
    Audio-Visual Matching/Recognition
    Audio-Visual Generation
recognition/match/retrieval