[KD][AR] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
[KD][AR] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
[KD][AR] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
[OD] SAM-DETR: Accelerating DETR Convergence via Semantic-Aligned Matching
[SSL2][AR] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
[AR] UNIFORMERV2: SPATIOTEMPORAL LEARNING ARMING IMAGE VITS WITH VIDEO UNIFORMER
[DM][PS] CustomDiffusion: Multi-Concept Customization of Text-to-Image Diffusion