[MM] BLIP: Bootstrapping Language-Image Pretraining for Unified Vision-Language Understanding and Generation
[DM][LG] LayoutDM: Discrete Diffusion Model for Controllable Layout Generation
[DM][LG] LACE: Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints
[L][LG] LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models
[GM][OD] DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception