[MM] BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
[MM] BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
[MM] BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
[MM] BLIP: Bootstrapping Language-Image Pretraining for Unified Vision-Language Understanding and Generation
[DM][LG] LayoutDM: Discrete Diffusion Model for Controllable Layout Generation
[DM][LG] LACE: Towards Aligned Layout Generation Via Diffusion Model With Aesthetic Constraints
[L][LG] LayoutNUWA: Revealing The Hidden Layout Expertise of Large Language Models