[L][LG] LayoutNUWA: Revealing The Hidden Layout Expertise of Large Language Models
[GM][OD] DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
[GM][3DOD] MagicDrive: Street View Generation with Diverse 3D Geometry Control
[GM][OD] GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation
[MM] InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks