[Layout] Interactively Optimizing Layout Transfer for Vector Graphics
[Layout] VLT: Interactively Optimizing Layout Transfer for Vector Graphics
[MM] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
[Layout] LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
[MM] Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
[MM][GUI] V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM