[Agent] Magma: A Foundation Model for Multimodal AI Agents
[Agent] Mind2Web: Towards a Generalist Agent for the Web
[Agent] WEB-SHEPHERD: Advancing PRMs for Reinforcing Web Agents
[Agent] VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
[Agent] PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides