VisualWebArena: Evaluating Multimodal Agents on Realistic Visually Grounded Web Tasks
[WebAgent] VisualWebArena: Evaluating Multimodal Agents on Realistic Visually Grounded Web Tasks
[WebAgent] VisualWebArena: Evaluating Multimodal Agents on Realistic Visually Grounded Web Tasks
[WebAgent] CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation
[WebAgent] WebRL: Training LLM Web Agents Via Self-Evolving Online Curriculum Reinforcement Learning
[GUI] DesignCoder: Hierarchy-Aware and Self-Correcting UI Code Generation with Large Language Models
[WebAgent] The BrowserGym Ecosystem for Web Agent Research