Recent posts
[WebAgent] VisualWebArena: Evaluating Multimodal Agents on Realistic Visually Grounded Web Tasks
[WebAgent] CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation
[WebAgent] WebRL: Training LLM Web Agents Via Self-Evolving Online Curriculum Reinforcement Learning
[GUI] DesignCoder: Hierarchy-Aware and Self-Correcting UI Code Generation with Large Language Models