A Survey of WebAgents: Towards Next-Generation AI Agents for Web Automation with Large Foundation Models
[WebAgent] A Survey of WebAgents: Towards Next-Generation AI Agents for Web Automation with Large Foundation Models
[WebAgent] A Survey of WebAgents: Towards Next-Generation AI Agents for Web Automation with Large Foundation Models
[Layout] PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation
[Layout] DocMark: Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding
[Retrieval] NV-Retriever: Improving text embedding models with efective hard-negative mining
[Agent] AGENT S: AN OPEN AGENTIC FRAMEWORK THAT USES COMPUTERS LIKE A HUMAN