SEAL: Suite for Evaluating API-use of LLMs

Foundational AI/NLP

Sep 23

Written By Emergence AI

https://arxiv.org/abs/2409.15523

Foundational AI/NLP

Emergence AI

Multimodal Auto Validation For Self-Refinement in Web Agents

The Ungrounded Alignment Problem