1st Workshop: Assessing and Improving the Capabilities and Safety of Agents

Share the event

This workshop brings together two critical threads in the development of autonomous agents: their growing capabilities and the imperative to ensure their safe and aligned deployment. As agents become increasingly capable of long-horizon reasoning, goal-directed behavior, and autonomous decision-making, understanding how to measure and extend these capabilities remains a central research challenge. At the same time, these advances raise urgent questions about robustness, misalignment, risk mitigation, and unintended behaviors.

The workshop will explore current bottlenecks to agent development—including the challenge of collecting meaningful agentic data (via human interaction or synthetic generation) and ensuring generalization without mode collapse—as well as cutting-edge approaches to building agents that are safe by design. By bringing together researchers working on both the capabilities and safety of agents, this workshop aims to foster cross-pollination between communities and develop a shared research agenda for the next generation of autonomous systems.

Confirmed Speakers

Aishwarya Agrawal (Université de Montréal), Prithviraj “Raj” Ammanabrolu (UCSD), Andrea Bajcsy (Carnegie Mellon University), Yoshua Bengio (IVADO, Université de Montréal, Mila), Yonathan Bisk (CMU), Joyce Chai (University of Michigan), Alexandre Drouin (ServiceNow & Université Laval), Krishanmurthy “DJ” Dvijotham (ServiceNow), Daniel Fried (Carnegie Mellon University), Zhijing Jin (University of Toronto), Mirella Lapata (The University of Edinburgh), Bo Li (UIUC), Graham Neubig (Carnegie Mellon University), Ofir Press (Princeton University), Siva Reddy (IVADO, McGill University, Mila), Abhik Roychoudhury (National University of Singapore), Ruslan “Russ” Salakhutdinov (Carnegie Mellon University), Dawn Song  (UC Berkeley), Yu Su (The Ohio State University), Alane Suhr (UC Berkeley), Huan Sun (The Ohio State University), Ivan Titov (University of Edinburgh), Xin Eric Wang (UCSB, Simular), Tao Yu (University of Hong Kong), Victor Zhong (University of Waterloo, Vector Institute).

More to be announced.