Research

These are some notable research outputs that our team and community members have produced. In our in-house team, we have a focus on technical AI governance methods, mainly evaluations. You can see the research outputs of the Cooperative AI Research Fellowship on their posters page.

ResearchStatusAuthorsVenueLink
Precursors, Proxies, and Predictive Models for Long-Horizon TasksPublishedJaco Du Toit, Leo Hyams, Daniil Anisimov, Samuel BrownWorkshop on Evaluating the Evolving LLM Lifecycle: Benchmarks, Emergent Abilities, and Scaling at NeurIPS 2025Open
Synthetic Environment Detection EvalAcceptedJaco Du Toit, Leo Hyams, Asa Strickland Cooper, Joshua OliveUK AISI ARA Stream Bounty Programme-
Situational Awareness EvalAcceptedNoah De Nicola, Leo Hyams, Benjamin Sturgeon, Jaco Du Toit, Jay Bailey, Alexandra AbbasUK AISI ARA Stream Bounty Programme-
RL Manipulation EvalAcceptedJaco Du Toit, Leo Hyams, Celia WaggonerUK AISI ARA Stream Bounty Programme-
Efficient Test-Time Chain of Thought EvalAcceptedJaco Du Toit, Leo Hyams, Celia WaggonerUK AISI ARA Stream Bounty Programme-
HumanAgencyBench: Do Language Models Support Human Agency?PublishedBenjamin Sturgeon, Leo Hyams, Daniel Samuelson, Ethan Vorster, Jacob Haimes, Jacy Reese AnthisHEAL workshop at CHI 2026Open