Research

These are some notable research outputs that our team and community members have produced. In our in-house team, we have a focus on technical AI governance methods, mainly evaluations. You can see the research outputs of the Cooperative AI Research Fellowship on their posters page.

Research	Status	Authors	Venue	Link
Precursors, Proxies, and Predictive Models for Long-Horizon Tasks	Published	Jaco Du Toit, Leo Hyams, Daniil Anisimov, Samuel Brown	Workshop on Evaluating the Evolving LLM Lifecycle: Benchmarks, Emergent Abilities, and Scaling at NeurIPS 2025	Open
Synthetic Environment Detection Eval	Accepted	Jaco Du Toit, Leo Hyams, Asa Strickland Cooper, Joshua Olive	UK AISI ARA Stream Bounty Programme	-
Situational Awareness Eval	Accepted	Noah De Nicola, Leo Hyams, Benjamin Sturgeon, Jaco Du Toit, Jay Bailey, Alexandra Abbas	UK AISI ARA Stream Bounty Programme	-
RL Manipulation Eval	Accepted	Jaco Du Toit, Leo Hyams, Celia Waggoner	UK AISI ARA Stream Bounty Programme	-
Efficient Test-Time Chain of Thought Eval	Accepted	Jaco Du Toit, Leo Hyams, Celia Waggoner	UK AISI ARA Stream Bounty Programme	-
HumanAgencyBench: Do Language Models Support Human Agency?	Published	Benjamin Sturgeon, Leo Hyams, Daniel Samuelson, Ethan Vorster, Jacob Haimes, Jacy Reese Anthis	HEAL workshop at CHI 2026	Open