Research
These are some notable research outputs that our team and community members have produced. In our in-house team, we have a focus on technical AI governance methods, mainly evaluations. You can see the research outputs of the Cooperative AI Research Fellowship on their posters page.
| Research | Status | Authors | Venue | Link |
|---|---|---|---|---|
| Precursors, Proxies, and Predictive Models for Long-Horizon Tasks | Published | Jaco Du Toit, Leo Hyams, Daniil Anisimov, Samuel Brown | Workshop on Evaluating the Evolving LLM Lifecycle: Benchmarks, Emergent Abilities, and Scaling at NeurIPS 2025 | Open |
| Synthetic Environment Detection Eval | Accepted | Jaco Du Toit, Leo Hyams, Asa Strickland Cooper, Joshua Olive | UK AISI ARA Stream Bounty Programme | - |
| Situational Awareness Eval | Accepted | Noah De Nicola, Leo Hyams, Benjamin Sturgeon, Jaco Du Toit, Jay Bailey, Alexandra Abbas | UK AISI ARA Stream Bounty Programme | - |
| RL Manipulation Eval | Accepted | Jaco Du Toit, Leo Hyams, Celia Waggoner | UK AISI ARA Stream Bounty Programme | - |
| Efficient Test-Time Chain of Thought Eval | Accepted | Jaco Du Toit, Leo Hyams, Celia Waggoner | UK AISI ARA Stream Bounty Programme | - |
| HumanAgencyBench: Do Language Models Support Human Agency? | Published | Benjamin Sturgeon, Leo Hyams, Daniel Samuelson, Ethan Vorster, Jacob Haimes, Jacy Reese Anthis | HEAL workshop at CHI 2026 | Open |
