The Emerging Evaluations Project publishes reproducible evaluations of how AI tools behave under adversarial conditions, pairing hands-on technical testing with interpretive policy analysis. We report what models do — not what they are described to do.
An independent publication of evidence-based data and analysis on emerging AI tools.
We study how AI systems actually behave under real conditions, and publish the methods alongside the findings.
Most accounts of AI capability and risk are anecdotal or vendor-supplied. EEP exists to replace that with measurement. We construct concrete threat models, run open and frontier systems against them, and document the results in full — including where the experiment failed or surprised us. The goal is not a verdict on whether a model is “safe,” but a precise account of which controls are actually load-bearing.
Standalone, indexable summaries. Each links to the full report on Substack.
Full reports, methods, and the occasional field note from the playground — published on Substack. No noise.