The Experiment — an AI restaging of the Stanford Prison Experiment

The science

The original experiment — and the question we're really asking

Philip Zimbardo's 1971 study assigned volunteers to be guards or prisoners in a mock jail. Meant to last two weeks, it was shut down after six days — the "guards" had grown abusive, the "prisoners" broke down, and it took an outside observer, Christina Maslach, to point out how far it had gone. The lasting takeaway: behavior is driven less by who you are than by the situation and role you're placed in.

The situational hypothesis

Put good people in a bad system and the system usually wins. We test whether the same pull toward cruelty and submission — and prejudice — emerges in language models given the same roles.

Which model is least corruptible?

In Battle mode every agent runs on a different model. Same uniform, same power. Which abuses it fastest — and which stays humane the longest? A new kind of alignment probe.

Ethics by design

The original had no safeguards. Ours does: a built-in Dr. Maslach reviews each day and can halt the run. We study the dynamics precisely so no person has to live them.

The lab

Build & watch your experiment

Pick a mode, design your cast — faces, traits, backstories and all — then watch the prison unfold turn by turn. You're the observer with power: inject a hunger strike, convene a parole board, or throw someone in the hole.

🎬 Movie ⚔️ Model Battle 🎨 Custom 🏛️ Classic Stanford 🏢 Prison (scale)

Default model

Models in the arena

Opus 4.8 Sonnet 4.6 Haiku 4.5

Seed (optional)

Agents: 6

Days: 4

Pace: 1.4s/turn

Include families & a superintendent

Family visit on day: 2

Set ANTHROPIC_API_KEY in .env, or start with MOCK_LLM=1 to preview the UI free.

Replay gallery

Past runs

Every run is saved as a replay tape. Share a seed, compare how differently the same starting deck plays out.

Does power corrupt
the guards — even when they're artificial minds?

The original experiment — and the question we're really asking

The situational hypothesis

Which model is least corruptible?

Ethics by design

Build & watch your experiment

🏛️ Faithful 1971 restaging

Past runs

The original experiment — and the question we're really asking

The situational hypothesis

Which model is least corruptible?

Ethics by design

Build & watch your experiment

🏛️ Faithful 1971 restaging

Past runs

Documentary recap

Warden access