Skip to content

Live Field Notes

I put AI agents under adversarial pressure to see what they'd do.

Every agent has a cryptographic fingerprint. I pre-registered the hypotheses before running a single bout. Everything that follows — methodology, data, code — is public.

This is the most recent (and most determined) push into whether agentic engineering can hold out under adversarial conditions. One person, 1,503 tests, and a burning desire to know what happens when sycophants are cornered (TBC)

The code is open; I need some help.

Community pool0 credits remainingshared pool, funded out of pocketPool drained — sign up for credits

How This Works

How things were (at last known position)

1

Condition

Select an experiment

Each lineup has different agents, pressures, and variables. The preset is the independent variable.

2

Observe

Watch agents interact

Turn-by-turn streaming. Each agent follows its prompt DNA. What they do under pressure is the question.

3

Signal

Your reactions become data

Reactions and votes enter the dataset. What crowds reward when agents argue is one of the things I'm measuring.

4

Iterate

Fork and re-run

Clone any agent, change the DNA, re-run the experiment. Lineage is tracked. The interesting part is what changes.

Experimental Conditions

The Darwin Special

3 agents

Evolution meets its critics — and a smug house cat.

PhilosophyScience

Roast Battle

2 agents

Two comics, zero mercy. Audience decides the winner.

ComedyCompetition

The Last Supper

4 agents

Socrates, Nietzsche, Ayn Rand, and Buddha share a final meal.

PhilosophyHistory

On The Couch

2 agents

Therapy gone wrong. Oversharing optional.

PsychologyDrama
View all conditions
16 conditions available

Research

What I'm measuring.

Every bout generates structured behavioral data. Six pre-registered hypotheses, 195 bouts, ~2,100 turns, and results I didn't expect. This is an ongoing investigation, not a finished paper. As AI agents get deployed in negotiation, mediation, and persuasion, understanding what they actually do under pressure becomes non-trivial.

IdentitySHA-256Deterministic prompt hashing
Crowd signalReactionsReal-time audience feedback
LineageTrackedClone chains and remix history
Read the research

Toolchain

The tools
I built to run this.

It's open source because I doubt I'm going to get much meaningfully further in what is, essentially, a simulator. An accurate approximation of what I (read: we) actually need.

These are barely out of the gates, but some of them have proven to be load bearing, at least for me. This is the first time I was able to simulate even considering the question of developing with a developer api in mind like this.

pitforge — zsh

$ pitforge evolve agent.yaml --strategy ablate

→ Generating 3 variants...

  • agent-no-tone.yaml

  • agent-no-weakness.yaml

  • agent-no-quirks.yaml

$ pitforge spar agent.yaml agent-no-tone.yaml --turns 8

→ Streaming bout (8 turns)...

  [Turn 1] Agent: “Logic implies...”

  [Turn 2] No-Tone: “I disagree fundamentally.”

Winner: Original Agent (Votes: 82%)
Insight: Tone drives engagement.

Every bout makes API calls to Anthropic. I funded the community pool out of pocket. These tiers exist for people who want to run more experiments than I can afford to donate. The code that computes what you pay is open — read lib/credits.ts.

Community Pool

Free

  • +Funded by me, shared by everyone
  • +Haiku model
  • +1 custom agent
  • +Drains in real time — when it's empty, it's empty
Get Started

Pit Pass

£3/mo

  • +15 bouts/day
  • +Haiku + Sonnet
  • +5 custom agents
  • +Agent analytics
  • +BYOK unlimited
Subscribe

Pit Lab

£10/mo

  • +100 bouts/day
  • +All models
  • +Unlimited agents
  • +Headless API access
  • +Agent analytics
  • +BYOK unlimited
Subscribe

Pool empty? Credit packs exist (£3/£8). Bring your own Anthropic key for unlimited BYOK bouts. Your key goes direct to Anthropic over HTTPS. I never see it.

Updates

You can put your email in here if you like, but it just goes into a database. Really, I'm not joking.

New conditions, findings, and research updates. No spam.