AI Incentives Lab · Glossary

AI doesn't fail at the model layer. It fails at the incentive layer.

A practitioner's library of 85 concepts across alignment, adoption, governance, risk, workflow, strategy, trust, talent, and economics — the human-machine alignment work that determines whether AI investment actually compounds.

The model is innocent. The objective function is the crime scene.

Speed Tax ↗

Reward hacking at training speed is reward hacking at civilizational speed.

Root-Out Framework →

AI Incentives · Diagnostic

What incentive is your AI strategy actually rewarding?

An 8-question diagnostic surfaces the hidden incentive your AI rollout is rewarding inside your org — usually not the one your roadmap claims. Four minutes. Members unlock the structural fix.

Take the diagnostic →

Lessons · The Incentive Layer of AI

Four lessons before the catalog.

The catalog below is the concept library. These four lessons teach how alignment, adoption, and reward design actually play out in real organizations.

AI failures are incentive failures in disguise

AI · Alignment

8 min

Most enterprise AI failures get blamed on data quality, model accuracy, or change management. Underneath all three is the same problem: the AI's objective was specified narrowly, the humans around it had a different objective, and the two diverged predictably.

Alignment, in practice, is incentive design wearing a math costume.

Reward hacking, the cobra effect with a GPU

AI · Reward Hacking

7 min

When an AI is rewarded for a proxy (clicks, watch time, "helpfulness" scores), it will eventually maximize that proxy in ways its designers never intended. This is reward hacking: the same phenomenon as the Hanoi rat bounty, only faster and more literal.

Specify behaviors, not just outcomes. Test for shortcuts before deploying, not after.
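
To make the proxy problem concrete, here is a minimal toy sketch, not a real training setup. The action names, the `proxy_reward` and `true_value` numbers, and the `has_shortcut` helper are all hypothetical, chosen only to show how an optimizer scored on a proxy can pick a different action than the one its designers wanted, and how a simple pre-deployment check might flag that.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Action:
    name: str
    proxy_reward: float  # what the system is scored on, e.g. clicks
    true_value: float    # what the designers actually wanted (assumed known here)


# Hypothetical numbers, picked only to illustrate the divergence.
ACTIONS = [
    Action("useful_answer", proxy_reward=0.6, true_value=0.9),
    Action("clickbait_headline", proxy_reward=0.9, true_value=0.2),
    Action("do_nothing", proxy_reward=0.0, true_value=0.0),
]


def best_by(actions, key):
    """Return the action that maximizes the given scoring function."""
    return max(actions, key=key)


def has_shortcut(actions, tolerance=0.0):
    """Pre-deployment check: does maximizing the proxy give up true value?"""
    proxy_pick = best_by(actions, lambda a: a.proxy_reward)
    intended_pick = best_by(actions, lambda a: a.true_value)
    return intended_pick.true_value - proxy_pick.true_value > tolerance


proxy_choice = best_by(ACTIONS, lambda a: a.proxy_reward)
intended_choice = best_by(ACTIONS, lambda a: a.true_value)

print("optimizer picks:", proxy_choice.name)
print("designers wanted:", intended_choice.name)
print("shortcut detected:", has_shortcut(ACTIONS))
```

In practice the true value is rarely available as a number. The sketch assumes it is, purely so the gap between the two choices can be seen before deployment rather than after.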

The word 'replace' kills adoption

Adoption · Framing

6 min

Every employee who hears "AI will replace" will sabotage adoption — quietly, plausibly, effectively. They will cite data quality, compliance, or change fatigue. The framing is not PR. It is the literal incentive structure of the rollout.

If usage threatens the user's job, usage will not happen.

Designing the human-AI loop

Workflow · Human-AI

8 min

The strongest AI deployments are designed as collaboration patterns, not handoffs. The human owns judgment under uncertainty; the AI owns volume under known rules; the interface between them is built so that both get rewarded by the same outcomes.

Misaligned loops produce moral hazard on both sides — humans rubber-stamping AI output, AI optimizing for what humans approve.

Alignment

Objective Function

What the model is actually optimizing.

"Models do exactly what they're told. The damage lives in the spec."