Web Development - Friday, 8 May 2026

Why Your AI Agent Keeps Breaking and How to Fix It

Most developers treat project documentation as something you write after the code works. Jas Randhawa's approach to CLAUDE.md flips this: write the specification first, as if you're programming the AI itself. The difference shows up when your agent stops hallucinating and starts shipping.

The guide treats CLAUDE.md not as documentation but as an execution specification. That's a meaningful distinction. Documentation describes what exists. A specification defines what should happen. When you're working with AI agents that autonomously write code, the specification IS the constraint that keeps them from wandering off into architectural chaos.

The structure Randhawa lays out covers four critical areas: repository identity, tool permissions, memory architecture, and anti-patterns. Each one addresses a failure mode that shows up when agents operate at scale.
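
As a rough sketch, a file with those four sections might be skeletoned like this (the headings are illustrative, not necessarily Randhawa's exact wording):

```markdown
# CLAUDE.md

## Repository Identity
<!-- What this project is, the stack in use, and what must not be touched -->

## Tool Permissions
<!-- What the agent may read, modify, and execute, and what needs approval -->

## Memory Architecture
<!-- Task context, change log, and current blockers -->

## Anti-Patterns
<!-- Known failure modes, each with a wrong and a corrected example -->
```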

Repository Identity - Teaching an Agent Where It Lives

An AI agent needs to know what it's working on. Not just "a web app" but the specific stack, dependencies, folder structure, and constraints. The identity section of CLAUDE.md defines this explicitly: what languages are in use, what frameworks, what the file tree looks like, and what the agent is NOT allowed to touch.

Without this, agents make assumptions. They'll scaffold a React component in a Vue project. They'll import libraries that don't exist. They'll rewrite config files they should leave alone. Defining identity upfront prevents this - not by hoping the agent remembers, but by making the boundaries explicit in every interaction.
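
A minimal identity section in that spirit might read as follows - the project details here are invented for illustration:

```markdown
## Repository Identity
- Stack: Vue 3 + TypeScript, built with Vite, Pinia for state
- Package manager: pnpm (never npm or yarn)
- Layout: src/components, src/stores, src/router, tests/
- Do NOT touch: vite.config.ts, .github/workflows/, any .env file
- This is a Vue project. Never scaffold React components or import React libraries.
```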

Tool Permissions - Saying No Before It's Too Late

Agents with unrestricted tool access do expensive, dangerous things. They'll run database migrations in production. They'll install packages without checking compatibility. They'll delete files to "clean up" and take your deployment config with them.

The tool permissions section defines exactly what the agent can do: which files it can read, which it can modify, which commands it can run, and which operations require explicit human approval. This isn't about limiting capability - it's about preventing the 3 AM incident where an agent decided to "optimise" your authentication logic and locked everyone out.

Randhawa's examples are specific. Not "be careful with database operations" but "you may read from the users table, you may NOT write to it, you must ask before running migrations". The precision matters. Agents follow literal instructions. Vague guidelines get interpreted in creative ways.
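
Translated into a CLAUDE.md section, that level of precision might look like this sketch (the paths, commands, and table names are hypothetical):

```markdown
## Tool Permissions
- Read: src/**, tests/**, package.json
- Write: src/** and tests/** only
- Never write: migrations/**, deploy/**, any .env file
- Run freely: pnpm test, pnpm lint, pnpm build
- Ask first: pnpm db:migrate, git push, anything touching production
- Database: you may read from the users table; you may NOT write to it
```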

Memory Architecture - How an Agent Tracks State

Agents working on multi-file changes need memory. They need to remember what they modified three steps ago, what the goal of the current task is, and what constraints they're operating under. Without explicit memory architecture, they forget mid-task and either repeat work or contradict their own changes.

The CLAUDE.md spec includes sections for task context (what are we trying to achieve?), change log (what have we modified?), and blockers (what's preventing progress?). These aren't nice-to-haves. They're the difference between an agent that completes a feature and one that gets halfway, loses the thread, and starts over.
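
A sketch of those sections, with placeholder content standing in for a real task, could look like this:

```markdown
## Memory

### Task Context
Building the password-reset flow; emails go through the existing mailer service.

### Change Log
- Added src/auth/reset.ts (token generation and validation)
- Modified src/routes/auth.ts (new /reset endpoint)

### Blockers
- Awaiting approval to run the migration that adds the reset_tokens table
```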

Memory is especially critical when agents span multiple sessions. If you stop work and resume the next day, the agent needs to pick up where it left off - not rehash decisions already made or undo work already completed. The memory section provides that continuity.

Anti-Patterns - Where Agents Fail at Scale

This is the section that separates theoretical agents from production ones. Randhawa documents failure modes that show up when agents run unsupervised: hallucinating dependencies, over-abstracting simple logic, ignoring error handling, and the classic "works in development, crashes in production".

Each anti-pattern includes an example of what NOT to do and a corrected version showing the right approach. It's training data for the agent, embedded directly in the project. When Claude sees "do not create wrapper functions for single-use operations", it's less likely to turn every function call into a needless abstraction layer.
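
An entry in that format might look like the following - the TypeScript snippet is an invented illustration of the single-use-wrapper rule, not an excerpt from the guide:

````markdown
### Do not create wrapper functions for single-use operations

Wrong:
```typescript
// A needless abstraction around a single call site
function getUserById(id: string) {
  return db.users.findById(id);
}
const user = getUserById(currentId);
```

Right:
```typescript
// Call the operation directly until a second caller actually exists
const user = db.users.findById(currentId);
```
````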

The anti-patterns also address social failures - agents that commit incomplete work, agents that don't document their changes, agents that silently fail and leave the repo in a broken state. These aren't bugs in the model. They're gaps in the specification.

Why This Matters for Real Projects

Most AI coding tools assume you'll supervise every step. That works for small tasks. It breaks down when you're building something complex over days or weeks. At that scale, you need the agent to operate semi-autonomously - making decisions, writing code, testing, and iterating without constant oversight.

CLAUDE.md as an execution spec makes that possible. It's the difference between "write me a function" and "here's the system you're operating in, the rules you must follow, and the outcomes I expect". The first gets you a function. The second gets you a feature that actually works.

For developers already using AI agents in their workflow, this guide is worth implementing immediately. The full article is on Dev.to with examples and templates you can adapt. It's comprehensive, practical, and directly applicable to any project where an agent is writing code you'll actually ship.

The larger point: agentic coding isn't about prompting better. It's about architecting constraints. Give the agent clear boundaries, explicit memory, and a map of the system it's working in. That's when it stops being a novelty and starts being a tool you can actually rely on.

Today's Sources

GitHub Blog - Improving token efficiency in GitHub Agentic Workflows
arXiv cs.LG - SAT: Sequential Agent Tuning for Coordinator Free Multi-LLM Training
TechCrunch - OpenAI launches new voice intelligence features in its API
TechCrunch - Why you can never get your doctor to call you back
TechCrunch AI - Voi founders' new AI startup Pit raises $16M seed from a16z
arXiv cs.LG - Are Flat Minima an Illusion?
Phys.org Quantum Physics - Quantum metallurgy: Electron crystals deform and melt
arXiv Quantum Physics - Error Mitigation in Dynamic Circuits for Hamiltonian Simulation
arXiv Quantum Physics - Quantizing gravitational fields with an entropy-corrected action principle
Dev.to - The Perfect CLAUDE.md: A Practical Specification for Agentic Coding Projects
Hacker News - Mojo 1.0 Beta
DZone - How to Implement AI Agents in Rails With RubyLLM
Dev.to - Java Generics, Type Erasure, and Reified Generics as First-Class Citizens
Hacker News - Evaluating Geekbench 6
Hacker News - Blaise - A modern self-hosting zero-legacy Object Pascal compiler targeting QBE

About the Curator

Richard Bland
Founder, Marbl Codes

27+ years in software development, curating the tech news that matters.
