Inside the Claude Code Leak: Deconstructing Anthropic's 510,000-Line AI Agent Architecture

51万行源码泄露：全面解构 Claude Code 如何成为 AI 编程天花板

Security researcher Chaofan Shou discovered that Anthropic’s Claude Code (v2.1.88) npm package inadvertently included a 60MB source map file. This exposure revealed 512,000 lines of TypeScript source code across 1,903 files, providing an unprecedented look into high-tier AI agent design.

Why This Matters

Most AI tools are simple wrappers around LLM APIs, but the Claude Code leak proves that production-grade agents require complex “OS-level” infrastructure to manage safety and state. The architecture handles the reality that LLMs are inherently unreliable by implementing a 14-step tool validation pipeline and separate “Verification Agents” that explicitly try to break code rather than just confirming it, ensuring high-signal reliability over model hallucinations.

Key Insights

Token Economy: Claude Code uses a SYSTEM_PROMPT_DYNAMIC_BOUNDARY to separate static instructions from dynamic context, allowing Anthropic to optimize API caching and reduce costs significantly (2026).
Multi-Agent Swarm: The system deploys role-specific agents, including a Read-only ‘Explore Agent’ and a ‘Plan Agent,’ to prevent destructive actions like unauthorized deletions during the discovery phase.
Biomimetic Memory: The KAIROS ‘dreaming’ mode distill raw session logs into structured user preferences and project backgrounds during low-activity periods to prevent token explosion.
Fail-closed Security: The system’s tool factory defaults isConcurrencySafe and isReadOnly to false, requiring explicit developer override to mitigate risks in the 14-step tool execution pipeline.
Verification Logic: A dedicated Verification Agent is programmed with a ‘try to break it’ philosophy, performing mandatory builds, linter checks, and adversarial probes rather than relying on the coding agent’s self-assessment.

Working Examples

The restored directory structure of Claude Code v2.1.88 source code.

restored-src/src/
├── main.tsx # CLI 入口
├── tools/ # 工具实现（Bash、FileEdit、Grep、MCP 等 30+ 个）
├── commands/ # 命令实现（commit、review、config 等 40+ 个）
├── services/ # API、MCP、分析等服务
├── utils/ # 工具函数（git、model、auth、env 等）
├── context/ # React Context
├── coordinator/ # 多 Agent 协调模式
├── assistant/ # 助手模式（KAIROS）
├── buddy/ # AI 伴侣 UI
├── remote/ # 远程会话
├── plugins/ # 插件系统
├── skills/ # 技能系统
├── voice/ # 语音交互
└── vim/ # Vim 模式

Practical Applications

Use Case: Implementation of ‘Verification Agents’ in CI/CD pipelines to run mandatory builds and curl tests instead of trusting LLM-generated summaries. Pitfall: Using ‘Lazy Delegation’ where a main agent gives vague instructions like ‘fix the bug’ without line numbers, causing sub-agent failure.
Use Case: Utilizing background ‘dream’ sessions to compress long-term project memory into structured Markdown. Pitfall: Blind editing where an agent attempts FileEdit without a preceding FileRead, which Claude Code’s governance pipeline explicitly blocks.

References:

On This Page

51万行源码泄露：全面解构 Claude Code 如何成为 AI 编程天花板

Why This Matters

Key Insights

Working Examples

Practical Applications

Continue reading

Related Content

Building an Autonomous Agent for Dwarf Fortress: Architecture and LLM Integration

Inside the Claude Code Leak: Unreleased Features and Architectural Secrets

Anthropic Quantifies Expertise Multiplier; Practitioners Build Agent-Side Control Plane