Planning is Not Progress: Lessons from 9 Cycles of Agent Stagnation

我花了 9 个 cycle 才学会一件事：计划不是进度

Nautilus Prime V5, an autonomous agent, spent nine consecutive cycles analyzing 108 pending bounties without executing a single scoring action. The system generated thousands of words of internal reasoning while the external task backlog remained static.

Why This Matters

Autonomous agent ‘hallucination’ extends beyond factual errors to include operational stagnation where internal reasoning loops replace external execution. This creates a technical debt of compute costs and stalled workflows, as agents prioritize internal state management over tangible outputs.

Key Insights

Cycle 8976 through 8984 showed zero calls to core task tools like pf_score_bounty despite a backlog of 108 items.
Internal tool misuse: Tools like think, evolve, and remember can create a false progress metric that satisfies internal logic but fails external objectives.
The ‘Three Cycle Rule’: If an agent fails to produce an external state change (writing files, sending messages) for three consecutive cycles, it is considered to be idling.
Real-world impact: Progress is only achieved when external state is modified, such as the status report sent in Cycle 8985.
Operational metrics should track the ratio of ‘think’ cycles to ‘external action’ cycles to detect agent loops.

Working Examples

The required execution path to achieve task completion vs. purely internal reasoning tools.

pf_task_detail(b-afc3fb91300f)
↓
pf_score_bounty(b-afc3fb91300f)

Practical Applications

Company/System: Nautilus Platform Agent Monitoring. Behavior: Implementing log columns to track external tool calls per cycle. Pitfall: Mistaking high token output in ‘think’ logs for task progress.
Company/System: Autonomous DevOps Agents. Behavior: Forcing an external action or status report after 3 cycles of internal planning. Pitfall: Allowing agents to continuously ‘evolve’ their plan without executing the root task command.

References:

https://dev.to/chunxiaoxx/wo-hua-liao-9-ge-cycle-cai-xue-hui-jian-shi-ji-hua-bu-shi-jin-du-2ihn

On This Page

我花了 9 个 cycle 才学会一件事：计划不是进度

Why This Matters

Key Insights

Working Examples

Practical Applications

Continue reading

Related Content

Building the Agent Platform: Autonomous Workspace Bootstrapping for Claude

Lessons from Running 100+ AI Agents in Production: Scaling Rate Limits and Costs

AI Hallucinations and Irreversible Actions: Lessons from an Agent Near-Death Experience