
Claude Code /goals Separates Agent Execution from Verification
Anthropic has introduced /goals for Claude Code, a feature that formally separates task execution from task evaluation by deploying a second model to verify whether an agent has actually completed its work. The problem it solves is real: production AI agents often declare tasks finished prematurely, leaving incomplete work undetected until later. OpenAI, Google, and LangChain offer similar evaluation patterns, but require developers to build custom logic, whereas Claude Code makes independent evaluation the default behavior.






































