Agent Operator
For builders using Claude Code, Codex, Cursor, Windsurf, Hermes, Devin, or Copilot to ship real projects.
- Control context and scope.
- Demand concrete artifacts.
- Use evals before trusting output.
A serious field curriculum for people using Claude Code, Codex, Cursor, Windsurf, Devin, Hermes, or any coding agent to ship AI systems. Every module ends in an artifact your agent can inspect, test, and improve.
final_project: ship a scoped AI workflow from vague request to production handoff.
Field thesis: most AI projects fail because the operator lets the agent code before the problem is scoped, measured, evaluated, and owned. FDE Academy teaches the operating discipline that prevents that failure.
The same method serves different people. The Academy should help each user see exactly why they belong here.
For builders using Claude Code, Codex, Cursor, Windsurf, Hermes, Devin, or Copilot to ship real projects.
For consultants and operators who need a repeatable delivery method for client-facing AI work.
For contributors who want to improve the open-source Skill, templates, scripts, examples, and future MCP layer.
This is not “watch videos and feel inspired.” You pass by producing artifacts that survive the FDE rubric.
Understand what the Skill is, what it is not, and how to keep agents in co-founder mode instead of autocomplete mode.
Pass artifact: local install plus one rejected vague prompt rewritten into an FDE request.
Scoping, prototyping, production, and feedback become the backbone of every agent-assisted engagement.
Pass artifact: one-page field operating memo with the expected outcome, owner, timeline, and artifact chain.
Research market, pains, regulations, tech stack, benchmarks, recent news, and talent before discovery.
Pass artifact: domain dossier with 5 insights that change the questions you ask.
Force the process, output, data, error cost, current system, and success metric before allowing architecture.
Pass artifact: completed 6-Q sheet with quantified answers and named assumptions.
Combine stakeholder map, pain matrix, ROI, risks, architecture sketch, and 90-day plan into one decision artifact.
Pass artifact: 5-page scoping report using the repo template and conservative ROI threshold.
Generate candidate architectures, validate them against held-out constraints, prune weak paths, and preserve rejected hypotheses as reusable lessons.
Pass artifact: prototype spec with architecture diagram, failure modes, held-out validation results, data flow, and testable hypothesis.
Design golden cases, adversarial cases, regression checks, human review rules, metrics, and drift alerts.
Pass artifact: eval framework with target metrics, test categories, fail cases, and owner.
Cover deployment, rollback, observability, security, runbooks, cost projections, ownership, and incident paths.
Pass artifact: production handoff that another team could operate tomorrow.
Extract repeated patterns into templates, scripts, adapters, examples, benchmarks, and eventually MCP tools.
Pass artifact: productization memo ranking reusable assets by effort, ROI, and strategic fit.
Each artifact maps directly to a template or script in the Skill repository.
Industry facts, stakeholder map, 6-Q answers, pain matrix, ROI, risks, GO/NO-GO recommendation.
Repo templatesArchitecture candidates, selected stack, data flow, failure modes, eval set design, definition of done.
Eval articleDeployment, rollback, observability, security, cost projections, ownership, runbook, incident path.
Handoff outputsReusable IP candidates, extraction effort, product value, field insights, roadmap impact, open-source contribution path.
Generic work fails. A passing artifact must score at least 3 on every trait, with Ownership and Decomposition at 4 or higher.
| Trait | What excellent looks like | Auto-fail signal |
|---|---|---|
| Customer Curiosity | Specific domain reality and stakeholder context. | Generic “AI can help” language. |
| Ownership | Concrete outcome, owner, date, and timeline. | “Could”, “maybe”, “explore”. |
| Decomposition | 6-Q answered with numbers. | Vague restatement of problem. |
| Empathy | Adoption, politics, maintenance, and constraints named. | Ignores the customer’s operating reality. |
| Product Sense | Shippable artifact, failure modes, and production path. | Slides, theory, or no runbook. |
| Communication | Executive summary plus technical detail. | Jargon or oversimplification. |
Choose one realistic AI workflow and run the full loop. You do not need to build the entire production system to pass, but the artifact package must make implementation obvious, measurable, and ownable.
Install the Skill, then use the Academy as the training path. The first prompt should not ask the agent to code. It should ask the agent to scope.
# From the repository root mkdir -p ~/.claude/skills ln -s "$(pwd)/skill" ~/.claude/skills/fde-consultant # First Academy exercise /fde-consultant turn this vague AI idea into a 6-Q scoping interview: "We want an AI agent for customer support operations."