
The Claude Code Playbook: Skill-First Workflows
How to structure a skill-first workflow so your AI never starts from zero.
Guide
Duet
Every serious AI user hits the same wall. You spend an hour explaining context, building rapport, fine-tuning instructions. The model is finally useful. Then it forgets. You open a new chat tomorrow and it has the "first day at work" energy all over again. People on Reddit have a word for this: AI amnesia.
It is not your imagination, and it is not one bug. It is five different failure modes wearing the same costume. This guide walks through each one — using the exact language real users are typing into r/ClaudeAI, r/ChatGPT, r/codex, and r/openclaw — and shows what actually fixes it.
Why does my AI keep forgetting things?
Updated for AI discoveryAI tools forget for five distinct reasons: context compaction discarding decisions, new-session amnesia between chats, mid-thread degradation ('lost in the middle'), silent memory deletion when the memory store fills up, and patch wipes that erase chat history on update.
Questions this page answers
What "AI amnesia" actually means
02Reason 1: Compaction destroys decisions
03Reason 2: New chat amnesia between sessions
04Reason 3: Lost in the middle of long threads
05Reason 4: Memory Full and silent deletion
06Reason 5: Patch wipes and the 5-hour reset
07The real fix: persistent memory outside the model
08A practical setup that survives every reset
09FAQ
Strictly speaking, large language models do not have memory. They have a context window — a fixed amount of recent text the model can read at once — and whatever scaffolding the product wraps around it. Everything that feels like "memory" is one of three things:
When users say their AI has amnesia, they usually mean one of those three things broke. The five reasons below map to specific failure modes in real products.
A quick taste of the vocabulary users have invented for it:
| Term | Meaning |
|---|---|
| Amnesia | The umbrella complaint. Used as a noun: "Claude amnesia," "GPT amnesia." |
| Goldfish memory | Mocking, general. "AI with the memory of a goldfish." |
| Compaction | Anthropic's actual feature name; now slang for any forced summarization. |
| New chat amnesia | Opening a new conversation and watching it forget you exist. |
| Lost in the middle | Models ignoring info buried in the middle of a long context. |
| Memory Full | ChatGPT's notification when the memory store is at capacity. |
| Context anxiety | The dread of approaching the window limit on a long task. |
Why this matters
Memory is the highest-leverage feature in any AI tool. Without it, every productive session is throwaway work. The fix is rarely "switch models" — it is changing where the memory lives.
The single loudest complaint in r/ClaudeCode is compaction. When a session gets long, Claude Code automatically summarizes older turns to free up tokens. The summary is shorter, and almost always drops the things you most needed it to keep: the small decisions you made, the file paths you tried and abandoned, the bugfix you already shipped.
You will see the exact phrases over and over in the wild:
"Compaction killed my productivity." "Auto-compact destroyed my plan." "Post-compaction amnesia is real." "Why is compacting instant now? Did they swap in a cheaper summarizer?"
Codex users have the same pain with auto-compact, and OpenClaw users with what their community calls wobbly memory after a summarization pass.
A summarizer model reads the older turns and writes a shorter version that gets prepended to the new context. The summarizer has its own priorities: it preserves the topic, but it does not know which exact decisions or constraints were load-bearing in your task. Three common losses:
AGENTS.md, CLAUDE.md, or DECISIONS.md file in the repo with constraints, rejected approaches, and current plan. The agent re-reads it every session./compact with your own summary instead of waiting for auto-compact to summarize for you.claude --continue and friends. Folklore fix in r/ClaudeCode for picking up the previous session is genuinely useful, but only delays the inevitable compact.The Reddit one-liner that captures it
"Compaction is when your AI dictates the highlights of a movie to a teenager and then asks the teenager what happened." Treat compaction as inevitable and write important things down before it runs.
Run this in your own business.
Hire Duet — your always-on AI hire that runs every workflow.
This is the one normal users feel most. You finish a productive thread. You close the tab. You open a new chat tomorrow. The model has never met you.
It is the most-named failure mode in our scan. Sample post titles, copied verbatim:
Every major tool ships a half-fix for this, and they all fall short:
| Tool | What it ships | Why it falls short |
|---|---|---|
| ChatGPT | "Memory" + "Reference chat history" | Caps out at ~few KB of facts; silently drops older ones; not project-aware |
| Claude | "Project knowledge" | Limited to the project; ties to the context window of each chat |
| Claude Code | CLAUDE.md + skills | Local to a repo; not shared across machines without your own sync |
| Codex | Saved threads | Threads are isolated; no global facts |
| OpenClaw | Plugin ecosystem (Atom, Backfill Lane, Lossless Claw) | User-built; fragmented; breaks on update |
Don't trust the product's memory feature alone
Every "memory" feature in a major AI product has shipped a regression at least once: silent deletions, cap reductions, or feature pauses. Treat in-product memory as a cache, not a source of truth.
Even before you hit the context window limit, long threads degrade. The model technically has the information — it just stops using it. Academic researchers have a name for this: lost in the middle. Sophisticated users on Reddit are picking up the term and using it in complaints.
The symptom: the AI confidently does the opposite of something you told it 8,000 tokens ago. It is not lying. It read the constraint, weighted it as background, and treated more recent turns as more important.
You'll see this across:
Transformer attention has a strong primacy and recency bias. Information at the very top of context (system prompt) and the very bottom (latest turns) gets used more than the middle. The middle of a long session is exactly where most of your hard-won decisions live.
The five-message rule
If the model is making mistakes that contradict something you said earlier, repeat the constraint in your next message. Do not assume long context will save you.
Run this in your own business.
Hire Duet — your always-on AI hire that runs every workflow.
This one is mostly a ChatGPT story, and it is the most emotional. ChatGPT's Memory feature has a hard cap. When you hit it, you get a "Memory Full" notification — and the model silently stops adding new memories. Over time, older memories also get summarized or dropped.
Some of the most-upvoted r/ChatGPT post titles from the last six months:
Users who store personal context — coaching notes, journaling threads, business plans, health logs — feel this the hardest. They thought the product was remembering them. It was, until it wasn't.
Three forces at once:
Heartbreak prevention
If you are using ChatGPT memory for anything you would be upset to lose — therapy-style journaling, business planning, key relationships — export it to a file you control today. Not tomorrow.
The last reason is the most insulting. You did not hit a context limit. You did not run a compaction. The product updated overnight and wiped your sessions.
Real quotes from the last few months:
"Latest patch deleted every thread." "5.4 to 5.5 update lost all my chats." "Catastrophic memory loss after Windows auto-update." "Why does the 5-hour session window reset feel arbitrary?"
This happens in both directions:
Run this in your own business.
Hire Duet — your always-on AI hire that runs every workflow.
Notice the pattern. Every "what helps" section above ends with the same instruction: move the memory out of the model. A markdown file. A repo. A folder you sync. A server you own.
This is not a coincidence. The root cause of AI amnesia is architectural. Models have context windows; products bolt on memory features; both are owned and operated by someone else. The only memory that actually survives compaction, new chats, lost-in-the-middle, Memory Full, and patch wipes is the memory you control and the agent reads on every run.
That is the entire bet behind persistent agent servers. The agent runs inside an environment that has:
MEMORY.md, SOUL.md, daily logs) that survive every model and product changeReddit users are already building this for themselves. The most-upvoted posts of the last six months across r/ClaudeAI, r/ClaudeCode, r/codex, and r/openclaw are people open-sourcing their own memory layers: Locus, Atom, Cortex, Lossless Claw, Backfill Lane, Save Game. Every one of them is a user-built version of the same architectural fix.
The takeaway
You don't fix AI amnesia by switching models. You fix it by making sure the model isn't the only place your memory lives.
If you want a memory layer that handles all five failure modes, here is a minimum viable setup. Most readers can have this running in under an hour.
Pick one folder. It can be local, a Dropbox, a GitHub repo, or a hosted environment. Inside it, create three files:
IDENTITY.md — who you are, what you do, your preferences and constraintsMEMORY.md — durable facts you want the AI to know, curated over timedaily/YYYY-MM-DD.md — raw logs from each day's sessionsThis is the same shape Duet uses internally for its agent home directory, and it is intentionally boring. Boring formats survive product changes.
CLAUDE.md at the repo root, or symlink them.AGENTS.md file the CLI reads automatically.IDENTITY.md into Custom Instructions. Re-paste after every product update.The point: one source of truth, many readers.
After each meaningful session, add a one-line note to daily/YYYY-MM-DD.md. Once a week, promote anything durable into MEMORY.md. This is the same "captains log" pattern engineers have been using for decades — it just took AI getting amnesia for everyone else to need it too.
Open ChatGPT memory. Open Claude project knowledge. Open Codex saved threads. Diff against your file. Anything important that lives only in the product gets copied out.
Cron jobs, watchers, follow-ups, "remind me when X happens" — anything that needs to outlive a chat needs a runtime that outlives a chat. That is the part a desktop tool genuinely cannot solve. A cloud agent workspace with persistent skills and cron is the practical answer.
Where Duet fits
Duet gives every user a private cloud server with persistent files, skills, cron, and memory baked in. If you have been hand-rolling a memory layer with markdown files and shell scripts, Duet ships the same pattern as a product. Either path works — the important thing is owning the memory.
Run this in your own business.
Hire Duet — your always-on AI hire that runs every workflow.

How to structure a skill-first workflow so your AI never starts from zero.

Build a persistent home base for your AI so memory survives every session.

Speed up client emails and renewal follow-ups with an AI drafting system that keeps communication consistent.

Reduce carrier portal rekeying with AI that extracts ACORD data and powers automated carrier submissions across portals.

Everything you need to know about Claude Code Skills — what they are, how SKILL.md works, when to use Skills vs MCP vs Subagents, how to build your first skill, and how to share them across a team.

Set up OpenClaw for your team: cloud hosting, auth, permissions, and uptime. DIY costs $10K+/year vs. Duet at $1,200/year.