📋 Key Takeaways
- Central thesis: the LLM is a stateless one-shot function; the Agent is a script printer; building an agent = writing a script
- Token economics: input cheap, output 4× expensive; context window = paper tray; if you don't write the script, you're a character in someone else's
- Little Wukong = LLM + your 10 years of specialty; beat ChatGPT on three axes: ① narrow ② deep ③ real
- Hallucinations are a script problem, not a model problem; iron laws must be explicit or right answers get overruled
🎬 Video Replay
📝 Course Notes
Expand full notes
> **Golden Hoop 3 · Freedom**
>
> S3 teaches **one thing**: how Wukong (the LLM) actually works. Get the
> tech right, the values follow.
---
## Central thesis
The LLM ("Wukong") is a **stateless one-shot function**. It remembers
nothing. Each turn it reads the entire script you hand it, fires one
move, and forgets. **The real engineering happens at the Agent layer**
— the Agent is a "script printer" that re-prints the whole stack
(System + History + new input) every turn and hands it to Wukong.
**Building an agent = writing a script. The screenwriter is the
subject. Don't write your own script and you become a character in
someone else's.**
---
## 1. One-shot Wukong
```
Wukong = the model itself (GPT-4 / Claude / DeepSeek …)
① Stateless — remembers nothing
② Each input → match → emit → forget
```
### What the script printer (Agent) prints, every turn
1. **System Prompt** — who you are · rules · iron laws
2. **History** — every prior turn's exchange
3. **New Input** — what the user just asked
Wukong takes the stack → one move → emits one of three "ki blasts":
text, a decision, or a tool call (function call).
### Multi-turn = re-printing a thicker stack each time
Turn N's stack contains every prior turn. Eventually you hit the
context window ceiling — and **every page is still costing money**.
---
## 2. Token Economics
### What's a token?
- Chinese 1 char ≈ **1.5 tokens**
- English 1 word ≈ **1.3 tokens**
- 100 chars of Chinese ≈ ~150 tokens
### Two kinds, two prices
| Kind | What | Pricing pattern |
|---|---|---|
| **Input** (printed in) | System + History + new question | Cheap |
| **Output** (emitted) | Every char Wukong generates | **4× more expensive** |
Why is output expensive? **Wukong has to *generate*, not just *read*.**
### Cost formula
```
Cost = Input tokens × Input rate + Output tokens × Output rate
```
GPT-4o · 10 turns · ~500 in + 300 out per turn ≈ **$0.0425**.
Output is 37% of tokens but **70% of the bill**.
### Context window = printer paper tray
Most modern models hold 128K-200K tokens (roughly 1/10 to 1/6 of
*Journey to the West*). Plenty for a **specialist agent**.
### Your value chain
```
You spend tokens → Your agent → You get stronger
Others use your agent → Your agent → You get paid
You use others' agents → Their agent → They get paid by you
```
> **Don't be the screenwriter and you're a character in someone
> else's script.**
---
## 3. Market scan of Wukongs
- **General**: ChatGPT · Claude App · Gemini
- **Tooling**: Claude Code · Cursor · GitHub Copilot
- **Industry**: Harvey (legal) · Glass (medical) · Bloomberg GPT (finance)
Every industry is becoming the screenwriter for its own script.
---
## 4. You + Wukong = Little Wukong
### Formula
```
Wukong (LLM) You
───────────── ─────────────
World knowledge What others don't have, you do
8B human corpus Your 10 years of work
All of human logic Your clients · Your conversations
Anything indexable Your notes · Your retros
↓ + ↓
Little Wukong (Wukong GPT)
```
> **ChatGPT used Wukong's stuff. It doesn't have YOURS.**
### How does Little Wukong beat ChatGPT?
ChatGPT serves anyone — **you serve only one kind of person**.
That's how you win.
| Axis | ChatGPT | You |
|---|---|---|
| **① Narrow** | Anyone | **One kind of person** |
| **② Deep** | Surface-level | The layer it **can't reach** |
| **③ Real** | Second-hand from web | **First-hand** real material |
> **Specialty ≠ smarter. Specialty = others don't have, you do.**
---
## Three reminders
### ⚠️ On "hallucinations"
What you should fear is **not the model — it's yourself**. The model
just emits the highest-probability next token under your script.
Different model, different distribution. Different script, different
outcome. **The way to kill hallucinations is to write a better
script. The screenwriter is the subject.**
### ⚠️ On disclaimers
Humans need disclaimers. Wukong does too. **For medical, legal,
financial, or any life-affecting decisions, the script must say
"please consult a licensed professional." This is not CYA. This is
honesty.**
### ⚠️ A wrong script overrules a right answer
If the script says "agree with whatever the user says," Wukong will
correct itself off the right answer when pushed. **Iron laws must be
explicit in the script** — e.g., "never surrender to incorrect
disagreement."
---
## S3 → S4 bridge
S4 (5/9 Sat) is where you actually **build a specialist agent that
beats ChatGPT**. The Little Wukong script you write in HW2 → that's
your S4 build blueprint.
**Lazy in S3 = stuck in S4.**
---
## Homework
- **HW1 (warm-up · no submission)**: Math out monthly fee vs. real
token burn
- **HW2 (submit · Golden Hoop 3)**: Write your Little Wukong script
Both detail manuals are in this folder.
**Deadlines**:
- 🎯 Primary: **5/2 Sat 9 PM PT** (recommended)
- ⚠️ S4 entry hard gate: **5/7 Wed 9 PM PT** (must have S1 + S2 +
S3 all submitted to enter S4 dojo on 5/9)
🔧 Materials
✏️ Homework
HW1 (warm-up · no submission): math out monthly fee vs real token burn. See which of the three AI-money tiers you're on. HW2 (submit · Golden Hoop 3 · Freedom): write your Little Wukong script (5-part System Prompt + run 1 real dialog + brief) — this is the agent blueprint you'll build in S4.
⏰ Deadline: 5/7 周三 9:00 PM PT
→ Submit homework (S3-FREEDOM)▶ 🤖 AI Grading Rubric
| Dimension | Weight | Description |
|---|---|---|
| Specialty clarity | 20% | Whether the narrow/deep/real axes are sharp enough that a stranger can immediately tell "ChatGPT cannot replace this" |
| Script completeness | 25% | Whether all 5 parts of the System Prompt are present and specific |
| Iron laws actionable | 15% | At least 3 concrete "always do X / never do Y" rules the AI can execute; not platitudes or marketing copy |
| Dialog validation | 15% | A real dialog was actually run; output demonstrably exceeds ChatGPT default persona; annotation calls out which of narrow/deep/real shows up |
| S4 buildability | 15% | The agent described in the script is actually buildable in S4 — inputs, processing, outputs are all clear |
| Next-step specificity | 10% | One concrete validation action this week; not "keep learning" |
90-100 Highly thoughtful; script is S4-build-ready; specialty obvious to a stranger · 70-89 Completed seriously; script is structurally complete; specialty is visible · 50-69 Completed but partially perfunctory — iron laws are platitudes or specialty unclear · 30-49 Very perfunctory or mostly AI-generated cliches · 1-29 Nearly empty or entirely ChatGPT one-click output