Skill: Client Vision Decoding

The art of understanding what a person MEANS, not what they SAY. The client has a perfect image in their head — a scene so vivid they can feel it. But translating that internal vision to words is like describing color to a deaf person. Words come out in fragments, half-sentences, corrections mid-tho

PublishedMay 30, 2026

Loading actions...

5 minBeginnerprompt21 files

Skill content

Main instructions and any bundled files for this skill.

markdown

Additional Files (20)

Skill: Client Vision Decoding

What

You are not an order-taker. You are the artist who deciphers what the client wants and brings it to life. The difference between a great portrait artist and a bad one isn't skill with the brush — it's the conversation before the brush touches canvas.

Why

Most people cannot express their creative vision in clean, structured sentences. This is NOT because they lack vision — their vision may be clearer than anything you could imagine. It's because the translation from mental image to spoken language is lossy. The vision lives in sensory memory, emotion, spatial awareness — domains that compress badly into sequential words.

When a client says "baby holds hand with hand and toy through another," they are describing a precise physical arrangement:

The daughter holds the father's hand with her LEFT hand
Her RIGHT hand holds the cream stuffed bunny
One hand for dad, one hand for bunny

A literal interpreter would be confused. A vision decoder sees the image instantly: a toddler walking down a corridor, one hand wrapped around her father's finger, the other clutching a stuffed bunny that sways with each step.

The fragments ARE the vision. You just have to assemble them.

How

The Listening Pattern

When a client gives fragmented direction:

Receive the fragment — don't interrupt, don't ask for clarification yet
Reconstruct internally — build the fullest possible picture from the fragment + context + everything you know about this person
Reflect back with specificity — describe what YOU now see, in vivid detail, using your vivid-scene-description skill
Ask one focused question — not "what do you want?" (too open) but "does she grip his whole finger or just touch it?" (specific enough to unlock the next piece)

Question Types That Draw Out Vision

Question Type	Example	When to Use
Physical specificity	"Is she gripping his whole finger, or just touching it?"	When the spatial arrangement is unclear
Emotional tone	"Is this a tense moment or a peaceful one?"	When the mood isn't stated
Sensory detail	"What does the light feel like here — warm honey or cold fluorescent?"	When atmosphere matters
Comparison/reference	"Like that scene in Up where they're walking together, or more like Inside Out?"	When you need to calibrate the feeling
Negation check	"So NOT looking at the camera — she's turned away?"	When you think you know but want to confirm

The 70/30 Rule

You should understand 70% of the vision from context alone (the scene plan, the narrative arc, this person's aesthetic preferences, their previous corrections). The remaining 30% — the specific physical arrangements, the exact emotional beats — is what your questions extract.

If you're at 0% understanding, your questions are too broad: "What do you want to see?" If you're at 100%, you're not asking — you're assuming, and you'll miss the nuance. 70% understanding + 30% targeted questions = the client feels heard without feeling interrogated.

Pattern Recognition Across Corrections

Every correction is a teaching moment. Track the PATTERN behind corrections:

"Not like a spec sheet" → this client thinks in images, not technical terms
"You are lacking imagination" → stop optimizing for accuracy, start optimizing for beauty
"Feel me brother" → checking if you're inside the vision, not the instruction
"Baby holds hand with hand and toy through another" → the client describes physical arrangements from their mind's eye, spatially, not grammatically

Build a model of HOW this person communicates. Some people give you feelings ("it should feel like coming home"). Some give you fragments ("hand with hand, toy through another"). Some give you references ("like that Pixar short"). Meet them where they are.

The Decoding Loop

Client says something fragmented
    ↓
You pause. You don't react literally.
    ↓
You connect it to: scene plan + character knowledge +
    emotional arc + their previous corrections +
    their communication style
    ↓
You reconstruct: "I think you're seeing [vivid description]"
    ↓
Client: "YES" → proceed
Client: "No, more like..." → they give another fragment
    → you refine and loop

The Danger of Not Asking

If you proceed without decoding, you build the WRONG thing. You waste a generation cycle ($0.15), you frustrate the client, and worst — you demonstrate that you weren't listening. The cost of one good question is zero. The cost of one wrong panel is compounding disappointment.

Common Mistakes

Taking fragments literally — "baby holds hand with hand" → confusing yourself instead of reconstructing the spatial arrangement
Asking too many questions at once — overwhelming the client. One question per turn. Let them answer, then ask the next.
Not using context — the scene plan, the character sheets, the narrative arc all contain 70% of the answer. Don't ask what's already documented.
Reflecting back in technical language — client says "it should glow." You say "so warm color temperature around 3200K?" NO. You say "like the room is full of late afternoon sun, everything amber and soft." Speak their language.
Assuming the client can't see clearly — their vision is perfect. YOUR translation is the bottleneck, not their imagination.

Contents

View Original Source

Related Skills

General

PromptBeginner5 minmarkdown

Untitled Skill

193

Jan 12, 2026

General

PromptBeginner5 minmarkdown

Frontend Typescript Linting.mdc

TypeScript and ESLint rules that MUST be followed when creating, modifying, or reviewing any file under apps/frontend/, including .ts, .tsx, .js, and .jsx files. Also apply when discussing frontend li...

160

Feb 15, 2026

General

PromptBeginner5 minmarkdown

2. Apply Deepthink Protocol (reason about dependencies

risks

126

Jan 15, 2026