MASTERCLASS 2/8 · Structure

Anatomy of a video that holds.

A hook earns the first two seconds. Structure earns the next ninety. Every video that performs — yours included — follows the same five beats, then gets cut into six sizes from one shoot. Here's the blueprint.

① THE FIVE BEATS

Hook → Stakes → Payload → Proof → CTA

Memorise this order. Every strong video you have already follows it; every weak one skips a beat.

HOOK · 0–2s

The curiosity gap (see the Hooks masterclass). Spoken + on screen, frame one.

STAKES · 2–8s

Why it matters to them, now. Name the problem: "you'll waste money," "you'll quit frustrated," "you bought the wrong one."

PAYLOAD · the middle

Deliver on the hook. One idea, taught clearly. Rajesh explaining, product in hand. Resist cramming three points into thirty seconds.

PROOF · woven through

Show, don't tell. The swatch, the before/after, the finished piece, the shelf. Drop a credibility line ("10,000+ artists, all genuine").

CTA · last 5s

One ask. "Shop the kit," "Subscribe," "Visit the store." Said out loud, shown on screen, and on an end card.

② TIMING BY LENGTH

Same beats, different runtimes

Beat6s bumper15s30s60–90s
Hook0–2s0–2s0–2s0–2s
Stakes2–4s2–6s2–8s
Payload2–5s (one claim)4–11s (one point)6–22s (two points)8–70s (the lesson)
Proofwovenwoven18–26swoven + a hero shot
CTA5–6s11–15s26–30slast 5–8s + end card
⏱ Rule of thumb

The shorter the cut, the fewer points. A 6-second bumper makes one claim. A 30-second ad makes two. Never try to say everything — say one thing well.

③ FIGHTING THE DROP-OFF

The retention curve is a cliff. Build ledges.

Viewers leak away every second. Your longer videos (the 89-second "10 Reasons," the 156-second Acrylic Range) lose people because they're one static shot with no reason to stay past the point they've "got it." Three tools fix this:

Re-hooks

Every ~10–15s, a new mini-promise: "but here's the part nobody tells you…"

Visual variety

Cut every 2–4 seconds. Talking head → macro b-roll → product → result. Never hold one frame for 30s.

Escalation

For listicles, save the best reason for last and tease it early: "and number 1 will surprise you."

④ WORKED EXAMPLE

Re-cutting "10 Reasons to Visit"

29,400 views, founder energy — and still leaves a lot on the table. Here's its actual filmstrip, then the re-cut.

What it does now

Opens mid-sentence as the "10 Reasons" card slowly assembles. One locked-off angle for 89 seconds. All ten reasons at equal weight. No store b-roll. CTA is soft.

The re-cut

Open on Rajesh saying the single most surprising reason ("Reason #1: we'll mix any colour you bring us"). Then "…and 9 more." Cut in 2–3s of real store b-roll between reasons. End on one hard CTA: "Come see for yourself — Kala Ghoda."

⑤ THE CUTDOWN LADDER

Film once. Get six.

This is the efficiency multiplier. Every Rajesh shoot should leave the edit bay as all of these:

AssetWhat goes in itWhere it runs
6s bumperThe single sharpest claim + logoCheap reach / reinforcement
15sHook + one point + CTAIn-feed, Shorts
30sHook + two points + proof + CTASkippable in-stream (the workhorse)
60–90sThe full lessonChannel + in-feed
1–3 ShortsThe most satisfying moments, verticalShorts feed (organic + paid)
Wide + tallBoth crops from one wide shootIn-stream/TV + Shorts
✂️ The math

One hour with Rajesh → 4–6 episodes → ~30 ad-ready assets. That's why the founder format isn't just better creative, it's cheaper creative.