The best hook in the world dies in the wrong frame or behind garbled captions. 54 of your 62 videos are vertical-only, several are letterboxed in the worst way, and auto-captions are quietly wrecking watch time. Here's the production rulebook.
9:16 (tall) is for Shorts and Reels. 16:9 (wide) is for YouTube's main player, in-stream ads, and TVs. Right now almost everything you make is tall, so on the main YouTube surface your videos show up small and boxed-in with grey bars โ they look like leftovers. And the few wide ones (the Galeria spot, the search-experience demo) are stranded in landscape on a vertical-first feed.
Shoot wide (16:9), crop tall (9:16) in the edit. Frame Rajesh with empty space on both sides so a centre crop works. One shoot โ both shapes. Never do it the other way around โ you can't un-crop a tall video into wide.
On Shorts and Reels, the platform stacks its own UI over your video: the top strip and especially the bottom third (like button, caption, share, the channel handle). Several of your videos burn captions and the logo right into that bottom third, where they get covered.
Keep all text and logos in the central "safe zone." Captions sit just above the bottom-third line; logo top-corner. Never the very bottom.
Captions aren't an accessory โ they're how the majority experience the video. Treat them as primary design.
Several videos ship raw auto-transcriptions that mangle the Hindi/Marathi audio into nonsense ("CHIZE VAPARE"). To a viewer it reads as broken. Always burn in clean, written captions. Never let YouTube auto-generate them for an ad.
Out of the UI safe zone. Big, high-contrast, two or three words at a time. The Hindi "SASTA ROVE BARA-BARA" all-caps style works; tiny low subtitles don't.
Word-by-word "karaoke" highlight keeps eyes moving and aids muted viewing. One consistent caption font across the channel โ not a different style per video.
You already shoot Hindi / Marathi / English โ that's a real edge. But label and target each cut correctly (the "Marathi" video with English captions is why it has 16 views). Match caption language to the audience.
Face the biggest window, or one soft light at 45ยฐ. Rajesh's desk setup is already decent โ just keep it consistent and avoid the overhead-shadow look.
A โน1,500 clip-on mic beats any phone mic. Clean audio is the difference between "shop video" and "creator."
Clear the table to the 2โ3 hero products. The cluttered desk hides the very thing you're selling.
From the audit: a printout of a product used as the hero shot, a frame frozen mid-transition, and a "Wainsor & Newton" typo on a premium brand. One person reviewing before publish catches all of it. Make it a step.
Several "ads" are repurposed manufacturer clips with no Art Lounge mark โ that's free advertising for Winsor & Newton. Logo top-corner on everything, always.
The Stabilo unboxing's persistent "SHOP NOW ยท KALA GHODA" banner is the model โ copy it. End card with product + price + one CTA on every cut.
Rajesh's face + one bold question word, high contrast. A clean product shot loses to a human face with an expression every single time.
Lead with the search or the curiosity: "Why Are Art Supplies So Expensive? (honest answer)", "Best Watercolour Kit Under โน2,000". Put the keyword first.