On‑Screen Text for Shorts
Many Shorts views happen without sound, so on‑screen text becomes the main way to explain what’s going on. If text is small, low‑contrast, or sits under the interface — retention drops even with a good script. Below are readability rules and a checklist so people can actually read your text on a phone.
Telegram bot will open — build a video in a minute and instantly test edits.
Why bad text lowers retention
When viewers can’t read fast enough, they lose context and leave. This is especially visible in the first seconds: if the promise is “in the text” but it’s hard to decipher, the hook fails.
- Unreadable → the brain doesn’t understand what’s happening → swipe.
- Too much text → the viewer strains instead of watching.
- Text doesn’t match what you say/show → it feels chaotic.
Common mistakes (and why they kill the video)
- Tiny font. On a phone it becomes noise, not a hint.
- 3–4 lines. The eye can’t “grab” meaning in a second.
- Low contrast. White on light background, colored letters without stroke/shadow.
- Covering the interface. Text goes into areas blocked by buttons and captions.
- Too fast changes. Text switches faster than it can be read.
Readability rules: make it readable on a phone
1) One idea per screen
Write short: 2–6 words. It’s better to use two screens than to explain everything on one.
- Bad: “Today I’ll show how to increase retention in Shorts without complex editing.”
- Better: “Increase retention” → next screen: “3 edits in 10 minutes”.
2) 1–2 lines, large
Ideally one line. If two, break by meaning, not “however it fits”.
3) Contrast + stroke/shadow
Text must “stand” on the background. Easiest: light text with shadow/stroke. Use colored text only as an accent (one word/number).
4) Don’t enter “danger zones”
Shorts have UI: buttons on the right, caption on the bottom, indicators on top. Keep text closer to the center and leave margins.
- If you place text near the bottom — move it higher than you think.
- If you place text on the right — it will likely be covered by buttons.
5) Sync with meaning
On‑screen text should amplify what you say/show: keywords, steps, numbers, outcome. When text is “about something else”, viewers lose the thread.
Text templates for different formats
Checklist / steps
- “Step 1: …”
- “Step 2: …”
- “Step 3: …”
Mistakes
- “Mistake #1: …”
- “Mistake #2: …”
- “Mistake #3: …”
Before/after
- “Before: …”
- “After: …”
- “Why it’s better: …”
Checklist: quick audit of on‑screen text
- Large enough? You can read without strain on a phone.
- Short enough? 2–6 words per screen, max 1–2 lines.
- High contrast? Visible on any background (shadow/stroke).
- Safe zones? Nothing is covered by the interface.
- Pace? Text stays long enough to read.
- Meaning matches? Text supports shot and speech, not conflicts with them.
The most practical test: watch your video with sound off. If in 3–5 seconds you still don’t understand what it’s about — text and structure need simplification.
On‑screen text and subtitles are different jobs
In Shorts people often mix two types of text and get overload:
- On‑screen text = meaning markers: “mistake #1”, “step 2”, “result”, numbers and keywords.
- Subtitles = support for speech so viewers understand without sound.
If you show both subtitles and large headings, keep only what’s essential: 1–2 meaning accents + short subtitles. Otherwise the viewer can’t read and watch at the same time.
Reading pace: how long text should stay on screen
Even a perfect font won’t help if text “blinks”. A simple principle: viewers need time to see, read, and understand. It’s better to keep text a bit longer than it feels to you.
- One short line — usually 1–2 seconds is fine.
- Two lines — often needs 2–3 seconds (or tighten the wording).
- Numbers and keywords read faster — use them to keep pace.
A good trick: show the accent first (“Step 2”), then a fraction of a second later add a short clarification. Viewers latch on to the structure faster.
How to compress long phrases into readable text
Long sentences don’t work well on a phone. Compress into “noun + verb + specific detail”.
- Long: “Today I’ll show how to increase retention in Shorts without complex editing”.
- Shorter: “Increase retention”.
- More specific: “3 edits in 10 minutes”.
If it doesn’t fit — split into two screens. It often increases retention: viewers wait for the second half of the phrase.
Accents without overload
Accents help retention when there are few and they matter. Highlight only one word or number (for example, “3”, “10 minutes”, “mistake”). Keep everything else in one style.
- Don’t highlight every second word — the eye stops understanding what’s important.
- Don’t push text against the edges — even large type becomes stressful.
- Duplicate the important part — a keyword in text + in speech.
How to test changes faster
On‑screen text is easy to test with versions: bigger/smaller, different wording, different accents, different switching speed. But manual subtitle styling often slows experiments down. When a draft is generated automatically, you can quickly assemble 2–3 variants and pick the one that holds viewers better.
To implement the tips from this page faster, build two versions with one difference and check results on the next upload. In the AdShorts AI Telegram bot you can quickly re‑assemble a video (script, voiceover, subtitles, music, background) and test edits without long manual work.
Telegram bot will open — build a video in a minute and instantly test edits.