Learn a Language by Watching — Shorts That Quiz You, Then Explain
Erla's Watch Mode plays short AI-generated videos in the language you are learning — silent on the first play, with subtitles on the second. Catch what you can, tap any sentence for the grammar, then swipe to the next.

Watching is how you fell for movies. It works for languages too.
You can learn a language by watching short videos — not by squinting at subtitles on a TV show you only half-follow, but by swiping through a feed of clips made to sit just barely beyond your level. That is the idea behind Watch Mode in Erla. A five-to-eight second short plays silent. You watch the scene, hear the line, and try to catch it cold. Only then does the replay roll, this time with the subtitles on.
It does not feel like homework. It feels like the doomscroll you already do — except every short leaves a real sentence behind. The format is full-screen and vertical, like a TikTok or a Reel. The next short queues behind a swipe. Like a short and Erla shows you more like it. Thumbs-down it and it never comes back. Master it twice in a row and it quietly graduates out of your rotation.
This is how comprehension grows for adults: short bursts of language one step above your level, with the visual scene as scaffolding and the text waiting one tap away. Watch Mode is the feed-shaped sibling to Listening and Reading — same comprehensible input principle, different surface. Five to ten minutes a day is enough.
How a short works
Five steps. Five to ten seconds each. As many as you want — ten free a day.
- 1
A short opens in a vertical feed
Watch Mode is built like a TikTok feed — full-screen 9:16 portrait, one short at a time, the next short queued behind a swipe. Tap to pause, swipe up for the next one. Headphones in or phone speaker on — both work, headphones catch the small consonants.
- 2
First play: audio and visuals, no subtitles
The short plays once with sound and scene but no text. Five to eight seconds. You try to catch the line cold, using whatever the picture and the intonation tell you. You will miss things — that is the cognitive stretch the method is built around.
- 3
Second play: subtitles revealed, sentence by sentence
Tap to play again and the subtitles come in, aligned to the audio. Read along, hear how each line was actually said, and lock the spelling onto the sound your ear half-caught the first time around.
- 4
Tap any sentence for translation and grammar
Tap any line of subtitles to open the same translation-and-grammar panel the other Erla modes use. Native-language translation, word-by-word breakdown, a grammar note specific to that sentence. Close it and the short picks up where it paused.
- 5
Like it, swipe to the next, master it out
Thumbs-up a short and Erla shows you more like it. Thumbs-down retires it. Two no-help completions in a row and a short graduates out of rotation — toggle Only New to focus only on shorts you have never seen.
Why learning a language by watching shorts actually works
Three reasons short-form video beats subtitled Netflix for adult learners.
Short beats long for adult attention
Five to eight seconds is short enough to watch fully attentive. Two hours of a subtitled film is not — the eye drifts to the text, the ear stops listening, and an hour later you cannot say what the dialogue actually was. Watch Mode never asks for more than one short at a time.
Visual context locks in vocabulary
Hear a word at the same moment you see what it refers to and your brain files the sound, the image, and the meaning together. That is two more hooks than a flashcard ever offers, and one more than audio-only listening. Reading and Listening already help; Watch closes the third side.
Guess-then-check beats subtitled passive watching
The silent first play is the work. The moment you guess what was said and then watch the reveal, your brain locks in the difference. Subtitled Netflix removes that gap — the text is there from the first frame and your eyes do all the work. Watch Mode puts the gap back.
Short shorts fit short moments
Five seconds at a time, the language comes home with you.
In place of the morning doomscroll
Pick up the phone, open Erla, swipe a Spanish short instead of a TikTok. Same gesture, same dopamine. Two minutes later the day starts with a sentence you actually caught.
Between meetings
Three minutes is six or seven shorts and a streak update. Open Erla in the gap, finish a short, close the app, take the next call in your other language.
On the treadmill
Headphones in, phone on the rack. Swipe between sets or between songs. Watch Mode does not need your full attention to leave a sentence behind.
In line for coffee
Three shorts is one minute. The barista is still steaming milk. Walk away with a phrase you would have missed on the radio.
When listening alone feels abstract
Some audio just will not stick — the words land but nothing visual catches them. Watch Mode pairs the line with a scene, and suddenly the meaning is concrete. Use it as the easier mode on the days Listening feels uphill.
Before bed, with a goal
The phone is already in your hand. Skip the algorithm feed for the Erla feed — close the day on a sentence you understood instead of an outrage.
What is inside every short
Small details that matter when the language moves fast.
AI-generated 5–8 second shorts
Native-quality voice, real intonation, a clean visual scene. Made for learners — clear pronunciation, no on-screen text clutter.
Vertical 9:16 TikTok-style feed
Full-screen portrait, swipe-up to skip, tap to pause. TikTok-style language learning in the format your thumb already knows.
Two-play comprehension reveal
First play silent, second play with subtitles. Catch it cold, then check what you got. The gap is where the learning happens.
Tap any sentence for translation and grammar
Translation in your native language, a word-by-word breakdown, and a grammar note about this exact sentence — not a generic chapter.
Like, dislike, master-out
Thumbs-up to queue more like it. Thumbs-down to retire it. Two no-help completions and a short graduates out of rotation automatically.
Free up to ten shorts a day
Ten shorts every day on the free tier — no card needed. Listening and Reading Mode are also free with no daily cap. Chat, Talk, and unlimited Watch shorts unlock with PRO.
Yes, the first play will go past you — that is the point
The first time a short plays silent, you will catch maybe one word in five. That feels uncomfortable. It is also the cognitive zone researchers call comprehensible input — the productive stretch where adult brains actually build new pattern. Comfortable equals not learning, and the silent first play is the only moment in the whole loop where comfort is impossible.
Watch Mode is built around that discomfort, not against it. The replay always comes. The subtitles always reveal. The grammar is always one tap away. After a week the gap shrinks. After a month a short that was a blur becomes a sentence — and then the next short, slightly harder, takes its place.
Watch Mode — FAQ
Short answers to what people ask before they download.
Yes — for adult learners, short, level-matched video is one of the fastest channels into a new language because the visual scene gives the brain a hook the audio alone never offers. Watch videos to learn a language and you encode the sound, the spelling, and the scene at once — three hooks instead of one. Erla's Watch Mode is built specifically for that: 5–8s AI-generated shorts that play silent first, replay with subtitles second, and let you tap any sentence for grammar. Five to ten minutes a day for a few weeks and the gap between what you can read and what you can catch by ear closes fast.
Pairs well with the other four modes
Watching is the most scaffolded mode. The others build on the pattern recognition it gives you.
Listening Mode
Audio-first sentences with a reveal. The natural partner to Watch — and fully free with no daily cap.
Reading Mode
Short stories with sentence-level translation, grammar, and native read-along audio. Also free.
Chat Mode
Text conversations with an AI tutor who adapts to your level and gives grammar feedback. PRO.
Talk Mode
Voice conversations with an AI tutor — real-time pronunciation feedback. PRO.
Watch your first short today
Watch Mode is free for ten shorts a day on iOS and Android. Five minutes from now you will have swiped through your first AI-generated short in a new language — and replayed it once, this time with the subtitles on.