How to Make Explainer Videos with Screen Recording
Script, record, and edit explainer videos using screen capture — from feature demos to concept walkthroughs to process documentation.
Explainer videos convert better than any other content format. Landing pages with an explainer video see 80% higher conversion rates on average. Product pages with video hold attention 2.6x longer than those with text alone.
And the fastest way to produce an explainer video for a software product, SaaS tool, or digital workflow is screen recording. No camera crew. No animation budget. No After Effects timeline that takes 40 hours to render. Just your screen, your voice, and a structure that moves from problem to solution in under 5 minutes.
This guide covers the three types of screen-recorded explainer videos, a script structure that keeps viewers watching, and four tools for recording and editing — from free to professional.
Quick Comparison
| Tool | Price | Screen Recording | Timeline Editor | Auto-Zoom | Transitions | Best For |
|---|---|---|---|---|---|---|
| Screenify Studio | Free + Pro | Yes | Multi-track | Yes | Yes | All-in-one screen explainers |
| Camtasia | $313 one-time | Yes | Multi-track | No | Library | Professional training videos |
| DaVinci Resolve | Free + $295 | No (import only) | Full NLE | No | Professional | Post-production heavy projects |
| Canva | Free + $13/mo | No | Basic | No | Templates | Quick social media explainers |
Three Types of Screen-Recorded Explainer Videos
Not every explainer video is the same. The type determines your script structure, recording approach, and ideal length.
Type 1: Feature Explainer
Purpose: Show what a specific feature does and why it matters. Length: 60-120 seconds. Structure: Problem, then feature demo, then outcome.
Example: "Auto-zoom keeps your viewer focused on the relevant part of the screen, even when you are working across a 27-inch display. Here is how it works — I click this button, and Screenify tracks my cursor, zooming into each area as I interact with it. The result: your recording looks professional without manual keyframing."
Feature explainers work best for product marketing pages, social media ads, and onboarding flows. The viewer already has some context about the product — you are explaining a specific capability.
Type 2: Concept Explainer
Purpose: Teach an abstract idea using your screen as a visual aid. Length: 2-4 minutes. Structure: Context, then visual demonstration, then takeaway.
Example: "DNS propagation is why your domain change does not work immediately. Let me show you what happens — I open Terminal, run a dig command against different nameservers, and you can see that Google's DNS already has the new record while Cloudflare's cache still shows the old one. This inconsistency resolves within 24-48 hours as each server's TTL expires."
Concept explainers use the screen as a teaching canvas — terminals, browsers, diagrams, dashboards. They work for YouTube educational content, internal training, and documentation.
Type 3: Process Explainer
Purpose: Walk through a multi-step workflow from start to finish. Length: 3-5 minutes. Structure: Overview of steps, then each step demonstrated, then summary.
Example: Walking through the deployment pipeline — from git push to CI passing to staging preview to production deploy. Each step shown on screen with narration explaining what happens and why.
Process explainers are the most common type for software tutorials and internal documentation. They require the most recording time but provide the most lasting value.
Script Structure: Problem, Context, Solution, Takeaway
Every effective explainer video follows a four-part structure. Writing the script before recording eliminates rambling, keeps the video tight, and ensures you cover every point.
Part 1: The Problem (10-15% of total time)
State the problem your viewer has. Be specific. Not "managing projects is hard" but "you have 47 open tickets across three boards and no way to see which ones are blocked."
Open with the problem on screen. Show the messy dashboard, the confusing interface, the error message. The viewer should immediately think "yes, I have that exact problem."
Part 2: The Context (10-15%)
Brief background on why this problem exists. One or two sentences. This bridges the gap between "I have this problem" and "here is how to fix it."
"Most project tools show you what is open, but not what is stuck. Dependencies between tickets are invisible unless you mentally map them — which does not scale past 20 tickets."
Part 3: The Solution (60-70%)
This is your screen recording. Show the actual steps, the actual interface, the actual result. Move deliberately — do not rush, but do not linger on loading screens or repetitive actions.
Pacing rules:
- One action per sentence of narration
- Pause cursor for 1 second on each important element before clicking
- Skip or speed up any step that takes more than 5 seconds of waiting (builds, uploads, API calls)
- Zoom into small text and UI elements — viewers on phones cannot read 12pt font at full-screen scale
Part 4: The Takeaway (5-10%)
Summarize what the viewer now knows or can now do. Repeat the key outcome, not the steps. "You now have a dependency map that automatically flags blocked tickets — no more manually checking each one."
End with a clear next action: download the tool, try the feature, read the documentation. One action, not three.
Method 1: Recording and Editing with Screenify Studio
Screenify handles both recording and editing in a single application, which eliminates the export-import cycle between a recorder and a separate editor.
Step 1: Script your video.
Write your script following the problem-context-solution-takeaway structure. For each section, note which screens, apps, or windows you will need open. Prepare demo data — fake accounts, sample projects, placeholder content — so you are not showing real user information.
Step 2: Set up your screen.
Close every application except the ones you will demonstrate. Disable notifications (System Settings, then Notifications, then toggle off). Arrange windows to minimize switching — if you need two apps, position them side by side.
Step 3: Record with Screenify.
Open Screenify Studio, select your capture area (full screen or specific window), and enable microphone input for narration. Start recording and follow your script.
Do not worry about perfect delivery. Screenify's timeline editor lets you cut mistakes, remove pauses, and rearrange sections after recording. Recording in one continuous take and editing later is faster than re-recording each section until it is perfect.
Step 4: Edit in the timeline.
Screenify's multi-track timeline lets you:
- Cut dead time (loading screens, typing mistakes, "um" pauses)
- Add auto-zoom that follows your cursor into the relevant UI area
- Insert transitions between sections (dissolve or cut)
- Adjust audio levels if your narration volume varies
For explainer videos, auto-zoom is the single most impactful feature. Full-screen recordings look professional when the camera smoothly pushes into each button click and text field — it guides the viewer's eye exactly where you want it.
Step 5: Add captions.
Generate captions with Screenify's on-device AI. For explainer videos shared on social media or embedded on landing pages, captions increase engagement significantly. They also make your video accessible to viewers who are deaf or hard of hearing.
Step 6: Export and distribute.
Export as MP4 (H.264) at 1080p or 4K. For landing pages, 1080p at a 6-8 Mbps bitrate balances quality and load time. For YouTube, go 4K if your content warrants it. Screenify's Metal encoder exports fast enough that you can iterate on edits without waiting minutes between exports.
Try Screenify Studio — free, unlimited recordings
Auto-zoom, AI captions, dynamic backgrounds, and Metal-accelerated export.
Method 2: Using Camtasia
Camtasia has been the standard for professional screen-recorded training videos for over a decade. It combines screen recording with a multi-track editor and a library of built-in assets.
Step 1: Record your screen.
Camtasia's recorder captures your full screen or a selected region. Enable system audio and microphone simultaneously. Camtasia records both in separate tracks, giving you independent volume control in post.
Step 2: Edit in the Camtasia editor.
Camtasia's editor offers:
- Multi-track timeline with unlimited video and audio tracks
- Callout annotations (arrows, boxes, highlights)
- Built-in transition library (40+ options)
- Zoom-and-pan keyframes for magnifying specific screen areas
- Cursor effects (spotlight, highlight, click animation)
The zoom-and-pan feature requires manual keyframing — you set the zoom region and timing for each magnification. This gives precise control but takes 3-5x longer than auto-zoom tools.
Step 3: Add captions.
Camtasia includes a caption editor. You can type captions manually, import an SRT file, or use Camtasia's speech-to-text feature (cloud-processed). Manual captioning is accurate but time-consuming — budget 5 minutes of captioning work per minute of video.
Step 4: Export.
Export as MP4. Camtasia supports presets for YouTube, Vimeo, and custom resolutions. The one-time $313 price tag covers both recording and editing — no subscription.
Limitations: No auto-zoom (manual keyframing only). The interface feels dated compared to modern tools. macOS version has historically lagged behind the Windows version in feature parity.
Method 3: Using DaVinci Resolve (Free)
DaVinci Resolve is a professional non-linear editor used in Hollywood post-production. The free version includes nearly every editing feature. It does not record your screen — you need a separate tool for that.
Step 1: Record with a separate tool.
Use Screenify, QuickTime, or OBS to capture your screen recording. Export as a high-quality MP4 or ProRes file.
Step 2: Import into DaVinci Resolve.
Create a new project, set your timeline resolution (1920x1080 or 3840x2160), and import your recordings into the media pool.
Step 3: Edit on the timeline.
Resolve's editing page gives you:
- Unlimited video and audio tracks
- Professional trim tools (ripple, roll, slide, slip)
- Keyframeable zoom and position for magnifying screen areas
- Color correction (if your recording needs white balance adjustment or contrast tweaking)
- Fairlight audio page for noise reduction, EQ, and compression on your narration track
Step 4: Add transitions and text.
Use Resolve's Fusion page for text overlays, lower thirds, and animated callouts. Fusion is node-based and powerful, but it has a learning curve steeper than most screen recording tools.
Step 5: Export from the Deliver page.
Resolve's export presets cover YouTube, Vimeo, Twitter, and custom formats. The free version exports H.264 and H.265. Export times are reasonable on Apple Silicon.
Limitations: No screen recording capability — requires a separate tool. The learning curve is significant if you have never used a professional NLE. Overkill for simple feature demos, but justified for high-production explainer videos that mix screen recordings with slides, animations, and multiple camera angles.
Method 4: Using Canva for Quick Explainers
Canva's video editor is surprisingly capable for short, social-media-oriented explainer videos that combine screen recordings with slides and text animations.
Step 1: Record your screen.
Use any screen recording tool (Screenify, QuickTime, OBS) to capture your footage. Canva also has a built-in screen recorder (browser-based), but it is limited in quality and feature set.
Step 2: Create a Canva video project.
Select a video template or start from a blank canvas at your target resolution (1920x1080 for landscape, 1080x1920 for vertical).
Step 3: Build your explainer.
Canva excels at mixing formats:
- Add a title slide with animated text as your intro (the "problem" section)
- Insert your screen recording for the "solution" section
- Add another slide with key takeaways
- Use Canva's built-in transitions between each section
Step 4: Add text overlays and branding.
Canva's text tools let you add annotations, callout boxes, and branded elements on top of your screen recording. This is useful for labeling UI elements or adding step numbers.
Step 5: Export.
Export as MP4. Canva Pro gives 4K export; the free tier exports at 1080p.
Limitations: No auto-zoom, no cursor effects, no multi-track audio editing. The timeline is basic — one video track. Best for short explainers (under 2 minutes) that combine slides with screen recordings, not for long-form tutorials.
Try Screenify Studio — free, unlimited recordings
Auto-zoom, AI captions, dynamic backgrounds, and Metal-accelerated export.
Slides-to-Screen Transitions: The Hybrid Approach
The most effective explainer videos mix presentation slides with screen recordings. Slides set up the problem and provide visual context; screen recordings deliver the proof.
Transition structure:
- Slide: Problem statement. Clean typography, one key statistic or pain point. 5-8 seconds.
- Screen recording: The current broken experience. Show the actual messy workflow — the confusing dashboard, the error, the 12-step process. 10-15 seconds.
- Slide: "There is a better way" transition. A single sentence. 3 seconds.
- Screen recording: The solution. Show the streamlined workflow with your product or approach. 30-60 seconds with auto-zoom on key interactions.
- Slide: Outcome and CTA. Result metric and one clear next action. 5 seconds.
Technical execution:
Record your screen segments first, then create matching slides. Import both into your editor (Screenify, Camtasia, or Resolve) and assemble them on the timeline. Add dissolve transitions between slides and screen recordings — hard cuts feel jarring when switching from static graphics to live screen motion.
For recording your slide transitions, export your presentation as individual PNG images and import them as still frames in your editor. This gives you exact control over timing without the latency of switching between Keynote/PowerPoint and your recording.
Length Guidelines by Explainer Type
| Video Type | Ideal Length | Maximum Before Drop-off |
|---|---|---|
| Feature explainer (landing page) | 60-90s | 2 min |
| Feature explainer (social media) | 30-60s | 90s |
| Concept explainer (YouTube) | 3-5 min | 8 min |
| Process explainer (internal) | 3-7 min | 10 min |
| Onboarding walkthrough | 2-4 min | 5 min |
| Sales demo recording | 5-10 min | 15 min |
The 2-second rule: If nothing changes on screen for 2 seconds, your viewer's attention is drifting. Every 2 seconds should bring either a new visual element, a cursor movement, a screen change, or a narration beat. This does not mean frantic pacing — it means eliminating dead time.
Audio and Narration Best Practices
Narration quality separates amateur explainers from professional ones. You do not need a studio — but you need to avoid the common mistakes.
Microphone matters more than camera. A $50 USB condenser microphone (Audio-Technica ATR2100x, Samson Q2U) produces dramatically better audio than your MacBook's built-in mic. The built-in mic picks up fan noise, keyboard clicks, and room echo. An external mic on a desk stand isolates your voice.
Record in a small room with soft surfaces. Hard walls and empty rooms produce echo. A closet full of clothes is acoustically better than a conference room. If you cannot change rooms, throw a blanket over your desk and lean toward the mic.
Speak 10% slower than your conversational pace. Viewers are processing visual information from the screen simultaneously. Narration that races ahead of the visuals loses the audience. If you catch yourself saying "and then, and then, and then" — you are going too fast.
Script key sentences, ad-lib the rest. Write out your opening sentence, each section's key point, and your closing sentence verbatim. Ad-lib the transitions and demonstrations. Fully scripted narration sounds robotic; fully improvised narration rambles.
Troubleshooting Common Issues
Narration and screen actions are out of sync
Record narration and screen simultaneously, not separately. Matching voiceover to pre-recorded screen footage in post-production is tedious and rarely sounds natural. If you must record separately, narrate while watching your screen recording playback — this keeps your pacing matched to the visual.
The video feels too long but all content is necessary
Split it into a series. "Part 1: Setup" and "Part 2: Advanced Configuration" each perform better as standalone 3-minute videos than a single 8-minute monolith. Viewers are more likely to start a 3-minute video, and series create multiple touchpoints in search results and recommendations.
Screen recording is blurry when embedded on a webpage
Your video player is upscaling a low-resolution recording. Record at minimum 1080p. For landing page embeds, export at 1080p with 8-10 Mbps bitrate. If your screen has small text, record at 4K and let the player downscale — this preserves text clarity at all playback sizes.
Auto-zoom moves too aggressively
If your recording tool's auto-zoom follows every micro-movement, the result is nauseating. In Screenify, adjust the zoom sensitivity and smoothing settings. Reduce zoom speed and increase the padding around the zoom target. For explainer videos, gentler zoom feels more professional than snappy zoom.
Export file size is too large for web embedding
A 5-minute 1080p explainer at 10 Mbps is roughly 375 MB — too heavy for web embedding. Reduce bitrate to 4-6 Mbps. Use H.264 High Profile for better compression at the same visual quality. Alternatively, upload to a video hosting platform (YouTube, Vimeo, Wistia) and embed their player instead of hosting the file yourself.
FAQ
Q: How long should a software explainer video be?
Under 2 minutes for feature demos on landing pages. 3-5 minutes for concept explainers on YouTube. 5-10 minutes for comprehensive process walkthroughs intended for internal training. The key metric is not length but information density — every second should show or tell something the viewer did not already know.
Q: Do I need professional video editing software for explainer videos?
No. For screen-recorded explainers, a tool like Screenify Studio handles recording, editing, zoom, and captions in one application. Professional NLEs like DaVinci Resolve add value only when you need color grading, motion graphics, or multi-camera editing. Most software explainer videos need none of those.
Q: Should I show my face in an explainer video?
For YouTube content, a brief face-cam intro and outro builds trust and personal connection. During the screen recording portion, switch to full-screen — your face takes up space that the software demonstration needs. For landing page and product explainer videos, skip the face cam entirely. The product is the star.
Q: How do I script an explainer video efficiently?
Use the four-part structure: Problem (what is broken), Context (why it is broken), Solution (how to fix it — this is your screen recording), Takeaway (what the viewer can now do). Write the problem and takeaway first, then script the solution walkthrough, and add context last. Total scripting time: 20-30 minutes for a 3-minute video.
Q: What is the best resolution for explainer videos?
1920x1080 (1080p) is the standard for web and social media. Use 3840x2160 (4K) only for content with dense text (code editors, spreadsheets, dashboards) where detail matters. Avoid 720p — it looks soft on modern Retina displays and signals low production quality.
Q: Can I use AI-generated voiceover instead of recording my own voice?
You can, but it changes the character of the video. AI voices have improved dramatically, but viewers still detect synthetic speech — especially in educational content where trust and authority matter. If your product is aimed at developers or technical audiences, authentic narration outperforms synthetic narration in retention metrics. For internal videos where polish matters less, AI voiceover saves time.
Q: How do I handle mistakes during recording?
Pause, take a breath, and restart the sentence. In post-production, cut the mistake. This is faster than stopping and re-recording from the beginning. Screenify and Camtasia both support non-destructive trimming — you are cutting a copy, not destroying the original take. Budget 20-30% of your recording time for mistakes you will edit out.
Q: What makes a good thumbnail for an explainer video?
Show the end result, not the process. A clean screenshot of the finished workflow or feature with a 3-4 word text overlay (large, readable at thumbnail size) performs better than a generic "play button on a screen" image. For YouTube, include a face with an expressive reaction if your channel uses face-cam content.
Next Steps
Start with a feature explainer — it is the shortest format with the highest ROI. Script 60 seconds using the problem-context-solution-takeaway structure, record your screen, add zoom and captions, and publish. Iterate from there.
For recording technique tips that apply across all video types, see our guide on making software tutorials that hold attention. If your explainer targets a social media audience, our social media screen recording guide covers platform-specific formatting and duration rules.
Try Screenify Studio
Record your screen with auto-zoom, AI captions, dynamic backgrounds, and Metal-accelerated export. Free plan, unlimited recordings.
Download Free