·35 min read·日本語版 →

Audio Settings for Educational YouTube Videos: A Complete Guide

Optimize audio for educational and tutorial YouTube videos. Learn the ideal loudness levels, BGM-to-narration balance, EQ settings for listener comfort, and export configurations for maximum clarity.

Share

Educational Videos Live or Die by Audio Quality#

Viewers of educational YouTube channels have different expectations than entertainment audiences. Their priority is accurately hearing and understanding information, and they'll abandon any video where the audio is hard to follow.

These audio problems commonly bottleneck educational channel growth:

  • Narration is too quiet to hear on a train or in a cafe
  • BGM is so loud it competes with the speaker's voice
  • Extended viewing causes ear fatigue
  • Volume levels vary wildly between videos

This guide provides concrete guidelines for creating audio that lets viewers learn comfortably.

Loudness Standards for Lectures and Tutorials#

YouTube's Loudness Normalization#

YouTube normalizes audio to -14 LUFS. Audio significantly above this threshold gets turned down by the platform.

For educational content, these loudness targets are recommended:

ElementRecommended LUFSRecommended Peak
Overall narration-14 to -16 LUFS-3 dBTP
Per-section narration-16 to -14 LUFS-3 dBTP
BGM-30 to -26 LUFS-6 dBTP
Sound effects-20 to -16 LUFS-3 dBTP

Why -14 to -16 LUFS?#

There are specific reasons for targeting slightly below YouTube's -14 LUFS ceiling:

  1. Reduced fatigue for long videos: High loudness causes listening fatigue in 30+ minute content
  2. Smartphone speaker compatibility: Loud audio distorts on small speakers 3. Background playback: Many viewers listen with earbuds while doing other tasks 4. Minimal platform processing: Staying near -14 LUFS means less normalization impact

BGM and Narration Balance#

Understanding BGM's Role#

In educational videos, BGM serves a fundamentally different purpose than in entertainment content.

What BGM should do:

  1. Eliminate the discomfort of silence

Smooth transitions between topics 3. Maintain attention by preventing monotony 4. Create brand consistency through sonic identity

What BGM must never do:

  • Compete with narration
  • Steal the viewer's focus
  • Manipulate emotions excessively (undermining learning objectivity)

Setting the Right Volume Gap#

Maintain at least 12dB, ideally 16–18dB of separation between narration and BGM.

Narration: -14 LUFS
BGM: -30 LUFS (16dB gap)

At this gap, BGM is present but doesn't register consciously — exactly where it should be.

Choosing BGM Genre#

GenreSuitabilityNotes
Lo-Fi Hip HopExcellentLaid-back tempo, no vocals
AmbientExcellentEnvironmental sounds, never fatiguing
Piano soloGoodChoose pieces where melody doesn't dominate
Acoustic guitarGood60–80 BPM works best
ElectronicaConditionalAvoid strong rhythmic patterns
Pop musicNot recommendedVocals compete with narration

Section-Based BGM Strategy#

Varying BGM usage by section improves the viewing experience.

Intro (0–30 seconds): BGM slightly louder (-24 LUFS) for brand impression. Fade down as narration begins.

Main content: BGM at minimum (-30 LUFS) or remove entirely. BGM-free segments work well for complex explanations.

Topic transitions: Change the BGM track or briefly raise volume. A 2–3 second BGM bridge signals topic shifts.

Conclusion/Outro: Raise BGM slightly (-26 LUFS) and pick up the tempo for a closing impression.

Designing Audio for Extended Listening#

How Listening Fatigue Works#

Three main causes of ear fatigue during long audio content:

  1. Excessive highs: 2–6kHz emphasis sounds clear short-term but causes fatigue over time
  2. Over-compression: Constant loudness with zero variation is exhausting 3. No dynamic variation: Without natural volume fluctuation, the brain can't rest

Anti-Fatigue EQ Settings#

Recommended EQ for educational narration:

  • Below 80Hz: High-pass filter to cut
  • 200–300Hz: -1 to -2dB (reduce muddiness)
  • 1–2kHz: +1dB (bring out vocal body)
  • 3–5kHz: +1 to +2dB (improve clarity, but keep it conservative)
  • 6–8kHz: Flat or -1dB (control sibilance)
  • Above 10kHz: -1 to -2dB (reduce long-session fatigue)

The critical point: keep the 3–5kHz clarity boost conservative. It sounds great in short clips but becomes the primary fatigue source in videos over 30 minutes.

Compression Settings#

Recommended compression for educational narration:

  • Threshold: -18 to -14dBFS
  • Ratio: 2:1 to 3:1 (gentle)
  • Attack: 10–20ms
  • Release: 100–200ms
  • Makeup gain: Compensate for reduction

Ratios above 4:1 make speech sound unnaturally flat — like a robot talking.

Preserving Natural Dynamics#

Educational content benefits from intentional dynamic variation:

  • Emphasize key points with slightly louder delivery
  • Lower volume for supplementary explanations
  • Use pauses and silence strategically

This natural variation maintains viewer concentration and aids comprehension.

Recording Quality Tips#

Microphone Distance#

For educational content, 15–25cm from the mic is optimal.

  • Too close (under 5cm): Proximity effect bloats low-end, plosives become harsh
  • Optimal (15–25cm): Natural tone, consistent volume
  • Too far (over 30cm): Room reflections increase, quality degrades

Pop Filter#

A pop filter is essential. It prevents wind-noise plosives from "P," "B," and "T" sounds.

Multi-Take Loudness Matching#

Educational videos are often recorded across multiple sessions. Voice condition and volume naturally vary between days, making loudness normalization across takes an important step.

Export Settings#

SettingRecommended Value
FormatAAC or WAV
Sample rate48kHz
Bit depth24-bit (if WAV)
Bit rate320kbps (if AAC)
ChannelsStereo
Loudness-14 to -16 LUFS
True PeakBelow -1 dBTP

Key Takeaways#

  1. Target -14 to -16 LUFS for narration mastering
  2. Maintain 16–18dB gap between narration and BGM 3. Choose Lo-Fi or ambient BGM that doesn't compete for attention 4. Keep the 3–5kHz boost conservative to prevent long-session fatigue 5. Use 2:1 to 3:1 compression to preserve natural speech dynamics 6. Vary BGM levels by section for pacing and engagement 7. Normalize loudness across recording sessions for channel-wide consistency

Clear, comfortable audio simultaneously improves viewer learning outcomes and drives channel growth.

Was this article helpful?
Share

Try DeckReady now

Free in your browser. Process up to 5 tracks without signing up.

Start Mastering

Get DJ mastering tips

Weekly tips for music production.

Related Articles