Sound Design: The 50% of Video Nobody Talks About

Sound Design: The 50% of Video Nobody Talks About

Why Audio Quality Matters More Than Visual Quality

There's a quote from the BBC Academy that sound engineers cite to explain their work: "Audio is the 50% of video that nobody talks about — until it's wrong."

The data backs it up. George Lucas famously claimed that "sound is 50% of the experience" of a film. More recently, research from MIT's Media Lab showed that viewers will tolerate poor video quality if the audio is excellent, but they will leave a video with great visuals if the audio is bad — within seconds.

We see this exact pattern at Mark Studios. The single biggest predictor of whether a client's video will hit retention targets isn't the cut, isn't the color grade, isn't the thumbnail. It's the audio mix.

The Five Layers of Pro Audio

A professionally-edited video has five distinct audio layers stacked together. Most DIY edits have two. Here's the full stack:

Layer 1 — Dialogue (the voice)

The most important layer. Two rules:

  • -12 dB to -6 dB peak, with the average sitting around -16 dB LUFS. This matches what YouTube's loudness normalization targets. Mix too quiet and YouTube boosts it (introducing noise); mix too loud and YouTube compresses it (sounds flat).
  • De-noise then EQ then compress, in that order. Tools: iZotope RX for noise removal, FabFilter or Adobe's parametric EQ for shaping, then compression to even out levels.

Layer 2 — Music

Music sets emotional tone faster than any visual element. Three rules we follow:

  • Music sits at -24 to -30 dB when dialogue is on top. It's there to be felt, not heard consciously.
  • Use side-chain ducking so music auto-quiets under dialogue. Stops the "music fights voice" problem.
  • Match BPM to the cut. If you're cutting every 4 seconds, the music's beat should fall on those cuts. Mismatched BPM feels visually off even if you can't articulate why.

Recommended sources for licensed music in 2026:

Layer 3 — Sound effects (foley + library)

This is where most DIY edits fall short. Pro edits use SFX constantly:

  • Whooshes on transitions — a quick whoosh under a cut hides the cut and adds energy.
  • Risers under tension moments — a 1–2 second rising-pitch sweep makes the viewer lean in subconsciously.
  • Impacts on key statements — a deep bass thud under a stat or claim makes it land harder.
  • Ambient layers — subtle room-tone or atmosphere under interview scenes makes the audio feel "alive" instead of clinical.

Recommended SFX libraries:

Layer 4 — Atmosphere / room tone

Room tone is the silent presence of the recording space. Without it, every cut sounds like a hard edit because the silence between dialogue is true digital silence (zero noise floor) — and that triggers the brain as "this is fake."

  • Record 30 seconds of pure room tone at every shoot. Layer it under dialogue at -45 to -50 dB throughout.
  • Match the tone of every dialogue clip to the same room tone. Smooths over the edits.

Layer 5 — Mastering bus processing

The final polish across the whole mix:

  • Light compression on the master bus (1.5:1 ratio, slow attack) to glue everything together.
  • Limiter at -1 dB ceiling to prevent any peaks from clipping.
  • Final loudness target: -14 LUFS for YouTube, -16 LUFS for Spotify-distributed podcasts, -18 LUFS for film.

The Mistakes Killing Most Channels

What we see when we audit DIY-edited channels:

  1. Music too loud. Music drowning dialogue is the #1 audio sin.
  2. Mono dialogue, stereo music. Wide stereo music with center-locked dialogue creates an uncomfortable listening experience. Either both stereo or both mono.
  3. No de-noising. Air conditioner hum, mic hiss, room echo. All fixable in 5 minutes with iZotope RX Voice De-noise. Almost no DIY edit does this.
  4. Inconsistent loudness across cuts. Two clips edited together with different levels make the viewer constantly adjust their volume. Use a normalize-to-LUFS pass after the cut.
  5. Hard cuts with no audio crossfade. A 100ms crossfade on every dialogue cut hides the seam. Without it, every cut is audible.
  6. Stock music that's everywhere. That free Epidemic Sound track that 100K other videos use? Your audience subconsciously associates it with cheap content. Pick less popular tracks.

The Sound Design Checklist We Run on Every Edit

Before we ship any client video, we run this 7-point gate:

  1. ✅ Dialogue averaged at -16 LUFS, peaked under -6 dB
  2. ✅ Music ducked under dialogue, sitting at -24 dB or lower
  3. ✅ Whooshes on at least 50% of transitions
  4. ✅ Risers under any tension/reveal moment
  5. ✅ Room tone layered under all dialogue
  6. ✅ De-noised, de-essed, EQ'd dialogue
  7. ✅ Final master bus limiter at -1 dB ceiling

If we miss any one of these, retention drops measurably on the resulting video.

How to Test Your Mix

Three quick tests we run on every edit before delivery:

  1. Phone-speaker test — play the final mix through laptop speakers (not headphones). If the dialogue is hard to hear or the music is overpowering, fix it. 70%+ of YouTube viewers watch through phone or laptop speakers, per Wistia's 2024 viewing-environment study.
  2. Walking-around test — play the video and walk around the room. If you have to lean toward the speakers to hear the dialogue, your mix is dialogue-quiet.
  3. Mute test — watch the video on mute with captions. If the visuals don't carry the story, your audio is doing too much heavy lifting.

The Bottom Line

Most creators spend 90% of their editing time on the visual cut and 10% on audio. The pros invert that ratio. Audio is what makes a video feel professional, makes viewers stay through the second minute, and makes them subscribe instead of scrolling.

If you want our team to handle sound design and audio mastering on your videos, we typically include this in every editing project — but we also offer audio-only services for creators who edit their own visuals.

👉 Start Your Project Now