
The mission: Figure out the best possible settings for maintaining ultra audio quality on Youtube through a variety of experiments.
Possible Factors: RMS Level, Peak Level, Video Resolution, 20Khz sin wav addition, and probably more.
What Am I Up To?
[YOUTUBE]LlitCukeLG8[/YOUTUBE]
Source Files
Here are the original mixes in 44.1Khz/16-bit as well as the original Test #1 video in .mp4 format before Youtube got a hold of them. Download Source Files
Warnings
- I seem to have made the intro a bit quiet. It's 3 seconds long or something. DO NOT crank your studio to hear that because when the drums kicks in it blows you all to hell.
- You may need to rethink your views on "dynamics" a bit in regard to the Loudness War because of my initial warning.
- Sometimes the Youtube player doesn't load all the videos. I presume this is due to me having more videos than I'm supposed to on this page. If there is a big white box, hit refresh.
Question #1: What does the peak volume of the audio file being uploaded have an effect on what Youtube does to it?
Hunches: My guess is there is a sweet spot where Youtube won't do as much damage to an audio file in terms of peak level, but I suspect they will increase the volume of Test #3, the -12db peak file so that it's closer to Test #1 which is as full volume.
Youtube Reverse Audio Engineering Test #1
Audio mixed at -8dB RMS, -0.1dB peak, 320k AAC, uploaded at 1080p
[YOUTUBE]jyBS6E-gRIA[/YOUTUBE]
Youtube Reverse Audio Engineering Test #2
Audio mixed at -8dB RMS, -6dB peak, 320k AAC, uploaded at 1080p
[YOUTUBE]KNk0QeNiPq4[/YOUTUBE]
Youtube Reverse Audio Engineering Test #3
Audio mixed at -8dB RMS, -12dB peak, 320k AAC, uploaded at 1080p
[YOUTUBE]S-STAunb2ZM[/YOUTUBE]
Conclusion For Question #1
When I listen back in 1080p, it seems that Youtube really hasn't done much. I'm not listening on my monitors and probably could be listening more critically, but I don't really hear any artifacts in Test #1 that would show any compression, normalization, or processing of any kind.
Also, when listening at 1080p, it seems the volume levels have more-or-less been left in tact. I don't have any way to meter the way I would in Cubase 5, but Test #3 is definitely lower in level than Test #2 which is definitely lower in level than Test #1.
When listening at standard 360p, the audio is DEFINITELY been mp3 munched, but I'm not hearing any deviations from the source files.
Summary For Test #1
Youtube respects your decisions in level when it comes to music mixed at -8dB RMS no matter if it's -0.1dB, -6dB, or -12dB with files uploaded at 1080p.
Question #2: Do the conclusions from Test #1 hold up with files mixed @ -12dB RMS?
Youtube Reverse Audio Engineering Test #4
Audio mixed at -12dB RMS, -0.1dB peak, 320k AAC, uploaded at 1080p
[YOUTUBE]akQ674mqgxQ[/YOUTUBE]
Youtube Reverse Audio Engineering Test #5
Audio mixed at -12dB RMS, -6dB peak, 320k AAC, uploaded at 1080p
[YOUTUBE]6ziCGu6RvmY[/YOUTUBE]
Youtube Reverse Audio Engineering Test #6
Audio mixed at -12dB RMS, -12dB peak, 320k AAC, uploaded at 1080p
[YOUTUBE]c0_PIecfQQk[/YOUTUBE]
Conclusion For Question #2
Again, it appears that Youtube leaves the levels to us. It's not going to help us if the levels are low. That's GREAT for us audio guys....not so great for video guys who may not have as much control over their levels.
At the moment, I think it's safe to say that there is no reason to come up with any wild tricks to upload music. While we'll test it anyway, I can't imagine what benefit blending in a 20Khz sin wav would do when uploading a 1080p file.
Question #3: Does Youtube do more MP3 Munching to lower quality video uploads?
We are going back to Test #1 (-8dB RMS, -0.1 peak) @ 320k ACC, but I'm rendering the file at 720p and 320p keeping the audio at 320k AAC. This test is designed to see if people with SD cameras can still get good quality in the audio department.
Youtube Reverse Audio Engineering Test #7
Audio mixed at -8dB RMS, -0.1dB peak, 320k AAC, uploaded at 720p
[YOUTUBE]GmF52ahwOgM[/YOUTUBE]
Youtube Reverse Audio Engineering Test #8
Audio mixed at -8dB RMS, -0.1dB peak, 320k AAC, uploaded at 360p
[YOUTUBE]bm2BcQi0ADI[/YOUTUBE]
Conclusion For Question #3
First off, I want to point out that the file size did not decrease with the smaller resolutions. I'm not sure how that is possible. Either I goofed something up or the fact that I'm using vector graphics instead of real photorealistic images had a factor. So if you are doing this type of graphic work, go head and keep it at 1080p.
When listening/watching @ 360p, I don't really hear a difference between Test #1 and Test #3.
Summary: If you want to be in the high-quality audio end, you've got to upload @ 1080p. In order for your users to hear the high quality audio, they must be watching the 1080p version. 720p does offer significant audio improvement over 360p. I do believe I hear a difference between 1080p audio and 720p audio, but this is nowhere near as large of gap between 720p quality and 360p quality.
Question #4: If I'm stuck with a 360p video, is there any point in keeping the audio at 320k
For Test #9 I'm uploading a 360p video file the same as Test #8, but I've dropped the quality down to 128k. We'll see if it sounds any different than the 320k audio file version.
Youtube Reverse Audio Engineering Test #9
Audio mixed at -8dB RMS, -0.1dB peak, 128k AAC, uploaded at 360p
[YOUTUBE]w-eAxD3GomI[/YOUTUBE]
Conclusion For Test #4
I'm not hearing a significant difference between Test #8 and Test #9. So if you are uploading a 360p video, don't bother going any higher than 128k on the encode.
Question #5: Is there anything to gain from blending in a 20Khz sine wave with the audio mix?
Let me explain my methodology. I took the finished mix that you've heard a million times now and inserted it into a new stereo track in Cubase. I think created a new track with the Cubase signal generator thingy found in the Tools menu of the VST plugins. I set it to output a 20Khz sine wave at -0.1dB, matching the peak level of the mix. I then linked their faders and pulled them down until they peaked at -0.1dB on the stereo bus.
I'm not sure if this is the "correct" way of doing it, but the rumors have been quite vague.
Youtube Reverse Audio Engineering Test #10
Audio mixed at -8dB RMS, -0.1dB peak, 320k AAC, uploaded at 1080p, 50/50 blend of Mix and 20Khz Sine Wave
[YOUTUBE]3TCyxmo3UxU[/YOUTUBE]
Is it better to upload a 44.1Khz file or a 48Khz file?
Youtube Reverse Audio Engineering Test #12 (Don't ask what happened to test #11)

Audio mixed at -8dB RMS, -0.1dB peak, 320k AAC, uploaded at 1080p, rendered to 44.1Khz in video editing software.
[YOUTUBE]BaY1H9ak4BA[/YOUTUBE]
Many video editing programs are really in love with this 48K sample rate thing. I can't imagine a single good reason why, but what else is new. I do know that resampling audio is NEVER a good idea and the people that are tracking to high sample rates are almost always summing into analog before going into 44.1Khz or 88.2Khz.
Conclusion For Test #5
Oh, I don't know. On the computer speakers I can't tell a difference on this one, either. At 1080p it sounds about the same. I'm not sure if there are any repercussions on the video end for using a 44.1Khz sample rate (probably), but the sonics seem about the same here.
Youtube Audio Conclusions
- Youtube doesn't care about your peak volume
- Youtube doesn't care about your RMS volume
- Youtube doesn't do anything special when you blend in a 20Khz sine wave
- Youtube ONLY saves their best quality for the 1080p uploads with 720p videos coming in a close second.
- 360p videos are automatically reduced to something resembling 128k mp3s no matter what they were originally uploaded at.
Be Careful What Your Video Editing Software Is Doing
After further investigation, I now realize that the clipping and oddball compression I was hearing in some of my previous videos was done by my cheapo video editing software. This stuff resembled something you could purchase at Best Buy. (That's about like imagining your mother-in-law naked.) It came with my Kodaz Zi8 I was using there for a while.
It's highly recommended that you deal with your audio in an AUDIO program and your video in a VIDEO program. While I'm still a total idiot when it comes to video (and life in general) I've not seen the tools I feel are necessary to get my audio in tip top shape in any video editing application.




Menu
Category Navigation






Latest Blogs
Lastest Forum Threads

vBulletin Message