Intelligence.Log
2022-10-04
Extracted: 1 items. Sources: YouTube.
grep TOPIC=
YT
We dive into some of the internals of MLPs with multiple layers and scrutinize the statistics of the forward pass activations, backward pass gradients, and some of the pitfalls when they are improperly…
“This video examines the statistical challenges in training deep neural networks, focusing on how improperly scaled activations and gradients can cause instability. It introduces Batch Normalization as a key technique to stabilize training by normalizing layer inputs.”
-- END OF LOG --
[STATS] 1 items · Filter applied
Powered by Horizon + DeepSeek