Recommending Gwern on any technical topic is practically cheating; he always has...

Recommending Gwern on any technical topic is practically cheating; he always has in-depth, impeccably referenced overviews, complete with experiments he has done.

For deep learning in particular, I will add Neel Nanda's interpretability work: https://www.neelnanda.io/mechanistic-interpretability