For deep learning in particular, I will add Neel Nanda's interpretability work: https://www.neelnanda.io/mechanistic-interpretability
For deep learning in particular, I will add Neel Nanda's interpretability work: https://www.neelnanda.io/mechanistic-interpretability