LSTMs & Transformers
Slides from the third lecture, introducing attention and transformers, can be found here. There are also some excellent educational materials and talks available; references to some of the original papers and some suggested resources are listed below.
Transformers: Part 2
Slides from the second lecture on transformers can be found here. The slides on the mathematical formalization of transformers are available here.
Additional materials
If you want to dig a bit deeper, here are some (optional!) supplementary readings, a general textbook on deep learning, and the paper that first introduced backpropagation for training neural networks:
Supplementary materials for the second transformers session: