Dr. Tiziana Ligorio x Deep Learning, Hunter College of The City University of New York

🙏 Credits: These notes borrow from the following sources:

Transformers

Issues tackled (sequence processing bottlenecks)

Self-Attention (the basic idea)

tokens_embeddings_posencoding.png