*Dr. Tiziana Ligorio*

*Hunter College of The City University of New York*

🙏 Credits: These notes borrow from the following sources:

Transformers

Issues tackled (sequence processing bottlenecks)

Self-Attention (the basic idea)

[Figure: tokens, embeddings, and positional encoding (tokens_embeddings_posencoding.png)]