Aveek Goswami

AI

Understanding Transformers

A straightforward breakdown of “Attention is All You Need”¹ Aveek Goswami · Follow Published in Towards Data Science · 10 min read · 9 hours ago — The transformer came out in 2017. There have been many, many articles explaining how it works, but I often find them either going too deep into the math or too shallow on the details. I end up spending as much time googling (or chatGPT-ing) as I do reading,

Read More »