Stephanie Shen

AI

What does the Transformer Architecture Tell Us?

Stephanie Shen · Follow Published in Towards Data Science · 14 min read · 10 hours ago — Image by narciso1 from Pixabay The stellar performance of large language models (LLMs) such as ChatGPT has shocked the world. The breakthrough was made by the invention of the Transformer architecture, which is surprisingly simple and scalable. It is still built of deep learning neural networks. The main addition is the so-called “attention” mechanism that contextualizes each

Read More »
AI

Deep Reinforcement Learning: Toward Integrated and Unified AI

Can AI provide a lens on human intelligence? Stephanie Shen · Follow Published in Towards Data Science · 15 min read · 22 hours ago — Photo by Elena Popova on Unsplash Most artificial intelligence(AI) models today, including convolution neural networks (CNN) and large language models (LLM), are built for specific tasks and require humans’ curation of vast training data. They lack the capability to interact with the world or to learn continuously…

Read More »
AI

What Does Evolution Tell Us about Human Intelligence?

A comparative analysis of AI with the biological brain Stephanie Shen · Follow Published in Towards Data Science · 17 min read · 2 hours ago — Photo by NASA on Unsplash The human brain is a product of millions of years of evolution. Humans share most of their genes with mammals: 98 to 99 percent with chimpanzees and roughly 90 percent with mice, dogs, and cats. The degree of gene overlapping indicates how closely

Read More »