Tarik Dzekman

AI

Exploring the AI Alignment Problem with GridWorlds

It’s difficult to build capable AI agents without encountering orthogonal goals Tarik Dzekman · Follow Published in Towards Data Science · 18 min read · 13 hours ago — Design of a “Gridworld” which is hard for an AI agent to learn without encouraging bad behaviour. Image by the Author. This is the essence of the AI alignment problem: An advanced AI model with powerful capabilities may have goals not aligned with our best interests.

Read More »
AI

How I Deal with Hallucinations at an AI Startup

And the difference between weak vs strong grounding Tarik Dzekman · Follow Published in Towards Data Science · 6 min read · 13 hours ago — Image by the author I work as an AI Engineer in a particular niche: document automation and information extraction. In my industry using Large Language Models has presented a number of challenges when it comes to hallucinations. Imagine an AI misreading an invoice amount as $100,000 instead of $1,000,

Read More »
AI

What Do Large Language Models “Understand”?

A deep dive on the meaning of understanding and how it applies to Large Language Models Tarik Dzekman · Follow Published in Towards Data Science · 24 min read · 2 hours ago — Source: Image by the author with elements generated with Stable Diffusion It’s hard to believe that ChatGPT is almost 2 years old. That’s significant to me because ChatGPT is only 1 month younger than my daughter. Just yesterday she successfully put

Read More »