Introduction to Reinforcement Learning and Solving the Multi-armed Bandit Problem

Dissecting “Reinforcement Learning” by Richard S. Sutton with Custom Python Implementations, Episode I

Oliver S

Published in

Towards Data Science

11 min read

13 hours ago

—

Reinforcement Learning (RL) is a fascinating subfield of Machine Learning. You might already know it from applications such as playing Go [1], autonomous driving [2], and more.

Equally fascinating in my opinion is Sutton’s and Barto’s famous book, “Reinforcement Learning” [3]. I think it’s a great introduction to the topic, but also dives deep and introduces all important theoretical topics of the field. It can be a lot to read though, and especially upon the first read might look a bit mathy.

Image by Carl Raw on Unsplash

Thus, I decided to start a post series summarizing the book chapter by chapter. I believe getting the contents explained with different words will greatly help understanding. And I will also implement all (most) algorithms from the book in Python and apply them to problems and environments modeled via (formerly) OpenAI’s gymnasium framework [4]. These two points are, as far as I know, novel so far and make this series unique.

This post is the first in the series, and will briefly introduce RL in general, then give a quick overview of how Sutton’s book is structured — and how…

The Noonification: Python FIFO Buffer Class for Audio – an Algorithm (6/28/2024) | HackerNoon

How are you, hacker? 🪐What’s happening in tech this week: The Noonification by HackerNoon has got you covered with fresh content from our top 5

June 28, 2024

Test-driving Google’s Gemini-Exp-1206 model: Competitive data analysis and sophisticated visualizations in under a minute

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More One of Google’s latest experimental models,

December 27, 2024

How I Studied LLMs in Two Weeks: A Comprehensive Roadmap

Image created by Midjourney. A day-by-day detailed LLM roadmap from beginner to advanced, plus some study tips Hesam Sheikh · Follow Published in Towards Data

October 18, 2024

Supercharge Your Portfolio with Future Tech Stocks!

Join us for Profitable Insights & Expert Tips!

With expert analysis, comprehensive market coverage, and actionable insights, our newsletter equips you with the knowledge & tools necessary to make informed decisions & maximize your potential returns in the dynamic world of future tech stocks.