Introduction to Reinforcement Learning and Solving the Multi-armed Bandit Problem

Dissecting “Reinforcement Learning” by Richard S. Sutton with Custom Python Implementations, Episode I

Oliver S

Published in Towards Data Science

13 hours ago

Reinforcement Learning (RL) is a fascinating subfield of Machine Learning. You might already know it from applications such as playing Go [1], autonomous driving [2], and more.

Equally fascinating, in my opinion, is Sutton and Barto’s famous book, “Reinforcement Learning” [3]. I think it is a great introduction to the topic that also dives deep and covers the important theoretical foundations of the field. It can be a lot to read, though, and on a first read it might look a bit math-heavy.

Image by Carl Raw on Unsplash

Thus, I decided to start a series of posts summarizing the book chapter by chapter. I believe that having the contents explained in different words greatly helps understanding. I will also implement all (or at least most) of the algorithms from the book in Python and apply them to problems and environments modeled with the Gymnasium framework (formerly OpenAI Gym) [4]. As far as I know, this combination is novel and is what makes this series unique.

This post is the first in the series, and will briefly introduce RL in general, then give a quick overview of how Sutton’s book is structured — and how…
