Outlier Detection using Random Forest Regressors: Leveraging Algorithm Strengths to your Advantage

Using a model’s robustness to outliers to detect them

Michael Zakhary

Published in

Towards Data Science

8 min read

Sep 28, 2023

—

Photo by Will Myers on Unsplash

Problem Statement

The problem of outlier detection can be tricky, especially if the ground truth or the description of what is an outlier is ambiguous or based upon multiple factors. Mathematically speaking, an outlier can be defined as data points more than three standard deviations away from a mean. However, in most real-life problems, not all data points away from a mean are of the same significance, sometimes we require a bit more nuance when flagging outliers.

Image by Rohanukhade

Let’s take a quick example:

We have a dataset of water consumption per household. By analyzing the water consumption as a whole and isolating points 3 standard deviations from the mean, we can quickly get the outliers that use the most water.

This however fails to take into account the reason behind the increase in consumption, i.e. there could be multiple reasons why the water consumption is high, some reasons are of more interest…

Noctua’s Flagship NH-D15 G2 CPU Air Cooler Arrives Just In Time For Ryzen 9000

If you’re reading this site, Noctua almost assuredly needs no introduction to you. It’s rare that Austria’s premiere purveyor of puissant cooling apparatus actually launches

July 2, 2024

Hands-On Building a Virtual Property Consultant Using Artificial Intelligence

This is how I used real estate data and powered them using OpenAI Large Language Model GPT3 Piero Paialunga · Follow Published in Towards Data

April 6, 2024

The end of AI scaling may not be nigh: Here’s what’s next

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More As AI systems achieve superhuman performance

December 1, 2024

Supercharge Your Portfolio with Future Tech Stocks!

Join us for Profitable Insights & Expert Tips!

With expert analysis, comprehensive market coverage, and actionable insights, our newsletter equips you with the knowledge & tools necessary to make informed decisions & maximize your potential returns in the dynamic world of future tech stocks.