Avoid These Easily Missed Mistakes in Machine Learning Workflows — Part 2

Using unavailable data at prediction time and mixing magic numbers with real numbers

Thomas A Dorfer

Published in

Towards Data Science

6 min read

16 hours ago

—

Image by the Author.

Welcome back to another edition in this series on easily missed mistakes in machine learning workflows! For those who haven’t read the first one, this is part of a series that focuses predominantly on procedural errors that may not always be very obvious but have a very high potential of deteriorating model performance if they do end up slipping into our development pipeline.

In the first article, we explored common pitfalls like misusing numerical identifiers, mishandling data splits, and overfitting the model to rare feature values.

Avoid These Easily Missed Mistakes in Machine Learning Workflows — Part 1

Misusing identifiers, incorrect data splits, and ignoring rare feature values

towardsdatascience.com

In this edition, we’ll continue to explore some errors related to data handling, specifically focusing on the following two topics:

Training with data not available at prediction time
Mixing magic numbers with real numbers

Neural Network (MLP) for Time Series Forecasting in Practice

A Practical Example for Feature Engineering and Constructing an MLP Model Daniel J. TOTH · Follow Published in Towards Data Science · 16 min read

July 17, 2024

What does the Transformer Architecture Tell Us?

Stephanie Shen · Follow Published in Towards Data Science · 14 min read · 10 hours ago — Image by narciso1 from Pixabay The stellar

July 25, 2024

Combining Storytelling and Design for Unforgettable Presentations

Image created with Dall·E by the author. How to craft slide decks that stand out Hennie de Harder · Follow Published in Towards Data Science

April 18, 2024

Supercharge Your Portfolio with Future Tech Stocks!

Join us for Profitable Insights & Expert Tips!

With expert analysis, comprehensive market coverage, and actionable insights, our newsletter equips you with the knowledge & tools necessary to make informed decisions & maximize your potential returns in the dynamic world of future tech stocks.