Valerie Carey

AI

Data Disruptions to Elevate Entity Embeddings

Injecting random values during neural network training can help you get more from your categoricals

Valerie Carey · Published in Towards Data Science · 11 min read · 8 hours ago

Today I will discuss a stochastic regularization method to improve generalizability of entity embeddings in neural network models. I use a data generator to randomly inject selected input values into data during training…

AI

No Label Left Behind: Alternative Encodings for Hierarchical Categoricals

Seeking a system that works for current and future codes

Valerie Carey · Published in Towards Data Science · 15 min read · 20 hours ago

In my work as a data scientist, I see a lot of labels. Data contains zip code labels, gender labels, medical diagnosis labels, job title labels, stock ticker labels, you name it. Labels may be simple (shirt sizes of S, M, L)

AI

Exploring Hierarchical Blending in Target Encoding

When can code hierarchies improve target encoding for high-cardinality categorical features?

Valerie Carey · Published in Towards Data Science · 12 min read · 3 hours ago

What neighborhood do you live in? What drug were you prescribed? Why did you cancel your streaming subscription? These days, there’s a code for that, stored in databases by whatever government agencies, businesses, etc. you…
