Yu Dong

AI

5 Essential Tips to Build Business Dashboards Stakeholders Love

A practical guide to designing clear, effective, and actionable dashboards for decision-making Yu Dong · Follow Published in Towards Data Science · 7 min read · 10 hours ago — Working in data science, dashboarding often feels like an unfavored but unavoidable work. Why is it unfavored? Dashboarding is less technical (less fancy) than analysis and modeling, and more repetitive. But why is it also unavoidable? It is the first and must-have step to understand

Read More »
AI

From Insights to Impact: Presentation Skills Every Data Scientist Needs

How to structure, design, and deliver data presentations that win over stakeholders Yu Dong · Follow Published in Towards Data Science · 7 min read · 3 hours ago — Being a data scientist today is more than just a technical role. It has evolved into a highly cross-functional job, as you need to explain your data insights and sell your ideas to your stakeholders to drive real business impacts. Therefore, to be a successful

Read More »
AI

Top 5 Principles for Building User-Friendly Data Tables

Designing intuitive and reliable tables that your data team will love Yu Dong · Follow Published in Towards Data Science · 7 min read · 16 hours ago — Working in data science and analytics for seven years, I have created and queried many tables. There are numerous times I wonder, “What does this column mean?” “Why are there two columns with the same name in table A and table B? Which one should I

Read More »
AI

Seven Common Causes of Data Leakage in Machine Learning

Key Steps in data preprocessing, feature engineering, and train-test splitting to prevent data leakage Yu Dong · Follow Published in Towards Data Science · 7 min read · 14 hours ago — When I was evaluating AI tools like ChatGPT, Claude, and Gemini for machine learning use cases in my last article, I encountered a critical pitfall: data leakage in machine learning. These AI models created new features using the entire dataset before splitting it

Read More »
AI

ChatGPT vs. Claude vs. Gemini for Data Analysis (Part 1)

Ten questions to test which AI assistant writes the best SQL Yu Dong · Follow Published in Towards Data Science · 18 min read · 10 hours ago — Table of Contents · Context· Let’s Compare Their SQL Skills!· Round 1: Problem Solving (LeetCode SQL)· Round 2: Business Logic· Round 3: Query Optimization· Summary· What’s Next Context Welcome to the first installment of my new series, ChatGPT vs. Claude vs. Gemini for Data Analysis. Throughout

Read More »
AI

Navigating Data Science: B2C vs. B2B Analytics

How customer types shape data science roles and methodologies Yu Dong · Follow Published in Towards Data Science · 10 min read · 16 hours ago — Context When considering a new company or a job offer, we often think about industry, company vision, growth opportunities, culture, etc. Today, I want to introduce another perspective: whether the business is B2B (Business-to-Business) or B2C (Business-to-Consumer). This distinction has a surprisingly large impact on data science roles.

Read More »
AI

Evaluating ChatGPT’s Data Analysis Improvements: Interactive Tables and Charts

Is ChatGPT becoming a BI tool? Yu Dong · Follow Published in Towards Data Science · 9 min read · 9 hours ago — In May 2024, alongside the exciting release of the GPT-4o, OpenAI announced its improvements to data analysis in ChatGPT, featuring interactive tables and charts, and integration with Google Drive and Microsoft OneDrive. In this article, I will evaluate these new features and envision the future of data analysis with ChatGPT. Photo

Read More »
AI

Building a Standout Data Science Portfolio: A Comprehensive Guide

Learn how to create an impactful data science portfolio that showcases your skills and attracts potential employers Yu Dong · Follow Published in Towards Data Science · 9 min read · 2 hours ago — Context I started my data science portfolio website in 2018 when I was fresh out of school. Unsurprisingly, I set it up, hoping it could help my job search and career development. Six years later, I’m proud of the progress

Read More »
AI

Mastering SQL Optimization: From Functional to Efficient Queries

Six Simple Yet Effective SQL Tips That Helped Me Reduce 50 Hours of Snowflake Query Time Every Day Yu Dong · Follow Published in Towards Data Science · 9 min read · 11 hours ago — SQL is probably the most fundamental technical skill every data analyst and data scientist should master. It’s usually part of the interview process, and we spend significant time coding in SQL at work to collect data. Without it, there

Read More »
AI

330 Weeks of Data Visualizations: My Journey and Key Takeaways

How consistent practice in data visualization enhanced my data science skills Yu Dong · Follow Published in Towards Data Science · 7 min read · 10 hours ago — I have been making one visualization weekly since I started my full-time data science job in 2018. Now, over 330 weeks later, I consider this an achievement I’m truly proud of. During coffee chats, people often ask me about it, which inspired me to write this

Read More »