Shaw Talebi

AI

Fine-tuning Multimodal Embedding Models

Adapting CLIP to YouTube Data (with Python Code) Shaw Talebi · Follow Published in Towards Data Science · 9 min read · Just now — This is the 4th article in a larger series on multimodal AI. In the previous post, we discussed multimodal RAG systems, which can retrieve and synthesize information from different data modalities (e.g. text, images, audio). There, we saw how we could implement such a system using CLIP. One issue with

Read More »
AI

How I’d Learn AI in 2025 (If I Knew Nothing)

A 5-step roadmap for today’s landscape Shaw Talebi · Follow Published in Towards Data Science · 6 min read · 12 hours ago — Today, more people than ever are trying to learn AI. Although there are countless free learning resources online, navigating this rapidly evolving landscape can be overwhelming (especially as a beginner). In this article, I discuss how I’d approach learning AI, given what I know now and the tools available today. Image

Read More »
AI

Multimodal RAG: Process Any File Type with AI

A beginner-friendly guide with example (Python) code Shaw Talebi · Follow Published in Towards Data Science · 12 min read · 6 hours ago — This is the third article in a larger series on multimodal AI. In the previous posts, we discussed multimodal LLMs and embedding models, respectively. In this article, we will combine these ideas to enable the development of multimodal RAG systems. I’ll start by reviewing key concepts and then share example

Read More »
AI

Python QuickStart for People Learning AI

A beginner-friendly guide Shaw Talebi · Follow Published in Towards Data Science · 12 min read · 10 hours ago — Python has become the de facto programming language for AI and data science. Although no-code solutions exist, learning how to code is still essential to build fully custom AI projects or products. In this article, I share a beginner QuickStart guide to AI development with Python. I’ll cover the basics and then share a

Read More »
AI

Local LLM Fine-Tuning on Mac (M1 16GB)

Beginner-friendly Python code walkthrough (ft. MLX) Shaw Talebi · Follow Published in Towards Data Science · 8 min read · 11 hours ago — This article is part of a larger series on using large language models (LLMs) in practice. In a previous post, I showed how to fine-tune an LLM using a single (free) GPU on Google Colab. While that example (and many others) readily runs on Nvidia hardware, they are not easily adapted

Read More »
AI

The Skill That Holds Back (Most) Data Scientists

7 communication tips that made me a better data scientist Shaw Talebi · Follow Published in Towards Data Science · 7 min read · 10 hours ago — When starting in data science, I was hyper-focused on learning Python, Machine Learning, Statistics, etc. While these are necessary, there is another skill that most tend to overlook at their peril—communication. In this article, I discuss why data scientists must be effective communicators and share 7 tips

Read More »
AI

How to Build Data Pipelines for Machine Learning

A beginner-friendly introduction with Python code Shaw Talebi · Follow Published in Towards Data Science · 10 min read · 6 hours ago — This is the 3rd article in a larger series on Full Stack Data Science (FSDS). In the previous post, I introduced a 5-step project management framework for building machine learning (ML) solutions. While ML may bring to mind fancy algorithms and technologies, the quality of an ML solution is determined by

Read More »