Why Does Position-Based Chunking Lead to Poor Performance in RAGs?

How to implement semantic chunking and gain better results.

Neighbors could still be different.

Language models come with a context limit. For newer OpenAI models, this is around 128k tokens, roughly 80k English words. That may sound like plenty for most use cases. Still, large production-grade applications often need to reference more than 80k words, not to mention images, tables, and other unstructured information.
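If you want to check that token-to-word ratio on your own text, here is a minimal sketch using OpenAI's tiktoken tokenizer. The library choice and the cl100k_base encoding are assumptions for illustration, not something the argument depends on:

```python
# Rough token counting with tiktoken (pip install tiktoken).
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")  # encoding used by recent OpenAI models

text = "Language models come with a context limit."
tokens = enc.encode(text)

print(len(text.split()), "words ->", len(tokens), "tokens")
# English prose averages roughly 0.75 words per token, which is where
# the ~80k-word estimate for a 128k-token window comes from.
```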

Even when everything fits inside the context window, padding it with irrelevant information makes LLM performance drop significantly.

This is where RAG helps. RAG retrieves the relevant information from an embedded source and passes it as context to the LLM. To retrieve that 'relevant information,' we must first divide the documents into chunks. Thus, chunking plays a vital role in a RAG pipeline.
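To make that retrieval step concrete, here is a minimal sketch of the flow. The `embed()` function is a hypothetical placeholder; a real pipeline would call an embedding model and use a vector store instead of a plain list:

```python
import numpy as np

def embed(text: str) -> np.ndarray:
    """Hypothetical embedding function: a real pipeline would call an
    embedding model (OpenAI, sentence-transformers, etc.)."""
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    return rng.normal(size=384)  # placeholder vector, for illustration only

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# 1. Chunk the documents and embed each chunk ahead of time.
chunks = ["Chunk one text...", "Chunk two text...", "Chunk three text..."]
chunk_vectors = [embed(c) for c in chunks]

# 2. At query time, embed the question and retrieve the most similar chunks.
query = "What does the document say about refunds?"
query_vector = embed(query)
scores = [cosine(query_vector, v) for v in chunk_vectors]
top_chunks = [chunks[i] for i in np.argsort(scores)[::-1][:2]]

# 3. Pass the retrieved chunks to the LLM as context.
prompt = (
    "Answer using only this context:\n"
    + "\n".join(top_chunks)
    + f"\n\nQuestion: {query}"
)
print(prompt)
```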

Chunking lets the RAG system retrieve specific pieces of a large document. However, small changes in the chunking strategy can significantly affect the responses the LLM generates.
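The 'position-based' chunking in the title is the naive version of this step: cutting the text every N characters (or tokens) regardless of meaning. A short sketch, with example sizes chosen only to keep the output readable, shows how easily it splits a sentence across two chunks, so neighboring chunks end up carrying different, incomplete ideas:

```python
def position_based_chunks(text: str, chunk_size: int = 60, overlap: int = 10) -> list[str]:
    """Split text into fixed-size character windows, ignoring meaning."""
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

doc = (
    "Our refund policy lasts 30 days. After 30 days, we cannot offer "
    "a refund or exchange. Gift cards are non-refundable."
)

for i, chunk in enumerate(position_based_chunks(doc)):
    print(f"chunk {i}: {chunk!r}")
# The second sentence is cut mid-thought, so a retrieved chunk may start
# or end in the middle of an idea and lose the detail the query needs.
```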