Building an advanced local LLM RAG pipeline with hypothetical document embeddings
Large Language Models (LLMs) can be improved by giving them access to external knowledge through documents.
The basic Retrieval-Augmented Generation (RAG) pipeline consists of four parts: a user query; an embedding model that converts text into embeddings (high-dimensional numerical vectors); a retrieval step that searches the embedding space for documents similar to the user query; and a generator LLM that uses the retrieved documents to produce an answer [1].
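To make the pipeline concrete, here is a minimal sketch of these four steps in Python. The sentence-transformers package and the all-MiniLM-L6-v2 model are example choices, not the only option, and the final generation call is left as a placeholder for whatever local LLM you run:

```python
# Minimal RAG sketch: embed a corpus, retrieve the nearest document, build a prompt.
import numpy as np
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # example embedding model

corpus = [
    "HyDE generates a hypothetical answer document and embeds it for retrieval.",
    "A vector index stores document embeddings for similarity search.",
]
# With normalized embeddings, cosine similarity reduces to a plain dot product.
doc_vecs = embedder.encode(corpus, normalize_embeddings=True)

query = "what is hyde"
q_vec = embedder.encode([query], normalize_embeddings=True)[0]

best_doc = corpus[int(np.argmax(doc_vecs @ q_vec))]

# The retrieved document grounds the generator LLM's answer.
prompt = f"Context:\n{best_doc}\n\nQuestion: {query}\nAnswer:"
print(prompt)  # feed this prompt to your local generator LLM
```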
In practice, the retrieval step is crucial. If the retriever does not find the correct document in the document corpus, the LLM has no chance of generating a solid answer.
One problem in the retrieval step is a mismatch between query and document: the user query is often a very short question with imperfect grammar, spelling, and punctuation, while the matching document is a long passage of well-written text that contains the information we want. Even though the two are semantically related, their embeddings can end up far apart, and the retriever misses the document.
HyDE (Hypothetical Document Embeddings) is a proposed technique to improve the RAG retrieval step by converting the user question into a hypothetical document that answers it. That hypothetical document is then embedded and used for the similarity search instead of the short, noisy question, so the retriever compares documents to a document rather than to a question.
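Here is a minimal sketch of the HyDE retrieval step under the same assumptions as above; the generate callable stands in for any local LLM, and its prompt is an illustrative choice:

```python
# HyDE sketch: embed an LLM-written hypothetical answer instead of the raw query.
from typing import Callable

import numpy as np
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # example embedding model

def hyde_retrieve(question: str, corpus: list[str],
                  generate: Callable[[str], str]) -> str:
    # Step 1: ask the LLM to write a passage that plausibly answers the question.
    hypothetical_doc = generate(
        f"Write a short passage that answers this question:\n{question}"
    )
    # Step 2: embed the hypothetical document, not the short, noisy question.
    h_vec = embedder.encode([hypothetical_doc], normalize_embeddings=True)[0]
    doc_vecs = embedder.encode(corpus, normalize_embeddings=True)
    # Step 3: nearest neighbour in embedding space (dot product on unit vectors).
    return corpus[int(np.argmax(doc_vecs @ h_vec))]
```

Note that the hypothetical document may contain factual errors; for retrieval this is acceptable, because only its position in the embedding space matters, not its correctness.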