Using Vector Steering to Improve Model Guidance

Exploring the research on vector steering and coding up an implementation

Image by Author — Flux.1

Large language models are complex and do not always give perfect answers. To remedy this, people try many different techniques to guide the model’s output. We’ve seen pre-training on larger datasets, pre-training models with more parameters, and using a vector database (or some other form of lookup) to add relevant context to the LLM’s input. All of these yield some improvement, but no method today is foolproof.

One interesting way to guide the model is vector steering. A memorable example is the Claude Golden Gate Bridge experiment: no matter what the user asks, Claude finds some clever way to bring up its favorite topic, the Golden Gate Bridge.

Image from “Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet” showing Claude 3 Sonnet’s behavior change with a steering vector
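
To make the idea concrete before we get to the theory, here is a minimal sketch of what vector steering can look like in code. It assumes a Hugging Face GPT-2 model and uses a random placeholder vector; the layer index, steering strength, and the vector itself are illustrative assumptions, not the implementation we walk through later.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

model = GPT2LMHeadModel.from_pretrained("gpt2")
tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model.eval()

# Hypothetical choices for illustration: which block to steer and how hard.
layer_idx = 6
steering_strength = 5.0

# Placeholder steering vector; a real one would encode a concept direction
# (e.g. extracted from contrastive prompts or interpretable features).
steering_vector = torch.randn(model.config.n_embd)
steering_vector = steering_vector / steering_vector.norm()

def steering_hook(module, inputs, output):
    # GPT-2 blocks return a tuple whose first element is the hidden states,
    # shaped (batch, seq_len, hidden_size). Nudge every position along the
    # steering direction and pass the rest of the tuple through unchanged.
    hidden_states = output[0] + steering_strength * steering_vector
    return (hidden_states,) + output[1:]

handle = model.transformer.h[layer_idx].register_forward_hook(steering_hook)

prompt = tokenizer("The best way to spend a weekend is", return_tensors="pt")
with torch.no_grad():
    out = model.generate(**prompt, max_new_tokens=30, do_sample=False)
print(tokenizer.decode(out[0], skip_special_tokens=True))

handle.remove()  # detach the hook so later generations run unsteered
```

That single idea, pick a direction in activation space and add it during the forward pass, is what the rest of this post builds on.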

Today I’ll be going through the research on this topic and explaining Anastasia Borovykh’s excellent code implementation. If you’re interested in learning more, I highly recommend checking out her video.

Let’s dive in!

Theory