Going into Google DeepMind’s “Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters”
OpenAI recently unveiled its newest model, o1. Rather than highlight the model’s parameter count, OpenAI showcased that it performs significantly better because it takes more time to answer. When you ask the model a question, it will often take multiple seconds to respond, a far cry from the millisecond speed most people now expect from Large Language Models (LLMs). Nevertheless, this extra time appears to pay off, as o1 scores substantially higher than other models on the LMSYS Chatbot Arena.
Given this leap in performance, the question everyone is asking is: how did they do it?
While OpenAI has not publicly stated how it achieved these results, a few recent papers are good candidates for what is happening behind the scenes. One such paper is “Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters”. This paper goes into how you can leverage…