OpenAI ChatGPT GPT-4 Turbo Gets A Mid-Life Boost, Here’s What You Should Know

When OpenAI’s GPT-4 hit the internet, it was pretty much the best large language model (LLM) around. Many of OpenAI’s competitors have long since surpassed the original GPT-4 on various metrics, from Claude’s enormous context window to Gemini 1.5’s excellent performance with complex multi-modal datasets. Of course, OpenAI hasn’t been resting on its laurels this whole time. The company unveiled GPT-4 Turbo back in November, and now it has just announced an update to that model with some pretty significant changes.

In the most recent update, which has no fancy name, GPT4 Turbo is now “significantly smarter and more pleasant to use”, according to OpenAI founder Sam Altman. While he didn’t elaborate, it seems like Altman is primarily talking about changes to the model that have made its responses when being used as a chatbot “more direct, less verbose, and more conversational”, for which OpenAI provides the following example as proof:

Image: OpenAI

The updated model also scores higher on most common AI benchmarks, including the Graduate-Level Google-Proof Q&A Benchmark. That challenging dataset was designed to test the abilities of LLMs and comprises a 448-question multiple-choice test with questions spread across every scientific domain. The questions are designed by experts in the respective fields to judge not only how well LLMs can answer questions, but also how well they can be overseen by humans. This test is GPT-4’s weakest benchmark, and the new version improves its score on this test from approximately 35% to just under 50%, which is an excellent result on this difficult benchmark.

Other benchmarks that see gains include the reasoning-focused MATH test, the Multilingual Grade School Math (MGSM) benchmark, and the Discrete Reasoning Over Paragraphs (DROP) benchmark. DROP in particular is one of the most taxing AI benchmarks, and GPT-4 Turbo was already one of the best models in this test, but the new release improves its score on this difficult test to a bit over 80%, putting it in the exclusive category of models to reach such heights that includes, uh, itself. (The next best result is from Google’s Gemini 1.5 Turbo at 78.9%.)

openai developers tweet gpt4 turbo with vision

Along with the new model that updates GPT-4’s knowledge to April 2024, OpenAI also notes that GPT-4 Turbo with Vision—the model that integrates image analysis capabilities—is now generally available using its API. Vision requests can now also use JSON mode and function calling, making them considerably more versatile than before. In the Twitter thread linked above, developers have posted many impressive examples of apps created using this API. If you’re interested in getting started, head over to OpenAI’s website and check out the pricing for API requests to the updated GPT-4 Turbo.

igus to show affordable automation at Hannover Messe, Robotics Summit – The Robot Report

Listen to this article New offerings include the igusGO AI-driven app and more low-cost automation. Source: igus At its annual press conference last week, igus

April 23, 2024

GALAX GeForce RTX 4070 with GDDR6 reviewed, only 1% slower than GDDR6X

Our friends at Wccftech have posted a review of the GALAX GeForce RTX 4070 OC, one of the new RTX 4070 variants that sports 12GB

September 13, 2024

Steam’s Best of 2024 list highlights the biggest selling new PC games of the year

TL;DR: Valve’s Steam Best of 2024 list highlights top PC games by gross revenue and peak players, without revealing sales figures. Categories include Platinum, Gold,

December 30, 2024

Supercharge Your Portfolio with Future Tech Stocks!

Join us for Profitable Insights & Expert Tips!

With expert analysis, comprehensive market coverage, and actionable insights, our newsletter equips you with the knowledge & tools necessary to make informed decisions & maximize your potential returns in the dynamic world of future tech stocks.