Carl Franzen

AR/VR

OpenAI’s new voice AI model gpt-4o-transcribe lets you add speech to your existing text apps in seconds

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI‘s voice AI models have gotten it into trouble before with actor Scarlett Johansson, but that isn’t stopping the company from continuing to advance its offerings in this category. Today, the ChatGPT maker has unveiled three new proprietary voice models: gpt-4o-transcribe, gpt-4o-mini-transcribe and gpt-4o-mini-tts. These models will initially be available through the ChatGPT maker’s application programming

Read More »
AR/VR

Baidu delivers new LLMs ERNIE 4.5 and ERNIE X1 undercutting DeepSeek, OpenAI on cost — but they’re not open source (yet)

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Over the weekend, Chinese web search giant Baidu announced the launch of two new AI models, ERNIE 4.5 and ERNIE X1, a multimodal language model and reasoning model, respectively. Baidu claims they offer state-of-the-art performance on a variety of metrics, besting DeepSeek’s non-reasoning V3 and OpenAI’s GPT-4.5 (how do you like the close name match Baidu

Read More »
AR/VR

GenLayer offers novel approach for AI agent transactions: getting multiple LLMs to vote on a suitable contract

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More We’ve been hearing a lot about AI agents — tools powered by generative AI models that can perform actions without much human supervision or intervention. But they still remain largely a novel curiosity for most people, and as far as we can tell, very few people are trusting AI agents to buy or enter contracts on

Read More »
AR/VR

What you need to know about Manus, the new AI agentic system from China hailed as a second ‘DeepSeek moment’

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Stop me if you’ve heard this one before: A little-known Chinese startup is making waves globally for an impressive new AI product. No, we’re not talking about DeepSeek-R1, the AI reasoning model that made waves among western AI circles earlier this year. Instead, the hot new product du jour is Manus, a new AI multipurpose agent

Read More »
AR/VR

Mistral releases new optical character recognition (OCR) API claiming top performance globally

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Well-funded French AI startup Mistral is content to go its own way. In a sea of competing reasoning models, the company has introduced Mistral OCR, a new optical character recognition (OCR) API designed to provide advanced document understanding capabilities. The API extracts content — including handwritten notes, typed text, images, tables and equations — from unstructured

Read More »
AR/VR

Alibaba’s new open source model QwQ-32B matches DeepSeek-R1 with way smaller compute requirements

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Qwen Team, a division of Chinese e-commerce giant Alibaba developing its growing family of open-source Qwen large language models (LLMs), has introduced QwQ-32B, a new 32-billion-parameter reasoning model designed to improve performance on complex problem-solving tasks through reinforcement learning (RL). The model is available as open-weight on Hugging Face and on ModelScope under an Apache 2.0

Read More »
AR/VR

OpenAI releases ‘largest, most knowledgable’ model GPT-4.5 with reduced hallucinations and high API price

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More It’s here: OpenAI has announced the release of GPT-4.5, a research preview of its latest and most powerful large language model (LLM) for chat applications. Unfortunately, it’s far-and-away OpenAI’s most expensive model (more on that below). It’s also not a “reasoning model,” or the new class of models offered by OpenAI, DeepSeek, Anthropic and many others

Read More »