Carl Franzen, Author at Future Tech Stocks

AR/VR

OpenAI’s new voice AI model gpt-4o-transcribe lets you add speech to your existing text apps in seconds

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI‘s voice AI models have gotten it into trouble before with actor Scarlett Johansson, but that isn’t stopping the company from continuing to advance its offerings in this category. Today, the ChatGPT maker has unveiled three new proprietary voice models: gpt-4o-transcribe, gpt-4o-mini-transcribe and gpt-4o-mini-tts. These models will initially be available through the ChatGPT maker’s application programming

Carl Franzen March 20, 2025

AR/VR

Baidu delivers new LLMs ERNIE 4.5 and ERNIE X1 undercutting DeepSeek, OpenAI on cost — but they’re not open source (yet)

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Over the weekend, Chinese web search giant Baidu announced the launch of two new AI models, ERNIE 4.5 and ERNIE X1, a multimodal language model and reasoning model, respectively. Baidu claims they offer state-of-the-art performance on a variety of metrics, besting DeepSeek’s non-reasoning V3 and OpenAI’s GPT-4.5 (how do you like the close name match Baidu

Carl Franzen March 17, 2025

Google’s native multimodal AI image generation in Gemini 2.0 Flash impresses with fast edits, style transfers

It enables developers to create illustrations, refine images through conversation, and generate detailed visualsRead More

Carl Franzen March 12, 2025

OpenAI unveils Responses API, open source Agents SDK, letting developers build their own Deep Research and Operator

OpenAI is also making its web search, file search and computer use tools available directly through the responses API.Read More

Carl Franzen March 11, 2025

AR/VR

GenLayer offers novel approach for AI agent transactions: getting multiple LLMs to vote on a suitable contract

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More We’ve been hearing a lot about AI agents — tools powered by generative AI models that can perform actions without much human supervision or intervention. But they still remain largely a novel curiosity for most people, and as far as we can tell, very few people are trusting AI agents to buy or enter contracts on

Carl Franzen March 10, 2025

AR/VR

What you need to know about Manus, the new AI agentic system from China hailed as a second ‘DeepSeek moment’

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Stop me if you’ve heard this one before: A little-known Chinese startup is making waves globally for an impressive new AI product. No, we’re not talking about DeepSeek-R1, the AI reasoning model that made waves among western AI circles earlier this year. Instead, the hot new product du jour is Manus, a new AI multipurpose agent

Carl Franzen March 10, 2025

AR/VR

Mistral releases new optical character recognition (OCR) API claiming top performance globally

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Well-funded French AI startup Mistral is content to go its own way. In a sea of competing reasoning models, the company has introduced Mistral OCR, a new optical character recognition (OCR) API designed to provide advanced document understanding capabilities. The API extracts content — including handwritten notes, typed text, images, tables and equations — from unstructured

Carl Franzen March 6, 2025

AR/VR

Alibaba’s new open source model QwQ-32B matches DeepSeek-R1 with way smaller compute requirements

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Qwen Team, a division of Chinese e-commerce giant Alibaba developing its growing family of open-source Qwen large language models (LLMs), has introduced QwQ-32B, a new 32-billion-parameter reasoning model designed to improve performance on complex problem-solving tasks through reinforcement learning (RL). The model is available as open-weight on Hugging Face and on ModelScope under an Apache 2.0

Carl Franzen March 5, 2025

Google launches free Gemini-powered Data Science Agent on its Colab Python platform

With Google data science agent, one scientist estimated that their data processing time dropped from 1 week to five minutes.Read More

Carl Franzen March 3, 2025

AR/VR

OpenAI releases ‘largest, most knowledgable’ model GPT-4.5 with reduced hallucinations and high API price

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More It’s here: OpenAI has announced the release of GPT-4.5, a research preview of its latest and most powerful large language model (LLM) for chat applications. Unfortunately, it’s far-and-away OpenAI’s most expensive model (more on that below). It’s also not a “reasoning model,” or the new class of models offered by OpenAI, DeepSeek, Anthropic and many others

Carl Franzen February 27, 2025

Carl Franzen

OpenAI’s new voice AI model gpt-4o-transcribe lets you add speech to your existing text apps in seconds

Baidu delivers new LLMs ERNIE 4.5 and ERNIE X1 undercutting DeepSeek, OpenAI on cost — but they’re not open source (yet)

Google’s native multimodal AI image generation in Gemini 2.0 Flash impresses with fast edits, style transfers

OpenAI unveils Responses API, open source Agents SDK, letting developers build their own Deep Research and Operator

GenLayer offers novel approach for AI agent transactions: getting multiple LLMs to vote on a suitable contract

What you need to know about Manus, the new AI agentic system from China hailed as a second ‘DeepSeek moment’

Mistral releases new optical character recognition (OCR) API claiming top performance globally

Alibaba’s new open source model QwQ-32B matches DeepSeek-R1 with way smaller compute requirements

Google launches free Gemini-powered Data Science Agent on its Colab Python platform

OpenAI releases ‘largest, most knowledgable’ model GPT-4.5 with reduced hallucinations and high API price

Supercharge Your Portfolio with Future Tech Stocks!

Join us for Profitable Insights & Expert Tips!

Carl Franzen

Subscribe