Microsoft lifts the lid on its new AI chip, Maia 100, up to 700W TDP, built for large-scale AI

Microsoft is finally ready to enter the custom AI hardware race, a chip market in which NVIDIA has at least a 75 percent market share. At this year’s Hot Chips conference, the company unveiled its first AI accelerator, Maia 100, built on TSMC’s 5nm process node.

VIEW GALLERY – 3 IMAGES

Designed to “optimize performance and reduce costs,” Maia 100’s architecture includes custom server boards, racks, and software for running AI services like Microsoft’s Azure OpenAI Services.

“The Maia 100 accelerator is purpose-built for a wide range of cloud-based AI workloads,” Microsoft’s technical blog on Maia 100 details. “The chip measures out at ~820mm2 and utilizes TSMC’s N5 process with COWOS-S interposer technology. Equipped with large on-die SRAM, Maia 100’s reticle-size SoC die, combined with four HBM2E die, provide a total of 1.8 terabytes per second of bandwidth and 64 gigabytes of capacity to accommodate AI-scale data handling requirements.”

Provisioned at 500W, the AI accelerator is also designed to support up to 700W, which is efficient for high-performance AI workloads. Maia 100’s architecture includes a high-speed 16xRx16 tensor unit for training and inference, with a vector processor supporting a wide range of data types, including FP32 and BF16.

Microsoft lifts the lid on its new AI chip, Maia 100, up to 700W TDP, built for large-scale AI 3

The 64GB of HBM2E memory might look like a step down from the 80GB found in NVIDIA’s H100 chip, not to mention the 192GB of HBM3E coming with B200, but Maia 100 is squarely focused on cost and efficiency for its AI processing capabilities. On the software front, the Maia SDK allows developers to port models previously created in Pytorch and Triton and offers scheduling and device management tools.

“With its advanced architecture, comprehensive developer tools, and seamless integration with Azure, the Maia 100 is revolutionizing the way Microsoft manages and executes AI workloads,” Microsoft says. “Through the algorithmic co-design or hardware with software, built-in hardware optionality for both model developers and custom kernel authors, and a vertically integrated design to optimize performance and improve power efficiency while reducing costs, Maia 100 offers a new option for running advanced cloud-based AI workloads on Microsoft’s AI infrastructure.”

For a detailed look at Maia 100’s architecture, check out Microsoft’s technical ‘Inside Maia 100‘ post.

Cohere’s family of Command R models arrives in Amazon Bedrock

Cohere makes its powerful Command R and Command R+ large language models available on Amazon Bedrock to win over the enterprise from OpenAI.Read More

April 29, 2024

A Comprehensive Analysis of Startup Predictive Models | HackerNoon

Table of Links Abstract and 1. Introduction 2 Related works 3 Dataset Overview, Preprocessing, and Features 3.1 Successful Companies Dataset and 3.2 Unsuccessful Companies Dataset

August 8, 2024

How to EMP-Proof a Building

2 min read Emily Waltz is the power and energy editor at IEEE Spectrum. Yilu Liu is one of the researchers at Oak Ridge National

May 25, 2024

Supercharge Your Portfolio with Future Tech Stocks!

Join us for Profitable Insights & Expert Tips!

With expert analysis, comprehensive market coverage, and actionable insights, our newsletter equips you with the knowledge & tools necessary to make informed decisions & maximize your potential returns in the dynamic world of future tech stocks.