Microsoft built a 750W AI chip to challenge Nvidia's dominance, claims 3x performance gains over Amazon

Alfonso Maruccia · Jan 27, 2026

Editor's take: Nvidia became the world's most overvalued company as Big Tech players scrambled to buy every GPU and AI accelerator they could get. Many of these corporations are now turning their attention to developing their own accelerators, with Microsoft reportedly leading the pack in efficiency and performance.

Microsoft recently announced Maia 200, a new AI accelerator specifically designed for inference workloads. According to Redmond, Maia 200 can deliver "dramatic" improvements for AI applications and is already deployed in select US data centers on the Azure platform.

The company highlighted the chip's impressive specifications: Maia 200 is built on TSMC's 3nm process, features native FP8/FP4 tensor cores, a new memory system with 216 GB of HBM3e VRAM, and a massive 272 MB on-chip SRAM cache. Microsoft claims that Maia 200 offers the highest performance among all custom silicon designs currently used by other hyperscalers.

The chip is said to be up to three times more powerful than Amazon's third-generation Trainium at 4-bit precision (FP4) and surpasses Google's seventh-generation TPU at 8-bit precision (FP8). It is also more efficient, delivering 30 percent better performance per dollar compared with Microsoft's previous accelerator, Maia 100.

Maia 200 is currently deployed in Microsoft's US data center region in Iowa, with additional regions expected to come online soon. The chip integrates seamlessly with the Azure cloud platform and is also being used to generate "synthetic" data for training next-generation AI models.

Microsoft's overview of compute/memory specs across Maia 200, Amazon Trainium3, and Google's TPU v7

Concerned about the potential feedback-loop effects, major corporations are exploring alternative data streams as they anticipate that human-generated content will eventually be fully consumed by large language models and other machine learning tools.

Microsoft confirms that Maia 200 is a massive chip, with over 140 billion transistors contained within a 750 W TDP envelope. Performance is rated at over 10 petaFLOPS at FP4 and over five petaFLOPS at FP8. The SoC is capable of running today's most powerful AI models and has been designed to support even larger models in the future.

The chip also features a new network design for moving vast amounts of data. Based on standard Ethernet technology, the solution includes a custom transport layer and an integrated NIC for improved performance and reliability. In practical terms, the network interface in each Maia 200 SoC can reach 2.8 TB/s of bidirectional bandwidth.

Finally, Microsoft is inviting developers and AI startups to sign up for the official Maia 200 software development kit once it becomes available. The SDK includes a compiler, PyTorch support, low-level programming tools, a Maia simulator, and more.

Permalink to story:

Microsoft built a 750W AI chip to challenge Nvidia's dominance, claims 3x performance gains over Amazon

toooooot · Jan 27, 2026

750w, insane... Is it even practical?

NotYourITguy · Jan 27, 2026

toooooot said:
750w, insane... Is it even practical?

I don't know if you realize this, but that's less than the 1000W high tuned H200s run at.

seeprime · Jan 27, 2026

Prepping for a split from OpenAI, eh?

Theinsanegamer · Jan 27, 2026

toooooot said:
750w, insane... Is it even practical?

Easily. Consumer designs are cheap cheap cheap, and the 3090ti and 5090 both tickle 600w.

For something like this, a full nickel coated copper heatsink would radiate significantly more heat, since these will be in datacenters the designs are not limited to standard ATX expansion card compliance either. They'll also have powerful, loud delta server fans moving hundreds of CFM through them.

Kashim · Jan 27, 2026

toooooot said:
750w, insane... Is it even practical?

Yes, this is actually power "efficient" if you want to look at it that way. We already have consumer level video cards that consume 350w and beyond, but this is not a consumer level chip. Look at the die size and the number of transistors packed in there. Not to mention the MASSIVE on chip SRAM and the power hungry HBM3e VRAM it uses.

Theinsanegamer · Jan 27, 2026

Kashim said:
Yes, this is actually power "efficient" if you want to look at it that way. We already have consumer level video cards that consume 350w and beyond, but this is not a consumer level chip. Look at the die size and the number of transistors packed in there. Not to mention the MASSIVE on chip SRAM and the power hungry HBM3e VRAM it uses.

As always, efficiency is represented by the amount of work being done for a given amount of power. Just consuming 350w does not make something efficient or inefficient on its own.

As a dedicated, tailor made chip these Microsoft designs will likely be much more efficient then nVidia's design.

Melkor Unlimited · Jan 28, 2026

Theinsanegamer said:
As a dedicated, tailor made chip these Microsoft designs will likely be much more efficient then nVidia's design.

How would you know?
Headline says ... "Microsoft challenges nVidia .." ... but there are no nVidia numbers to compare.

Melkor Unlimited · Jan 28, 2026

toooooot said:
750w, insane... Is it even practical?

Very practical.
You just have to make 5 times more room in datacenter.
1 rack for computing + another 4 doing the cooling.

RaXelliX · Jan 31, 2026

I dont know why but that misaligned laser engraved text on the die bugs me the hell out.

Microsoft built a 750W AI chip to challenge Nvidia's dominance, claims 3x performance gains over Amazon

Alfonso Maruccia

Posts: 2,515 +935

toooooot

Posts: 4,620 +2,952

NotYourITguy

Posts: 165 +356

seeprime

Posts: 1,192 +1,823

Theinsanegamer

Posts: 8,551 +17,374

Kashim

Posts: 998 +2,308

Theinsanegamer

Posts: 8,551 +17,374

Melkor Unlimited

Posts: 341 +316

Melkor Unlimited

Posts: 341 +316

RaXelliX

Posts: 410 +244

Similar threads

Latest posts