AMD's next compute GPU could have two dies and 128 GB of memory

mongeese

Posts: 513   +109
Staff member
Fun fact: Aldebaran is the brightest star in the Taurus constellation. It’s 44 times the size of the sun and 400 times as luminous. It’s also the namesake of AMD’s next Instinct Accelerator.

Aldebaran, the computer part, has appeared a few times in Linux patch notes. Its two-die design was half-implied in February and confirmed last month: "on Aldebaran, only primary die fetches valid power data. Show power/energy values as 0 on secondary die," one note reads.

It’s the first AMD Accelerator/GPU to leverage an MCM (multi-chip module) design, a technique that’s been plastered on patents for years but has only just begun to materialize. Using multiple chips/dies connected closely allows for more scalability than traditional monolithic designs, but also reduces the per-compute unit performance of a card, particularly in poorly parallelized workloads.

Nvidia is expected to challenge Aldebaran with their own MCM designs based on the Hopper architecture next year. Intel, meanwhile, is on the verge of releasing the Ponte Vecchio accelerator based on the Xe HPC architecture and an early MCM implementation.

Like Ponte Vecchio, Aldebaran leverages MCM to expand memory capacity, patch notes imply. A note from last week describes Aldebaran as having two dies, four UMCs per die, and eight channels connected to 2 GB of HBM per UMC. Or 128 GB, in total.

The patch notes also mention support for a "new HBM2 memory type," presumably HBM2e. If AMD is using SK Hynix’s new HBM2e memory modules with 3.6 Gbps of bandwidth and a 4096-bit bus on each die, then Aldebaran could have 3.6 TB/s of memory bandwidth.

For comparison, Aldebaran’s predecessor, the Instinct MI100 (formerly codenamed Arcturus, another star) has 32 GB of HBM2 and 1.2 TB/s of memory bandwidth. Nvidia’s A100 accelerator can be configured with up to 80 GB of 3.2 Gbps HBM2e memory for 2 TB/s of bandwidth.

AMD CEO Dr Lisa Su has already said that the CDNA2 architecture, meaning Aldebaran, is slated for release later this year. Presumably, it will release as the MI200 Accelerator.

"This year, as I said, we’re putting together our next generation CDNA architecture. This is actually a key component that enabled us to win the largest supercomputer bids in the US," Su said at a conference in May. "… And we will be launching the next generation of that architecture, actually, later this year. We’re very excited about it. I think it’s progressed extremely well."

Masthead by Timothy Dykes

Permalink to story.

 

Irata

Posts: 1,658   +2,776
Am really curious about the first MCM GPU and if it‘s going to be an IMC less solution like the first Threadrippers, with an IMC or an in-between solution.

Either way, it‘s a good idea to start with compute GPU rather than gaming ones.
 

Vanderlinde

Posts: 44   +36
Am really curious about the first MCM GPU and if it‘s going to be an IMC less solution like the first Threadrippers, with an IMC or an in-between solution.

Either way, it‘s a good idea to start with compute GPU rather than gaming ones.

All camps, wether it be Intel, Nvidia or AMD start with a enterprise or computational chip. That one is just simply derived towards a consumer friendly chip or CPU.

I mean if you compare the chips, Quaddro vs Geforce or Instinct vs Vega etc; they are all very simular.
 

NeoMorpheus

Posts: 876   +1,632
o06ddpdiwwf11.jpg
 

Lounds

Posts: 892   +795
If you bothered to read the article you're commenting on, you would have learned that this is exactly a GPU with multiple dies. As even the title indicates.
Maybe I should have added for Gaming graphics cards. This is a compute card :)