AI GPU accelerators will be in short supply for two more years, TSMC warns

Alfonso Maruccia

A hot potato: HPC GPUs designed to accelerate AI training are expensive, and TSMC cannot yet meet the rising customer demand for them. The world's leading chip manufacturer still lacks the advanced packaging capacity the market is asking for.

The AI server market is expected to grow to an estimated $150 billion by 2027, and Nvidia currently makes the most sought-after GPUs for accelerating the training of chatbots and generative AI services. TSMC, which manufactures and packages the GPUs those AI servers will use, won't have enough capacity to satisfy market demand for at least one and a half years.

Speaking with Nikkei, TSMC chairman Mark Liu said that the AI accelerator shortage isn't caused by a lack of the AI chips Nvidia designs. Instead, it stems mostly from TSMC's own limited CoWoS packaging capacity. The Taiwanese company cannot fulfill 100% of its customers' orders right now, Liu stated, but "we try to support about 80%" of manufacturing requests.

Chip-on-wafer-on-substrate (CoWoS) technology is an advanced packaging platform that provides "best-in-breed performance and highest integration density" for HPC hardware applications, TSMC explains. The interposer-based, wafer-level system integration process supports a wide range of HBM cubes and package sizes, the company says.

TSMC currently makes the overwhelming majority of GPUs, FPGAs and other specialized chips designed to accelerate AI computations. Those accelerators use HBM to provide the highest possible bandwidth for AI training, and they are assembled with the CoWoS packaging technique. Other chipmakers offer similar packaging capabilities, but TSMC is likely receiving the largest orders from the most prominent chip companies, including Nvidia and AMD.

According to industry analysts, advanced packaging technologies like CoWoS are costly and financially risky to implement. Therefore, smaller manufacturing companies are likely less motivated to invest all the money they would need to increase and refine their packaging capabilities.

TSMC is spending nearly $3 billion on a new packaging fab that should come online by 2027. The company is committed to increasing its packaging capacity "as quickly as possible," Liu confirmed to investors. The "tightness" in manufacturing output is expected to ease by the end of next year, TSMC's chairman said, with CoWoS wafer output doubling in 2024 compared to 2023.


 
There will always be "short supply" if Nvidia keeps buying all the capacity it can.
My point being, there is actually enough supply; the real problem is Nvidia wanting too much production.
 
:rolleyes: Leatherman is drooling over the high prices he'll be able to charge for his AI accelerators.
 
It's buying all it can because it has buyers for all it can get and then some! So much so TSMC is building more capacity.
MI300 is only sampling right now and won't be ramping production until the end of the year at the earliest so it's only NVIDIA right now.
 
But is making slightly less ridiculous amounts of money really deserving of being called a "shortage"?
 
Yes, because it leads to the dismal generational price-per-frame improvements (or rather, no improvement at all) that we've seen. As long as there's so much margin to be made at the top end of the GPU market, it will affect the rest of the market.

What we need is for Intel to catch up (and also to get their GPUs on their own production lines) so that a true competitor to TSMC (I think the only one right now is Samsung) can bring more supply to the market.
 