Skip to main content

Nvidia built a massive dual GPU to power models like ChatGPT

Nvidia’s semi-annual GPU Technology Conference (GTC) usually focuses on advancements in AI, but this year, Nvidia is responding to the massive rise of ChatGPT with a slate of new GPUs. Chief among them is the H100 NVL, which stitches two of Nvidia’s H100 GPUs together to deploy Large Language Models (LLM) like ChatGPT.

The H100 isn’t a new GPU. Nvidia announced it a year ago at GTC, sporting its Hopper architecture and promising to speed up AI inference in a variety of tasks. The new NVL model with its massive 94GB of memory is said to work best when deploying LLMs at scale, offering up to 12 times faster inference compared to last-gen’s A100.

Nvidia's H100 NVL being installed in a server.
Nvidia

These GPUs are at the heart of models like ChatGPT. Nvidia and Microsoft recently revealed that thousands of A100 GPUs were used to train ChatGPT, which is a project that’s been more than five years in the making.

The H100 NVL works by combining two H100 GPUs over Nvidia high bandwidth NVLink interconnect. This is already possible with current H100 GPUs — in fact, you can connect up to 256 H100s together through NVLink — but this dedicated unit is built for smaller deployments.

This is a product built for businesses more than anything, so don’t expect to see the H100 NVL pop up on the shelf at your local Micro Center. However, Nvidia says enterprise customers can expect to see it around the second half of the year.

In addition to the H100 NVL, Nvidia also announced the L4 GPU, which is specifically built to power AI-generated videos. Nvidia says it’s 120 times more powerful for AI-generated videos than a CPU, and offers 99% better energy efficiency. In addition to generative AI video, Nvidia says the GPU sports video decoding and transcoding capabilities and can be leveraged for augmented reality.

Nvidia says Google Cloud is among the first to integrate the L4. Google plans on offering L4 instances to customers through its Vertex AI platform later today. Nvidia said the GPU will be available from partners later, including Lenovo, Dell, Asus, HP, Gigabyte, and HP, among others.

Editors' Recommendations

Jacob Roach
Senior Staff Writer, Computing
Jacob Roach is a writer covering computing and gaming at Digital Trends. After realizing Crysis wouldn't run on a laptop, he…
Google Bard can now speak, but can it drown out ChatGPT?
Google Bard on a green and black background.

In the world of artificial intelligence (AI) chatbots, OpenAI’s ChatGPT is undoubtedly the best known. But Google Bard is hot on its heels, and the bot has just been granted a new ability: the power of speech.

The change was detailed in a Google blog post, which described the update as “Bard’s biggest expansion to date.” It grants Bard not just speech, but the ability to converse in over 40 languages, use images as prompts, and more.

Read more
ChatGPT maker OpenAI faces FTC probe over consumer protection laws
ChatGPT and OpenAI logos.

ChatGPT maker OpenAI is facing an investigation by the Federal Trade Commission (FTC) over possible violations of consumer protection laws.

The action marks the toughest scrutiny of Microsoft-backed OpenAI since it burst onto the scene in November with its AI-powered ChatGPT chatbot.

Read more
ChatGPT website traffic has fallen for the first time
A MacBook Pro on a desk with ChatGPT's website showing on its display.

Since the arrival of OpenAI’s ChatGPT in November, the clever AI-powered chatbot has taken the world by storm as people take the tool for a spin while also speculating about how the technology might transform the workplace and wider society.

But for the first time since ChatGPT landed toward the end of last year, visits to the chatbot’s website have dipped, analytics firm Similarweb said.

Read more