Skip to main content

The future of fast PC graphics? Connecting directly to SSDs

Performance boosts are expected with each new generation of the best graphics cards, but it seems that Nvidia and IBM have their sights set on greater changes.

The companies teamed up to work on Big accelerator Memory (BaM), a technology that involves connecting graphics cards directly to superfast SSDs. This could result in larger GPU memory capacity and faster bandwidth while limiting the involvement of the CPU.

A chart breaks down Nvidia and IBM's BaM technology.
Image source: Arxiv

This type of technology has already been thought of, and worked on, in the past. Microsoft’s DirectStorage application programming interface (API) works in a somewhat similar way, improving data transfers between the GPU and the SSD. However, this relies on external software, only applies to games, and only works on Windows. Nvidia and IBM researchers are working together on a solution that removes the need for a proprietary API while still connecting GPUs to SSDs.

The method, amusingly referred to as BaM, was described in a paper written by the team that designed it. Connecting a GPU directly to an SSD would provide a performance boost that could prove to be viable, especially for resource-heavy tasks such as machine learning. As such, it would mostly be used in professional high-performance computing (HPC) scenarios.

The technology that is currently available for processing such heavy workloads requires the graphics card to rely on large amounts of special-purpose memory, such as HBM2, or to be provided with efficient access to SSD storage. Considering that datasets are only growing in size, it’s important to optimize the connection between the GPU and storage in order to allow for efficient data transfers. This is where BaM comes in.

“BaM mitigates the I/O traffic amplification by enabling the GPU threads to read or write small amounts of data on-demand, as determined by the compute,” said the researchers in their paper, first cited by The Register. “The goal of BaM is to extend GPU memory capacity and enhance the effective storage access bandwidth while providing high-level abstractions for the GPU threads to easily make on-demand, fine-grain access to massive data structures in the extended memory hierarchy.”

An Nvidia GPU core sits on a table.
Niels Broekhuijsen/Digital Trends

For many people who don’t work directly with this subject, the details may seem complicated, but the gist of it is that Nvidia wants to rely less on the processor and connect directly to the source of the data. This would both make the process more efficient and free up the CPU, making the graphics card much more self-sufficient. The researchers claim that this design would be able to compete with DRAM-based solutions while remaining cheaper to implement.

Although Nvidia and IBM are undoubtedly breaking new ground with their BaM technology, AMD worked in this area first: In 2016, it unveiled the Radeon Pro SSG, a workstation GPU with integrated M.2 SSDs. However, the Radeon Pro SSG was intended to be strictly a graphics solution, and Nvidia is taking it a few steps further, aiming to deal with complex and heavy compute workloads.

The team working on BaM plans to release the details of their software and hardware optimization as open source, allowing others to build on their findings. There is no mention as to when, if ever, BaM might find itself implemented in future Nvidia products.

Editors' Recommendations

Monica J. White
Monica is a UK-based freelance writer and self-proclaimed geek. A firm believer in the "PC building is just like expensive…
I’ve reviewed every GPU in the last 2 years — these are the only ones you should buy
A hand grabbing a graphics card.

There are a lot of options when you're on the hunt for the best graphics card. In years past, you could look at a product name and get a general sense of where it landed in terms of performance, but times are changing. With bizarre value propositions, extra features that go beyond raw performance, and questionable naming conventions, GPUs are a lot more complex than they used to be.

I've reviewed every graphics card released over the last two years, covering two generations from both AMD and Nvidia, as well as Intel's first push into the discrete graphics market. Here are the only GPUs I'd even consider buying.
What to look for

Read more
Did Nvidia just fix the RTX 4090’s melting power connectors?
Nvidia GeForce RTX 4090 GPU.

Nvidia’s 40-series graphics cards have gained an unwanted reputation for the unreliability of their 12VHPWR power connectors, with numerous accounts circulating online of connectors melting and destroying GPUs. Now, though, it seems Nvidia might have solved the problem for its RTX 4090 graphics cards.

The idea came to light after a post by prackprackprack on Reddit, with the user asking whether their new RTX 4090 featured shorter sense pins compared to older versions of the card. Shorter pins would mean the power supply couldn’t properly connect to a user’s graphics card and send it higher wattages unless the connector was correctly pushed in.

Read more
Why this two-year-old GPU is still the one you should buy
The AMD RX 6700 XT sitting on a table.

Since Nvidia and AMD announced their next-gen GPUs, budget PC builders have been waiting with bated breath to see what the companies could deliver in the midrange sweet spot of $300 to $400. Unfortunately, the answer hasn't been pretty.

Nvidia's RTX 4060 Ti launched to near-universal dissatisfaction, while AMD's RX 7600, although a decent GPU, carried a lot of caveats. Even Nvidia's value-focused RTX 4060 failed to impress, largely due to the fact that it's beaten by last-gen options around the same price.

Read more