Skip to main content

When we run out of room for data, scientists want to store it in DNA

Google

First the apocalyptic warning: We’re running out of data storage.

Chances are that this isn’t something you’ve had to worry about too much in recent years. There was a time, not all that long ago, when your computer’s finite hard drive was all the storage you had available. Hit that limit (which, in the case of my own first computer, was less than 100MB) and you resorted to floppy disks and other local external storage. When you ran out of that, too, you got deleting.

Each day, around 2.5 quintillion bytes of data is created, courtesy of the 3.7 billion humans who now use the internet.

We don’t delete any more. Nor do companies, especially those valued based on the data they own. Instead, we simply propel our files off to the cloud, whose very name is ephemeral and ethereal; lacking in any real physicality. Where is the data stored? It doesn’t matter so long as we can get it back. What are the perils of running out of cloud storage? Seemingly very little, besides having to up your monthly subscription payments to unlock more glorious free space.

As a result, the idea that we might one day run out of data storage is as hard to wrap your head around as the suggestion that we could run out water: that glorious free resource which falls from the sky. But 2018 is the year in which Cape Town, South Africa, came precipitously close to running out of water. And we could run out of data storage, too.

Data, data, everywhere

The reason for this is the unimaginable pace at which we currently produce data. Each day, around 2.5 quintillion bytes of data is created, courtesy of the 3.7 billion humans who now use the internet. In the last two years alone, a mind-boggling 90 percent of the world’s data has been created. With a growing number of smart devices connected to the Internet of Things, that figure is set to increase significantly.

“When we think of cloud storage, we think of these infinite stores of data,” Hyunjun Park, CEO and co-founder of the data storage company Catalog, told Digital Trends. “But the cloud is really just someone else’s computer. What most people don’t realize is that we’re generating so much data that the pace at which we are generating it is far outpacing our ability to store all of it. In the very near future, we’re going to have a huge gap between the useful data that we’re generating, and how we are able to store it using conventional mediums.”

Catalog has developed technology they believe could transform the way we store data.

Since cloud storage companies are busy building new data centers, and expanding their existing ones, at a rate of knots, it’s difficult to work out when we could run out of data storage capacity. There’s no movie-style countdown clock. According to Park, however, as early as 2025 humankind may have produced more than 160 zettabytes of data cumulatively. (A zettabyte, in case you’re wondering, is a trillion gigabytes.) How much of this will we be able to store? Around 12.5 percent of it, Park suggests.

Clearly, something needs to be done.

Is DNA the answer?

That’s where Park and fellow MIT scientist and co-founder Nathaniel Roquet enter the picture. Their startup Catalog has developed technology they believe could transform data storage as we know it; allowing, or so they claim, the entirety of the world’s data to be comfortably fit into a space the size of a coat closet.

Catalog's DNA storage team
Catalog’s DNA Storage Team in the lab. Catalog

Catalog’s solution? By encoding data into DNA. That might sound like the plot of a Michael Crichton novel, but their scalable and affordable solution is serious, and has so far received $9 million in venture funding — along with the support of leading professors from Stanford and Harvard Universities.

“A question I get asked often is, ‘Whose DNA are we using?’” Park laughed. “People are afraid of us taking DNA from people and turning them into mutants, or things like that.”

For years bottlenecks have stopped DNA from living up to it’s massive data storage potential.

This is not, we should make clear, what Catalog is doing. The DNA the company is coding data into is a synthetic polymer. It is not something that comes from a biological origin, and the series of base pairs into which the data is coded, as a series of ones and zeros, isn’t the code for anything living. But the end product is nonetheless biologically indistinguishable from something you might find in a living cell.

The idea of DNA being a potential storage method has been speculated upon for decades now, virtually since James Watson and Francis Crick discovered the double helix in 1953. However, until now there have been a number of bottlenecks that have stopped it living up to its massive potential as a computational data storage solution.

Traditional thinking on DNA-based data storage focused on the synthesis of new DNA molecules; mapping the sequence of bits to the sequence of DNA’s four base pairs and making enough molecules to represent all of the numbers you want to store. The problem is that this process is slow and expensive, both considerable bottlenecks when it comes to storing data.

Catalog’s approach is based on decoupling the synthesis process from the encoding process. Essentially, the company generates massive numbers of just a few different molecules (making it much cheaper) and then encodes the information by generating huge diversity from the premade molecules.

As an analogy, Catalog likened the previous approach to manufacturing custom hard drives with all your data hard-wired in. Storing different data means building a whole new hard drive from the ground up. Their approach, they suggest, is akin to mass-producing blank hard drives, and then filling it with the encoded information as and when required.

It’s all about the storage

The exciting part of all of this is the mind-boggling amount of data it can store. As a proof of concept, Catalog has used its technology to encode books like The Hitchhiker’s Guide to the Galaxy into DNA. But that’s nothing compared to the possibilities.

From start to finish, reading data off of DNA will take a minimum of several hours.

“If you’re comparing apples to apples, the bits you can store in the same volume comes out at something like 1 million times the informational density of a solid-state drive,” Park said. “Whatever you can store in a flash drive, you could store 1 million times that in the same volume if you’re doing it in DNA.”

The comparison with solid-state drives is not exact, however. DNA may be able to store far more information in the same volume, but it doesn’t have the instant access of, say, a USB-connected flash drive. Catalog’s approach transforms data into a solid pellet of synthetic polymer.

To access your data, scientists would need to take said pellet, rehydrate it by adding water, and then read it using a DNA sequencer. This provides the base pairs of the DNA, which can, in turn, then be used to calculate the ones and zeroes that reassemble your data. From start to finish, the process will take a minimum of several hours.

Catalog's DNA team in the lab
In order to retrieve data off of DNA, scientists would need to take the pellet it’s stored on, rehydrate it by adding water, and then read it using a DNA sequencer. Catalog

For this reason, Catalog is initially targeting a market used to these kinds of delays: the archiving market. This is the kind of data that is currently stored on formats like magnetic tape, used for keeping track of the kind of information that you might hope not to have to revisit, but is still crucial to hang onto. (Imagine the corporate equivalent of the warranty to your fridge.)

But is there ever a point at which this will matter to the average user? After all, as we pointed out at the top of this article, most of us don’t really think all that much about our data and where it is kept. Is it on magnetic tape? Is it on solid-state storage? We don’t mind so long as it is there when we need it.

DNA-based data encoding is likely to be a long-term storage option, while short-term data takes other forms.

Because of the amount of time it takes to retrieve information, there’s unlikely to ever be a point at which, for instance, your Google Cloud information is stored in enormous vats of DNA or as a series of marble-like pellets in Mountain View, CA. Should Catalog be able to prove its concept to businesses, this is likely to be a long-term storage option, while short-term data takes other forms.

Imagine the possibilities

A tube containing millions of copies of data encoded into DNA. Catalog

There are exciting sci-fi-sounding possibilities, though. “Imagine a subcutaneous pellet containing all your health data, all your MRA scans, your blood tests, your X-rays from your dentist,” Park said. “You would always want that data to be very accessible to you, but you don’t necessarily want it up in the cloud somewhere, or on an unsecured server in a hospital. If you had that with you in the form of DNA, you could physically control that data and access to it, while making sure that only the authorized doctors could have access to it.”

After all, as he points out, all hospitals today have DNA sequencers. “I’m not saying we’re pursuing that right now, but it’s a possible future,” he said.

Having announced their new company to the world, Catalog is now focused on carrying out some pilot projects to demonstrate how this technology can be used effectively. “These aren’t scientific challenges we have left to solve, but rather mechanical optimization problems,” he noted.

Having, by his own admission, having entered this field because it sounded like a cool technological approach to a big problem, Park is now convinced that DNA data storage may turn out to be one of the most important technologies of our time.

Heck, when it comes to being able to archive human history as we know it, it’s hard to disagree. “It’s about preserving our way of life as we know it,” he explained.

Luke Dormehl
Former Digital Trends Contributor
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
Juiced Bikes offers 20% off on all e-bikes amid signs of bankruptcy
Juiced Bikes Scrambler ebike

A “20% off sitewide” banner on top of a company’s website should normally be cause for glee among customers. Except if you’re a fan of that company’s products and its executives remain silent amid mounting signs that said company might be on the brink of bankruptcy.That’s what’s happening with Juiced Bikes, the San Diego-based maker of e-bikes.According to numerous customer reports, Juiced Bikes has completely stopped responding to customer inquiries for some time, while its website is out of stock on all products. There are also numerous testimonies of layoffs at the company.Even more worrying signs are also piling up: The company’s assets, including its existing inventory of products, is appearing as listed for sale on an auction website used by companies that go out of business.In addition, a court case has been filed in New York against parent company Juiced Inc. and Juiced Bike founder Tora Harris, according to Trellis, a state trial court legal research platform.Founded in 2009 by Harris, a U.S. high-jump Olympian, Juiced Bikes was one of the early pioneers of the direct-to-consumer e-bike brands in the U.S. market.The company’s e-bikes developed a loyal fandom through the years. Last year, Digital Trends named the Juiced Bikes Scorpion X2 as the best moped-style e-bike for 2023, citing its versatility, rich feature set, and performance.The company has so far stayed silent amid all the reports. But should its bankruptcy be confirmed, it could legitimately be attributed to the post-pandemic whiplash experienced by the e-bike industry over the past few years. The Covid-19 pandemic had led to a huge spike in demand for e-bikes just as supply chains became heavily constrained. This led to a ramp-up of e-bike production to match the high demand. But when consumer demand dropped after the pandemic, e-bike makers were left with large stock surpluses.The good news is that the downturn phase might soon be over just as the industry is experiencing a wave of mergers and acquisitions, according to a report by Houlihan Lokey.This may mean that even if Juiced Bikes is indeed going under, the brand and its products might find a buyer and show up again on streets and trails.

Read more
Volkswagen plans 8 new affordable EVs by 2027, report says
volkswagen affordable evs 2027 id 2all

Back in the early 1970s, when soaring oil prices stifled consumer demand for gas-powered vehicles, Volkswagen took a bet on a battery system that would power its first-ever electric concept vehicle, the Elektro Bus.
Now that the German automaker is facing a huge slump in sales in Europe and China, it’s again turning to affordable electric vehicles to save the day.Volkswagen brand chief Thomas Schaefer told German media that the company plans to bring eight new affordable EVs to market by 2027."We have to produce our vehicles profitably and put them on the road at affordable prices," he is quoted as saying.
One of the models will be the ID.2all hatchback, the development of which is currently being expedited to 36 months from its previous 50-month schedule. Last year, VW unveiled the ID.2all concept, promising to give it a price tag of under 25,000 euros ($27,000) for its planned release in 2025.VW CEO Larry Blume has also hinted at a sub-$22,000 EV to be released after 2025.It’s unclear which models would reach U.S. shores. Last year, VW America said it planned to release an under-$35,000 EV in the U.S. by 2027.The price of batteries is one of the main hurdles to reduced EV’s production costs and lower sale prices. VW is developing its own unified battery cell in several European plants, as well as one plant in Ontario, Canada.But in order for would-be U.S. buyers to obtain the Inflation Reduction Act's $7,500 tax credit on the purchase of an EV, the vehicle and its components, including the battery, must be produced at least in part domestically.VW already has a plant in Chattanooga, Tennesse, and is planning a new plant in South Carolina. But it’s unclear whether its new unified battery cells would be built or assembled there.

Read more
Nissan launches charging network, gives Ariya access to Tesla SuperChargers
nissan charging ariya superchargers at station

Nissan just launched a charging network that gives owners of its EVs access to 90,000 charging stations on the Electrify America, Shell Recharge, ChargePoint and EVgo networks, all via the MyNissan app.It doesn’t stop there: Later this year, Nissan Ariya vehicles will be getting a North American Charging Standard (NACS) adapter, also known as the Tesla plug. And in 2025, Nissan will be offering electric vehicles (EVs) with a NACS port, giving access to Tesla’s SuperCharger network in the U.S. and Canada.Starting in November, Nissan EV drivers can use their MyNissan app to find charging stations, see charger availability in real time, and pay for charging with a payment method set up in the app.The Nissan Leaf, however, won’t have access to the functionality since the EV’s charging connector is not compatible. Leaf owners can still find charging stations through the NissanConnectEV and Services app.Meanwhile, the Nissan Ariya, and most EVs sold in the U.S., have a Combined Charging System Combo 1 (CCS1) port, which allows access to the Tesla SuperCharger network via an adapter.Nissan is joining the ever-growing list of automakers to adopt NACS. With adapters, EVs made by General Motors, Ford, Rivian, Honda and Volvo can already access the SuperCharger network. Kia, Hyundai, Toyota, BMW, Volkswagen, and Jaguar have also signed agreements to allow access in 2025.
Nissan has not revealed whether the adapter for the Ariya will be free or come at a cost. Some companies, such as Ford, Rivian and Kia, have provided adapters for free.
With its new Nissan Energy Charge Network and access to NACS, Nissan is pretty much covering all the bases for its EV drivers in need of charging up. ChargePoint has the largest EV charging network in the U.S., with over 38,500 stations and 70,000 charging ports at the end of July. Tesla's charging network is the second largest, though not all of its charging stations are part of the SuperCharger network.

Read more