Skip to main content

Machine learning system brings still images to life by guessing what comes next

machine learning images to life fig5
Whether it’s guessing which songs you want to listen to or which ads you should be shown, modern AI is increasingly focused on predicting the future. But there’s an enormous gulf between those kind of applications and looking at a scene and guessing what will happen next.

That’s what researchers at the Massachusetts Institute of Technology have done, with a new paper revealing not just their ability to look at still images and guess what will happen next — something we’ve covered in the past — but to actually generate video of it.

Recommended Videos

“What we’re interested in is teaching machines what can happen in a particular setting,” Carl Vondrick, a Ph.D. student in computer science, told Digital Trends. “For example, we wanted a machine to recognize what happens on a beach. We want it to know that waves are going to crash, people are going to play in the water — these are all things it’s very difficult to teach a machine. The reason is that it would be very time-consuming for a person to sit down and write rules to explain everything that can happen in any given scenario. What we wanted to do was to teach them from watching massive amounts of video instead.”

To do this, Vondrick and his fellow researchers used deep learning to “train” a computer to understand everyday scenes by watching the equivalent of two years’ worth of online video footage. After that, they then showed the machine still images and asked it to generate video to bring that scene to life.

Like many examples of computation creativity, such as Google’s DeepDream technology, the generated images can be a bit trippy — although there’s no denying they get a lot right about the world. MIT’s system, for instance, takes a still image of a beach and animates it to show waves crashing, just as Vondrick hoped it would.

Limitations mean that the videos are only one second long, and animated like a made-for-TV movie from the 1990s, but it’s an impressive start.

“One application I’m really excited about is predicting the future,” Vondrick said. “What our work is doing is to generate videos letting us anticipate what is going to happen next. This is important for robotics. You can imagine a robot being able to predict when an elderly person is going to fall down, for example, and then preventing that from happening. It’s also got application in computer graphics. Right now, the system isn’t photorealistic, but as we keep improving it I think we’ll be able to generate full-length videos. This could therefore be very useful as a tool in something like Hollywood movie production.”

It’s potentially even more significant than that. “In the largest sense, this work is about a computer learning to simulate the world,” he said. “Getting a machine to understand how the world works will be very important as computers start taking more and more actions in our world — and need to understand what is going on in order to do that.”

Luke Dormehl
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
The best portable power stations
EcoFlow DELTA 2 on table at campsite for quick charging.

Affordable and efficient portable power is a necessity these days, keeping our electronic devices operational while on the go. But there are literally dozens of options to choose from, making it abundantly difficult to decide which mobile charging solution is best for you. We've sorted through countless portable power options and came up with six of the best portable power stations to keep your smartphones, tablets, laptops, and other gadgets functioning while living off the grid.
The best overall: Jackery Explorer 1000

Jackery has been a mainstay in the portable power market for several years, and today, the company continues to set the standard. With three AC outlets, two USB-A, and two USB-C plugs, you'll have plenty of options for keeping your gadgets charged.

Read more
CES 2023: HD Hyundai’s Avikus is an A.I. for autonomous boat and marine navigation
Demonstration of NeuBoat level 2 autonomous navigation system at the Fort Lauderdale International Boat Show

This content was produced in partnership with HD Hyundai.
Autonomous vehicle navigation technology is certainly nothing new and has been in the works for the better part of a decade at this point. But one of the most common forms we see and hear about is the type used to control steering in road-based vehicles. That's not the only place where technology can make a huge difference. Autonomous driving systems can offer incredible benefits to boats and marine vehicles, too, which is precisely why HD Hyundai has unveiled its Avikus AI technology -- for marine and watercraft vehicles.

More recently, HD Hyundai participated in the Fort Lauderdale International Boat Show, to demo its NeuBoat level 2 autonomous navigation system for recreational boats. The name mashes together the words "neuron" and "boat" and is quite fitting since the Avikus' A.I. navigation tech is a core component of the solution, it will handle self-recognition, real-time decisions, and controls when on the water. Of course, there are a lot of things happening behind the scenes with HD Hyundai's autonomous navigation solution, which we'll dive into below -- HD Hyundai will also be introducing more about the tech at CES 2023.

Read more
This AI cloned my voice using just three minutes of audio
acapela group voice cloning ad

There's a scene in Mission Impossible 3 that you might recall. In it, our hero Ethan Hunt (Tom Cruise) tackles the movie's villain, holds him at gunpoint, and forces him to read a bizarre series of sentences aloud.

"The pleasure of Busby's company is what I most enjoy," he reluctantly reads. "He put a tack on Miss Yancy's chair, and she called him a horrible boy. At the end of the month, he was flinging two kittens across the width of the room ..."

Read more