Skip to main content

MIT's Pic2Recipe A.I. Can Predict Food Ingredients By Analyzing a Photo

Pic2Recipe: Predicting recipes from photos
Scrolling through food photography can bring on the desire to recreate a dish at home, but what if the ingredients aren’t listed? Could there be a way to find out just by analyzing the image? That’s what researchers at the Massachusetts Institute of Technology asked when they set out to create a deep learning algorithm that could predict a recipe based just on a photo. The research, published on July 20, resulted in a program called Pic2Recipe that could accurately predict a dish’s recipe based on a photo, with a 65 percent success rate.

Earlier attempts to turn photos into recipes were limited by smaller datasets — although “small” is relative to all the possible recipes available. One study used 65,000 recipes, but it only included traditional Chinese cuisine; another only had about a 50 percent accuracy in initial testing. Because deep learning algorithms “learn” from being fed large quantities of data, these resulting programs were missing large gaps in potential ingredients, affecting the program’s accuracy.

To create a larger database, the researchers from MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) knew the software would have to be based on a wide-ranging set of data. So to solve that narrow dataset, the team turned to large sets of photos and recipes that already exists — food websites. Compiling data from places like Food.com and All Recipes, the team created Recipe1M, a dataset of over one million recipes.

Using those recipes and the associated images, the team was able to train the software to use object recognition to pick up on what each dish’s ingredients might be. With a list of ingredients, the system then selected  the recipe that best matched the list. Pic2Recipe was able to recognize ingredients like flour, eggs, and butter.

The program doesn’t actually identify a recipe from the photo — it creates a list of ingredients. With that list, the program can then go through that one-million-recipe database and choose the one with ingredients that match the list from the photo.

“In computer vision, food is mostly neglected because we don’t have the large-scale datasets needed to make predictions,” said Yusuf Aytar, a postdoctoral associate who co-wrote the paper with MIT professor Antonio Torralba. “But seemingly useless photos on social media can actually provide valuable insight into healthy habits and dietary preferences.”

Since the computer already has that large dataset, it is also able to able to pick up on a number of different patterns, like that the average recipe has nine ingredients and the most popular are salt, butter, sugar, olive oil, water, eggs, garlic cloves, milk, flour, and onion.

The software could have a number of different real-world uses. A person could snap a photo at a restaurant to learn how to make the dish at home, or to track her personal nutrition.

The program, while it contains a wider dataset than earlier attempts, still has a few gaps. The researchers said the program has trouble with dishes that are a bit more ambiguous, like smoothies and sushi rolls. Similar recipes with a number of different variations, like lasagna for example, also tended to confuse the program.

The group plans to continue developing the program and even hopes to give the system the ability to tell how something is cooked, like picking up the difference between stewed and diced. Future work could also expand the program’s ability to recognize specific ingredients, like determining the type of onion instead of just listing onion.

You don’t have to wait until Pic2Recipe becomes a full fledged app to try it out. An online version allows users to upload images and try it out.

Hillary K. Grigonis
Hillary never planned on becoming a photographer—and then she was handed a camera at her first writing job and she's been…
The best webcams for 2023

Laptop webcams suck. If you're stuck doing Zoom calls or videoconferencing from your home, you'll need a decent external or stand-alone webcam, one that works for your preferred space. The Logitech C920S is currently our pick for the best webcam. It's affordable and provides crisp image quality. But if you need a higher resolution or a streaming-specific option, check the full list below.

Read more
Best microSD cards in 2023: top picks for your computer, camera, or drone
galaxy s8 tips and tricks

MicroSD cards are an unsung hero in the mobile space. They provide extra storage space for your smartphone, but they're also key in tablets, drones, and security cameras. Unfortunately, this feature is falling out of favor in the smartphone space, and most flagship and midrange phones no longer have a slot to insert a microSD card. Even the most expensive phones around, like the Samsung Galaxy Z Fold 4 and the Samsung S22 Ultra don't have a microSD card slot anymore. You're not even safe if you have an iPhone or iPad, as they've never had microSD card slots.

But all is not lost! Some of the best Android phones and best Android tablets do still support them, and they're still required for use with cameras, security cameras, and drones. But no matter which device you're buying it for, you'll want to get your hands on a microSD card from a reputable brand.

Read more
Selfie stunt sets new Guinness World Record
Indian actor Akshay Kumar attempts the world record for most selfies taken in three minutes.

Akshay Kumar Attempts Most Selfies Taken In Three Minutes - Guinness World Records

It seems like Guinness World Records is happy to entertain any kind of extraordinary feat for its listings, no matter how weird or wacky.

Read more