Skip to main content

Get ready: AI generated-GIFs might be coming soon

With chatbots and text-to-image generators taking the internet by storm, the next frontier of AI might be text-to-video generators.

Nvidia recently published a research paper called “High-Resolution Video Synthesis with Latent Diffusion Models” on its experiments at its Toronto AI Lab that details how it uses Stable Diffusion to create a tool that can make moving art results from text prompts.

The tech company showcased demos of the Latent Diffusion Models (LDMs), which use text to generate video clips without large amounts of computer processing, TechRadar noted.

The tool is able to generate GIF-style moving images that are approximately 4.7-second long videos at a 1,280 x 2,048 resolution. It is also capable of creating longer videos at a lower resolution of 512 x 1024, according to the research paper.

Having viewed a demo of the technology, TechRadar said the tool is likely ideal as a text-to-GIF generator at this point. The publication noted it could easily handle simple prompts such as a stormtrooper vacuuming on the beach or teddy bear is playing the electric guitar, high definition, 4K. Even so, the result still produced random artifacts and smudging in the GIFs, as are common on other regularly used AI tools such as Midjourney.

The publication believes longer videos still need a little more development before they hit prime time, but feels Nvidia will work quickly to get the technology ready. They might work well for stock libraries and similar purposes.

There are other companies experimenting with AI text-to-video generators. Google demoed its Phenaki generator, which allows longer prompts that produce 20-second clips. Another startup called Runway announced its second-generation video model last month, which is also based on Stable Diffusion. Its demo of the prompt the late afternoon sun peeking through the window of a New York City loft shows how you can add slight moving effects to still images.

Users also stand to benefit from the addition of AI in other programs, such as Adobe Firefly and Adobe Premiere Rush, according to TechRadar.

Some other companies, such as Narakeet and Lume5, market themselves as having text-to-video generators. However, many of these tools work more like PowerPoint presentations, putting together text, audio, images, and perhaps some already produced clips of video with prompts, as opposed to generating a unique work.

Editors' Recommendations

Fionna Agomuoh
Fionna Agomuoh is a technology journalist with over a decade of experience writing about various consumer electronics topics…
5 things AI image generators still struggle with
Dall-E was an early AI leader but hands are not its thing.

AI image generators like Dall-E, Stable Diffusion, Midjourney, and Bing Image Creator produce amazing results, but sometimes they can be incredibly frustrating. With simple prompts containing just a few words, an AI can output impressive images that appear to be professional photographs and convincing art in various styles. However, the same prompt will occasionally create some horrific creature or hilariously flawed rendering.

Negative prompts might help reduce the likelihood of these errors, but complexity can't always save you. Even AI experts struggle with misshapen creatures and unworldly scenes, requiring long hours of refining prompts or touching-up images with a traditional photo editor. For the time being, if you look carefully in the right areas of an image, there's a good chance you'll be able to identify if it was made by a machine.
Hand salad and balls of fingers
AI developers have made progress in the struggle to teach artificial intelligence tools how human hands should look, but there's plenty of room for improvement. If fingers aren't featured prominently, it's easy to miss errors, but it's an ongoing problem.

Read more
Stop using generative-AI tools such as ChatGPT, Samsung orders staff
Samsung logo

Samsung has told staff to stop using generative AI tools such as ChatGPT and Bard over concerns that they pose a security risk, Bloomberg reported on Monday.

The move follows a string of embarrassing slip-ups last month when Samsung employees reportedly fed sensitive semiconductor-related data into ChatGPT on three occasions.

Read more
Microsoft’s new Designer app makes generative AI dead simple
A screenshot of Microsoft's new Designer app.

The Microsoft Designer app is now available as a public preview after the brand first announced it in October 2022.

The Designer app is Microsoft's productivity spin on AI art tools such as OpenAI's DALL-E 2, which also gained popularity last year.

Read more