Nvidia just released an open-source LLM to rival GPT-4

Nvidia, which builds some of the most highly sought-after GPUs in the AI industry, has released an open-source large language model that reportedly performs on par with leading proprietary models from OpenAI, Anthropic, Meta, and Google.

The company introduced its new NVLM 1.0 family in a recently released white paper, with the 72-billion-parameter NVLM-D-72B model leading the lineup. “We introduce NVLM 1.0, a family of frontier-class multimodal large language models that achieve state-of-the-art results on vision-language tasks, rivaling the leading proprietary models (e.g., GPT-4o) and open-access models,” the researchers wrote.

The new model family is reportedly already capable of “production-grade multimodality,” with exceptional performance across a variety of vision and language tasks, as well as better text-only responses than the base LLM on which the NVLM family is built. “To achieve this, we craft and integrate a high-quality text-only dataset into multimodal training, alongside a substantial amount of multimodal math and reasoning data, leading to enhanced math and coding capabilities across modalities,” the researchers explained.

The result is an LLM that can explain why a meme is funny as easily as it can solve complex math equations step by step. Nvidia also says its multimodal training raised the model’s text-only accuracy by an average of 4.3 points across common industry benchmarks.

Screenshot from the NVLM white paper showing the model explaining why a meme is funny. Image: Nvidia

Nvidia appears serious about ensuring that this model meets the Open Source Initiative’s newest definition of “open source”: it has made the model weights publicly available and promises to release the model’s source code in the near future. This is a marked departure from rivals like OpenAI and Google, which closely guard the details of their LLMs’ weights and source code. In doing so, Nvidia has positioned the NVLM family less as a direct competitor to GPT-4o and Gemini 1.5 Pro and more as a foundation on which third-party developers can build their own chatbots and AI applications.

Andrew Tarantola
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…