OpenAI reveals breakthrough model

OpenAI has launched a small AI model that outperforms industry leaders

OpenAI has unveiled its newest model, GPT-4o mini. It is the company's smallest AI model to date, costs less than its full-sized models, and performs better than GPT-3.5.

How does the GPT-4o mini perform?

GPT-4o mini scored 82% on MMLU (Measuring Massive Multitask Language Understanding), an industry-recognized benchmark that comprises about 16,000 multiple-choice questions across 57 academic subjects and is designed to measure reasoning. That beats two industry-leading small AI models: Gemini 1.5 Flash, which scored 79%, and Claude 3 Haiku, which scored 75%. It also outperformed OpenAI’s flagship GPT-3.5, which scored only 70% on the test, but it didn’t beat GPT-4o, which got 88.7% (to put this into context, Google claims that its Gemini Ultra scored 90%).
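To make the percentages concrete, here is a minimal sketch of how a score on a multiple-choice benchmark can be computed: the share of questions where the model's chosen letter matches the answer key. The function name and the toy data are invented for illustration. They are not real MMLU questions or results, and this is not how any particular company runs the test.

```python
def benchmark_score(predictions, answer_key):
    """Return the percentage of questions answered correctly."""
    if len(predictions) != len(answer_key):
        raise ValueError("predictions and answer_key must be the same length")
    if not answer_key:
        raise ValueError("answer_key must not be empty")
    # Count the questions where the model picked the keyed answer.
    correct = sum(p == a for p, a in zip(predictions, answer_key))
    return 100.0 * correct / len(answer_key)


# Toy data, invented for illustration only (hypothetical, not MMLU).
answer_key = ["A", "C", "B", "D", "A"]
predictions = ["A", "C", "D", "D", "A"]

print(f"{benchmark_score(predictions, answer_key):.0f}%")  # prints "80%"
```

On the real benchmark the same idea is applied to thousands of questions, so a one-point difference between two models corresponds to a difference of well over a hundred answers.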

It is worth noting that some researchers take MMLU results with a pinch of salt, because how the test is run differs from company to company, which prevents a fair, like-for-like comparison. Plus, if a model has been trained on the answers to the MMLU questions, there’s a strong chance its score will be inflated, and with no third-party evaluators involved in the process, the validity of the results can be questioned.

OpenAI states that its newest model is roughly the same size as Gemini 1.5 Flash and Claude 3 Haiku, but is much faster and more cost-efficient.

It’s also said to be 60% cheaper and 2X faster to run than GPT-3.5 Turbo, making it “a compelling offering for speed-dependent use-cases including many consumer applications,” such as auto-suggestion features and data analysis tasks.

Why has OpenAI released a mini AI model?

Because the compute costs of running larger, more advanced AI models, such as GPT-4o or Claude 3.5 Sonnet, are astronomically high, developers are turning towards smaller models that are often faster and more cost-efficient while still being capable of performing high-volume, simple tasks. OpenAI understands this and has released GPT-4o mini to offer developers something lightweight and inexpensive, making AI more accessible, which aligns with its broader mission:

“GPT-4o Mini really gets at the OpenAI mission of making AI more broadly accessible to people. If we want AI to benefit every corner of the world, every industry, every application, we have to make AI much more affordable.”
