• Ben's Bites
  • Posts
  • OpenAI's 4o-mini brings big brains on a budget.

OpenAI's 4o-mini brings big brains on a budget.

OpenAI has a new baby - GPT-4o-mini. It is a small model that beats the early version of GPT-4 and it costs 30x less than its big bro GPT-4o.

What is going on here?

OpenAI's releasing GPT-4o mini, a small but mighty AI model that's way more affordable than its big brothers.

What does this mean?

GPT-4o mini is a powerhouse in a small package, designed to replace GPT-3.5 models. It's incredibly affordable at just 15 and 30 cents for a million input-output tokens, making it 60% cheaper than GPT-3.5 Turbo and ~30x budget-friendly than top-tier models.

In fact, it even beats out other small models like Google's Gemini 1.5 Flash ($0.35/$0.70) and Anthropic's Claude 3 Haiku ($0.25/$1.25) on price.

But it's not just about the cost savings; GPT-4o mini is seriously smart. It outperforms other small models in math, coding, and multimodal reasoning. , On the MMLU benchmark (general intelligence), it scores 82%, surpassing GPT-3.5 and some larger models.

This little model can handle a massive 128K token context window and outputs 16k tokens, opening up a ton of new possibilities. Companies like Ramp and Superhuman are already using it to great success in real-world tasks.

Plus, it's multimodal, just like its bigger sibling GPT-4o, supporting both text and vision inputs with even more on the horizon.

Safety is taken care of too. OpenAI has baked in new techniques like "instruction hierarchy" to keep the model secure and resistant to jailbreaks.

Why should I care?

You can now say bye to GPT-3.5-Turbo as 4o-mini will take its place in ChatGPT’s free tier. If you were using GPT-3.5-Turbo API in your applications, you should switch to 4o-mini.

That leads to two reasons to care: People are getting increasingly smarter AIs for free directly via ChatGPT and app developers can build powerful AI tools without breaking the bank.

This model (and other kiddos like Gemini 1.5 Flash and Calude 3 Haiku) are great for low-logic tasks. Think translations, rewrites, getting data from forms/images etc. Just don’t expect them to use their own brain, and you’ll be fine.

Reply

or to participate.