AI

OpenAI Introduces GPT-4o Mini: An Affordable AI Model for Developers

18 July 2024

|

Zaker Adham

Summary

During its Spring Update event in May, OpenAI unveiled its new large language model (LLM), GPT-4o.

Renowned for its top-tier performance in various industry benchmarks, GPT-4o comes with a hefty price tag, making it one of the most expensive models available, second only to Anthropic's Claude 3 Opus.

In recent months, major AI companies have been focusing on developing more affordable AI models to reduce costs for developers. Notable examples include Claude 3 Haiku and Gemini 1.5 Flash. Today, OpenAI joined this trend by announcing GPT-4o Mini, a new cost-effective AI model. The GPT-4o Mini is designed to be cheaper and faster than the current OpenAI models and will replace the GPT-3.5 Turbo model in OpenAI's lineup.

GPT-4o Mini has achieved an 82% score on the MMLU benchmark, compared to GPT-4o's 89%. Its main competitors, Gemini 1.5 Flash and Claude 3 Haiku, scored 79% and 75% respectively on the same benchmark.

The pricing for GPT-4o Mini is set at approximately $0.60 USD per million output tokens and $0.15 USD per million input tokens, making it 60% cheaper than the GPT-3.5 Turbo model it replaces. This pricing aligns it with Gemini 1.5 Flash and Claude 3 Haiku. The model maintains a context window of 128,000 tokens and features a knowledge cutoff date of October 2023.

Developers can now access GPT-4o Mini via APIs, while consumers can utilize this model through ChatGPT web and mobile apps. Enterprise users will have access to GPT-4o Mini starting next week.

With the introduction of GPT-4o Mini, OpenAI takes a significant step towards making advanced AI technology more accessible and affordable for a broader audience. This new model meets the growing demand for cost-effective AI solutions while maintaining competitive performance.