OpenAI has officially released the o3-mini, its newest and most cost-efficient reasoning model to date. First previewed in December 2024, the o3-mini is optimized for STEM-related tasks, excelling in science, math, and coding. The model is designed to provide a specialized alternative for technical domains that require both speed and precision, marking a significant step forward in OpenAI’s mission to make advanced AI tools more accessible and affordable.
Performance and Speed
The o3-mini is engineered to match the performance of OpenAI’s o1 model in science, math, and coding while delivering faster responses. According to OpenAI, the o3-mini is 24% faster than its predecessor, the o1-mini, with an average response time of 7.7 seconds compared to o1-mini’s 10.16 seconds. This improvement in speed, combined with its accuracy, makes the o3-mini a compelling choice for developers and users alike.
In A/B testing, external testers preferred o3-mini’s answers over o1-mini’s more than half the time, and the new model made 39% fewer major mistakes on challenging real-world questions. With medium reasoning effort, o3-mini matches o1’s performance in STEM tasks, while high reasoning effort allows it to outperform both o1 and o1-mini in certain benchmarks.
Developer-Friendly Features
The o3-mini is OpenAI’s first small reasoning model to support highly requested developer features, including:
- Function calling
- Developer messages
- Structured Outputs
- Streaming capabilities
Developers can also choose from three levels of reasoning effort—low, medium, and high—to balance speed and accuracy based on their specific use cases. Additionally, o3-mini integrates with search functionality, providing up-to-date answers with links to relevant web sources. OpenAI notes that this feature is still a prototype as the company works to refine search integration across its reasoning models.
Accessibility for All Users
The o3-mini is now available to ChatGPT Plus, Team, and Pro subscribers, replacing the o1-mini in the model picker. Paid users can select “o3-mini-high” for higher intelligence at the cost of slower response times.
- ChatGPT Pro users will enjoy unlimited access to both o3-mini and o3-mini-high.
- Plus and Team users will see their daily message limits tripled from 50 to 150 messages per day.
For the first time, free users of ChatGPT can also access OpenAI’s reasoning capabilities by selecting the “Reason” option in the message composer or using the “re-generate” feature. This move democratizes access to advanced AI tools, allowing a broader audience to benefit from OpenAI’s cutting-edge technology.
Integration with Microsoft Services
The o3-mini is also available through Microsoft’s Azure OpenAI Service, further expanding its reach. This integration underscores the deepening collaboration between OpenAI and Microsoft, as both companies continue to push the boundaries of AI accessibility and performance.
Competitive Edge and Safety
While the o3-mini is not OpenAI’s most powerful model, it marks a significant leap in cost-efficiency and specialized performance. Priced at $0.55 per million cached input tokens and $4.40 per million output tokens, the o3-mini is 63% cheaper than the o1-mini and remains competitive with DeepSeek’s R1 reasoning model. This makes it an attractive option for businesses and developers seeking high-performance AI solutions without breaking the bank.
Beyond affordability, OpenAI has reinforced its commitment to safety by ensuring that the o3-mini is as safe or safer than the o1 family. Through rigorous red-teaming efforts and a deliberative alignment methodology, the model is designed to adhere strictly to OpenAI’s safety policies. The company even asserts that the o3-mini “significantly surpasses” GPT-4o in challenging safety and jailbreak evaluations, making it a highly reliable choice for responsible AI applications.
A New Chapter in AI Reasoning
The release of the o3-mini marks a pivotal moment for OpenAI as it continues to innovate in the AI space. “While o1 remains our broader general-knowledge reasoning model, o3-mini provides a specialized alternative for technical domains requiring precision and speed,” OpenAI wrote in a blog post. “The release of o3-mini marks another step in OpenAI’s mission to push the boundaries of cost-effective intelligence.”
With its focus on STEM optimization, developer-friendly features, and broad accessibility, the o3-mini is poised to become a valuable tool for users and developers worldwide.