OpenAI erzielt mit einem neuen Sprachmodell bei der Internationalen Mathematik-Olympiade einen bemerkenswerten Erfolg.
In Kürze
- Modell löst fünf von sechs Aufgaben korrekt
- Erzielt 35 von 42 möglichen Punkten
- Innovative Anwendung bestehender Modelle ohne spezielle Anpassungen
OpenAI’s Remarkable Achievement at the International Mathematics Olympiad 2025
OpenAI has achieved a remarkable success with a new experimental language model: At the International Mathematics Olympiad (IMO) 2025, the model performed at a gold medal level. It solved five out of six problems correctly, scoring an impressive 35 out of 42 possible points. What’s special about this? It is not a system specifically developed for mathematics, but a so-called reasoning language model known for its problem-solving and reasoning abilities.
Innovative Approach with Existing Models
OpenAI emphasizes that this result is not due to special adaptations, but rather an innovative application of existing models. In comparison, other leading AI models have recently disappointed in the same competition. This shows that OpenAI has made a clear step forward.
Reinforcement Learning: A Key Factor
A crucial factor for this success is a special learning method called Reinforcement Learning. This method allows models to improve through experience. This technique has already proven effective in other challenges, such as an international programming competition.
Future Implications and Current Limitations
Despite the impressive performance, the result has not yet been independently confirmed. OpenAI currently does not plan to make the model available for public applications. Instead, it remains a research project with promising potential for future developments. The result has even surprised OpenAI itself and could be seen as a significant step for AI systems with general thinking abilities.
Quellen
- Quelle: OpenAI
- Der ursprüngliche Artikel wurde hier veröffentlicht
- Dieser Artikel wurde im Podcast KI-Briefing-Daily behandelt. Die Folge kannst du hier anhören.




