
Will OpenAI's o4 get above 50% on humanity's last exam?
Plus
46
Ṁ83182027
26%
chance
1D
1W
1M
ALL
Resolves N/A if there is no o4 model. o4 is defined as any compute setting on the o4 model. Something like deepresearch (which is based on o3/o4) would also resolve yes.
Update 2025-04-17 (PST) (AI summary of creator comment): o4 mini Exclusion Clarification
o4 is defined as any compute setting on the o4 model.
o4 mini is explicitly excluded from being considered as o4.
Get Ṁ1,000 play money
Sort by:
They have to almost 4x the o4-mini score for this to happen, so definitely unlikely. However, given how much they were willing to spend on compute to get an unexpectedly high score on a similar high profile benchmark with o3 earlier it could happen, especially given a few months more of tinkering.
12% was simply a bit too low
Related questions
Related questions
Humanity's Last Exam score in 2025?
-
Will OpenAI get perfect score at IOI 2025?
34% chance
When will OpenAI announce o4 (full)
What will be the best AI performance on Humanity's Last Exam by December 31st 2025?
Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?
75% chance
Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?
86% chance
Will "OpenAI o1" make the top fifty posts in LessWrong's 2024 Annual Review?
8% chance
Top score on Humanity's Last Exam > 50% by 2029?
94% chance
Top score on Humanity's Last Exam > 50% by 2028?
95% chance
Top score on Humanity's Last Exam > 50% by 2027?
86% chance