Will OpenAI's o4 get above 50% on humanity's last exam? | Manifold

46

Ṁ8673

2027

16% chance

Resolves N/A if there is no o4 model. o4 is defined as any compute setting on the o4 model. A score from something like Deep Research (which is based on o3/o4) would also resolve YES.

Update 2025-04-17 (PST) (AI summary of creator comment): o4-mini exclusion clarification
- o4 is defined as any compute setting on the o4 model.
- o4-mini is explicitly excluded and does not count as o4.
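
The resolution rules above can be read as a small decision procedure. The following is only a sketch of one reading, not the creator's official logic: the function name `resolve` and its parameters are hypothetical, "above 50%" is taken to mean strictly greater than 50, and the case where o4 exists but has no reported score is assumed to resolve NO.

```python
from typing import Optional

# Assumption: "above 50%" in the title means strictly greater than 50.
THRESHOLD_PERCENT = 50.0


def resolve(o4_exists: bool, best_o4_score: Optional[float]) -> str:
    """Return "N/A", "YES", or "NO" under one reading of the criteria.

    best_o4_score is the highest Humanity's Last Exam score (in percent)
    from any compute setting of o4, or from a product built on it such as
    Deep Research. Scores from o4-mini must not be passed in, since the
    creator excluded that model. None means no qualifying score exists.
    """
    if not o4_exists:
        # No o4 model at all: the market resolves N/A.
        return "N/A"
    if best_o4_score is not None and best_o4_score > THRESHOLD_PERCENT:
        return "YES"
    # Assumption: a missing or insufficient score resolves NO.
    return "NO"
```

Under this reading, a score of exactly 50% would not be enough for YES.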

bought Ṁ25 YES

They would have to almost 4x the o4-mini score for this to happen, so it is definitely unlikely. However, given how much they were willing to spend on compute to get an unexpectedly high score on a similar high-profile benchmark with o3 earlier, it could happen, especially given a few more months of tinkering.

12% was simply a bit too low

o4 mini is not o4 btw

Humanity's last exam?

@ken https://agi.safe.ai/

Related questions

Will OpenAI disappear before 2034?

49% chance (+14% 1d)

Humanity's Last Exam score in 2025?

Top score on Humanity's Last Exam > 50% by 2029?

Top score on Humanity's Last Exam > 50% by 2027?

Will AI achieve 95% or higher on the Humanity's Last Exam benchmark before 2030?

What will be the best AI performance on Humanity's Last Exam by December 31st 2025?

Will "OpenAI o1" make the top fifty posts in LessWrong's 2024 Annual Review?

Top score on Humanity's Last Exam > 50% by 2028?

Will OpenAI cause human extinction in the next 5 years?

Will OpenAI become nothing by 2030?
