Grok 4 Heavy gets on Humanity's Last Exam leaderboard? | Manifold

2

Ṁ401

Sep 2

24% chance

The market resolves YES if Grok 4 Heavy has a score in the leaderboard section of https://agi.safe.ai/, regardless of settings (text-only, tools-allowed, etc.).

While the market stays open, it resolves NO when either of the following happens:

The next major iteration of Grok models (for example Grok 4.2, Grok 4.5, or Grok 5) is released by xAI without Grok 4 Heavy being generally accessible in the official xAI API. Access limited to, e.g., paid users still counts as generally accessible.
Grok 4 Heavy has been generally accessible in the official xAI API for a month without appearing on the leaderboard.

In short, YES if Grok 4 Heavy ever appears on the HLE leaderboard; NO if either (i) a newer Grok generation ships first, or (ii) Grok 4 Heavy is on the xAI API for 30 days without reaching the leaderboard.
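
The resolution rules above can be sketched as a small decision function. This is only an illustrative reading of the criteria, not anything Manifold or the market creator runs: the names MarketState, resolve, and API_GRACE_PERIOD are hypothetical, "a month" is taken as 30 days following the summary, and the YES check is assumed to take priority when evaluating a single snapshot.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Optional

# Assumption: "a month" is read as 30 days, as in the summary of the criteria.
API_GRACE_PERIOD = timedelta(days=30)


@dataclass
class MarketState:
    """Hypothetical snapshot of the facts the resolution criteria depend on."""
    today: date
    on_leaderboard: bool                     # Grok 4 Heavy has any score on the HLE leaderboard
    api_access_start: Optional[date] = None  # first day of general xAI API access, if any
    next_grok_released: bool = False         # e.g. Grok 4.2, Grok 4.5, Grok 5


def resolve(state: MarketState) -> Optional[str]:
    """Return "YES", "NO", or None (market stays open) for one snapshot."""
    # YES: any leaderboard score counts, regardless of settings.
    if state.on_leaderboard:
        return "YES"

    has_api_access = state.api_access_start is not None

    # NO (i): a newer Grok generation ships while Grok 4 Heavy is not on the API.
    if state.next_grok_released and not has_api_access:
        return "NO"

    # NO (ii): on the API for the full grace period without reaching the leaderboard.
    if has_api_access and state.today - state.api_access_start >= API_GRACE_PERIOD:
        return "NO"

    return None
```

For example, with made-up dates, resolve(MarketState(today=date(2025, 7, 15), on_leaderboard=False, api_access_start=date(2025, 7, 1))) leaves the market open, because only 14 of the 30 days have elapsed.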

Related questions

Open-source OpenAI model beats Grok 4 on LMArena?

10% chance (-12% 1d)

Humanity's Last Exam score in 2025?

Top score on Humanity's Last Exam > 90% by what year?

Top score on Humanity's Last Exam > 60% by what year?

Top score on Humanity's Last Exam > 50% by 2028?

What is Grok 4 Heavy's performance on METR's task length evaluation?

Top score on Humanity's Last Exam > 80% by what year?

Gemini 2.5 Pro DeepThink gets on Humanity's Last Exam leaderboard before September?

18% chance (-32% 1d)

Top score on Humanity's Last Exam > 70% by what year?

Top score on Humanity's Last Exam > 50% by 2029?
