Will I judge GPT-5 to be smarter than o1 (not preview) after both are released?

Plus

Ṁ4175

resolved Aug 23

Resolved

YES

ALL

Resolves subjectively, based on my analysis of benchmarks both official, third party, and my own.

Some examples of benchmarks I consider are MMLU, ZebraLogic, SWE-bench, simplebench, ARC, and livebench.

Some of my own evals are game-playing (tic-tac-toe, and connect 4), and creative writing (giving a model 3 random nouns and asking it to write a story involving them)

AI OpenAI ChatGPT GPT-5 Speculation

Get Ṁ1,000 play money

🏅 Top traders

#	Name	Total profit
1		Ṁ147
2		Ṁ138
3		Ṁ73
4		Ṁ67
5		Ṁ44

2 Comments

Sort by:

What do you do if no model named GPT 5 will be released, but instead they continue with the oN scheme for all their models?

@yetforever Resolves n/a. Though a departed researcher already described working on GPT-5 so I would be surprised if that happened

🏅 Top traders

Related questions

Related questions