Will any model get above human level on the Simple Bench benchmark before September 1st, 2025.

Mini

Ṁ7927

Sep 2

15%

chance

ALL

AI ️ Technology Technical AI Timelines LLMs

Get Ṁ1,000 play money

12 Comments

Sort by:

The best model will probably be around 72% give or take 2%.

I like the odds for EOY tho

bought Ṁ175 NO

i'd bet even odds by EOY, but it's highly unlikely by september 1st.

The human baseline is now 83.7%. Unfortunate that the old baseline is the name but I will resolve to true if any model exceeds the human baseline published on https://simple-bench.com.

bought Ṁ50 YES

@HenryGeorge You can edit the name. Hover over it and a pen button will appear.

@NeuralBets done thx

We have a new reported human baseline. (83.7%) Is this a question about 92% or about the human level?

@MikhailDoroshenko human baseline

bought Ṁ250 NO

Seems unlikely without a major paradigm shift. 27% is sota and it doesn't seem to be increasing much with successive model generations

Is it true that this benchmark can be anything, and can be changed at any point? There are no hashes, no large sample of problems, no error bars, no evaluation code, no specifics on what a model can or cannot use... How do we know what a true performance is, except what the author says?

@dp gotta trust the guy

bought Ṁ532 YES

Description of the benchmark here: https://simple-bench.com/about.html

I have made some irrational bets to subsidize the market - as I cannot be bothered to figure out the correct way to do this.

I think you can normally just add liquidity?

Related questions

Related questions