Will Claude 3.5 Opus beat OpenAI's best released model on the arena.lmsys.org leaderboard?
Mini
38
Ṁ4955Jan 1
35%
chance
1D
1W
1M
ALL
OpenAI's best released model could be GPT-4, GPT-4o, or something else. It does not count as an OpenAI model unless it's made available to the public to try, and is known to be from OpenAI (e.g. the model can not be a secret, pseudonymous release). If arena.lmsys.org is not available at the time, the successor site or most similar leaderboard will be used.
Resolves yes if Claude 3.5 Opus is ranked above all OpenAI models 1 week after it is put on the leaderboard.
Get Ṁ1,000 play money
Related questions
Related questions
By the end of Q1 2025 will an open source model beat OpenAI’s o1 model?
44% chance
Will Claude Opus be ranked in the top 20 on the Chatbot Arena Leaderboard two years from today (3/10/24)?
31% chance
By the end of Q2 2025 will an open source model beat OpenAI’s o1 model?
72% chance
Will Claude 3.5 Opus be able to draw me in tic-tac-toe while playing as O at least 1/3 of the time?
52% chance
What will be the *first* ELO Rating of Claude 3.5 Opus in the LMSYS Arena?
Will Claude 3.5 Opus have a higher Chat Arena Elo than GPT-5?
9% chance
Will OpenAI launch a significantly better model for ChatGPT paying users in 2024? (>= 100 points diff on ChatBot Arena)
23% chance
Will the top model by OpenAI rank 3rd (or lower) behind 2 other model families at any point before 2026?
48% chance
Will Claude 3.5 Haiku be better than Claude 3 Opus?
48% chance
Will any open-source model rank in the top 3 on Chatbot Arena at any point in 2024? (resolves based on ELO rating)
15% chance