How long until one of Gemini, Claude, etc... match the capabilities of O1? | Manifold

21

Ṁ5129

2026

Oct 12th 2024: 2%
Dec 12th 2024: 9%
April 12th 2025: 58%
September 12th 2025: 20%
April 12th 2026: 5%
Other: 5%

OpenAI's O1 model represents a new paradigm of LLMs. How long until a competitor catches up?

"Catches up" / "matches capabilities" is defined as matching or exceeding the O1 pass@1 benchmarks on AIME, Codeforces, and GPQA at the time of publication:

74.4% accuracy on AIME
89th percentile on Codeforces
78% accuracy on GPQA
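
As a rough illustration of how the criteria above could be checked, here is a minimal Python sketch. It assumes that all three benchmarks must be matched or exceeded at once (one reading of "AIME, Codeforces, and GPQA"), and the names `O1_THRESHOLDS`, `matches_o1`, and `shortfalls` are hypothetical helpers, not anything Manifold or the market creator provides.

```python
# Thresholds copied from the market description (pass@1 figures).
# Assumption: a competitor must meet all three to count as "caught up".
O1_THRESHOLDS = {
    "aime": 74.4,        # AIME figure from the description
    "codeforces": 89.0,  # Codeforces percentile
    "gpqa": 78.0,        # GPQA accuracy, in percent
}


def matches_o1(scores):
    """Return True only if every listed benchmark is matched or exceeded."""
    return all(
        name in scores and scores[name] >= threshold
        for name, threshold in O1_THRESHOLDS.items()
    )


def shortfalls(scores):
    """Map each unmet benchmark to its gap; None marks a missing score."""
    gaps = {}
    for name, threshold in O1_THRESHOLDS.items():
        value = scores.get(name)
        if value is None:
            gaps[name] = None
        elif value < threshold:
            gaps[name] = round(threshold - value, 2)
    return gaps
```

Calling `matches_o1` with a dictionary of a model's reported pass@1 scores returns a single yes/no answer, while `shortfalls` shows which benchmarks still fall short. Any scores passed in would need to come from the competitor's own published results.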

AI OpenAI Anthropic Google Gemini

The Oct 12th 2024 option should be resolved.

@Adamacki I can’t seem to find a way to partially resolve the market

bought Ṁ50 April 12th 2025 YES

Apparently o1's AIME score was pass@10000, not pass@1. Criteria should be updated accordingly.

@JaundicedBaboon I updated the benchmark to the pass@1 score

Related questions

Before February 2025, will a Gemini model exceed Claude 3.5 Sonnet 10/22's Global Average score on LiveBench?

Will Gemini achieve a higher score on the SAT compared to GPT-4?

Will Gemini exceed the performance of GPT-4 on the 2022 AMC 10 and AMC 12 exams?

Will Gemini 2 ship before GPT-5?

Will Gemini Ultra outperform GPT-4V on visual reasoning by the end of 2024?

Before February 2025, will a Gemini model exceed Claude 3.5 Sonnet 10/22's Global Average score on Simple Bench?

What will be true of Gemini 2?

Will Gemini 1.5 Pro seem to be as good as Gemini 1.0 Ultra for common use cases? [Poll]

Will "Gemini [Ultra, 1.0] smash GPT-4 by 5x"?

Will Gemini be released before 2024? x Will GPT-5 be released before 2025?
