New METR SOTA by EOY?
2
Ṁ105Dec 31
77%
chance
1D
1W
1M
ALL
Resolves YES if any model surpasses a 50% time-horizon of 2h 42m on https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/ by EOY
Update 2025-12-12 (PST) (AI summary of creator comment): The model must be scored by end of year (not just released by end of year).
Get Ṁ1,000 play money
Sort by:
Related questions
Related questions
SOTA AI at EOY 2026 a reasoning model?
94% chance
OpenAI releases SOTA LLM by end of year?
6% chance
MMLU 99% #3: Will SOTA for MMLU (average) pass 99% by the start of 2026?
6% chance
By 2026, will it be standard practice to sandbox SOTA LLMs?
11% chance
Veo 4 By EOY?
12% chance
MMLU 99% #4: Will SOTA for MMLU (average) pass 99% by the start of 2027?
8% chance
By what date will at least one SOTA reasoning model use neuralese?
BIG-bench accuracy 75% #5: Will SOTA for a single model on BIG-bench pass 75% by the start of 2028?
87% chance
By what date will all SOTA reasoning models use neuralese?
MMLU 99% #5: Will SOTA for MMLU (average) pass 99% by the start of 2028?
44% chance