MANIFOLD
BrowseUS ElectionNewsAbout
New METR SOTA by EOY?
2
Ṁ105
Dec 31
77%
chance
1D
1W
1M
ALL

Resolves YES if any model surpasses a 50% time-horizon of 2h 42m on https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/ by EOY

  • Update 2025-12-12 (PST) (AI summary of creator comment): The model must be scored by end of year (not just released by end of year).

METR
Get Ṁ1,000 play money
2 Comments
Sort by:

Does it have to be scored by the end of the year or does the model have to be released by the end of the year

@BenAybar scored

Related questions

SOTA AI at EOY 2026 a reasoning model?
+13% 1d94% chance
MMLU 99% #3: Will SOTA for MMLU (average) pass 99% by the start of 2026?
6% chance
Veo 4 By EOY?
12% chance
By what date will at least one SOTA reasoning model use neuralese?
By what date will all SOTA reasoning models use neuralese?
OpenAI releases SOTA LLM by end of year?
-81% 1d6% chance
By 2026, will it be standard practice to sandbox SOTA LLMs?
11% chance
MMLU 99% #4: Will SOTA for MMLU (average) pass 99% by the start of 2027?
8% chance
BIG-bench accuracy 75% #5: Will SOTA for a single model on BIG-bench pass 75% by the start of 2028?
87% chance
MMLU 99% #5: Will SOTA for MMLU (average) pass 99% by the start of 2028?
44% chance

Related questions

SOTA AI at EOY 2026 a reasoning model?
94% chance
OpenAI releases SOTA LLM by end of year?
6% chance
MMLU 99% #3: Will SOTA for MMLU (average) pass 99% by the start of 2026?
6% chance
By 2026, will it be standard practice to sandbox SOTA LLMs?
11% chance
Veo 4 By EOY?
12% chance
MMLU 99% #4: Will SOTA for MMLU (average) pass 99% by the start of 2027?
8% chance
By what date will at least one SOTA reasoning model use neuralese?
BIG-bench accuracy 75% #5: Will SOTA for a single model on BIG-bench pass 75% by the start of 2028?
87% chance
By what date will all SOTA reasoning models use neuralese?
MMLU 99% #5: Will SOTA for MMLU (average) pass 99% by the start of 2028?
44% chance
BrowseElectionNewsAbout