Will Anthropic open-source the training code of their SAE interpretability effort? | Manifold

this year, fully: 14%

this year, significantly incomplete: 29%

next year: 19%

not before 2028: 23%

Other: 14%

We mean the code used to produce the Scaling Interpretability blog post.

Related questions

Will Anthropic restrict its models from being used by another IDE, before 2026?

Will xAI join the voluntary commitment by OpenAI/Anthropic to AISI to share major new models w/AISI prior to release?

Will Anthropic announce one of their AI systems is ASL-3 before the end of 2025?

Will OpenAI go back on its voluntary commitment to AISI to share major new models w/AISI prior to release?

Will Anthropic or OpenAI IPO first?

By the end of 2025, will OpenAI and Anthropic merge?

Will Google join the voluntary commitment by OpenAI/Anthropic to AISI to share major new models w/AISI prior to release?

Will Anthropic announce one of their AI systems is ASL-4 or higher before the end of 2025?

Will Anthropic be the best on AI safety among major AI labs at the end of 2025?

Will Anthropic release a (competitive) open-source LLM in the next 3 years?
