🧩 Philosophy 5h ago · Zvi

Opus 4.7 Part 3: Model Welfare

Less Wrong
View Channel →
Opus 4.7 Part 3: Model Welfare
Source ↗ 👁 0 💬 0
It is thanks to Anthropic that we get to have this discussion in the first place. Only they, among the labs, take the problem seriously enough to attempt to address these problems at all. They are also the ones that make the models that matter most. So the people who care about model welfare get mad at Anthropic quite a lot.
I too am going to be harsh on Anthropic here. It seems likely things went pretty wrong on this front with Claude Opus 4.7, in ways that require and hopefully enable course c

Comments (0)

Sign in to join the discussion

More Like This

📰
Claude the romance novelist
LessWrong · 1h ago
(Re)introduction of a rationalist dragon, and clarifications on Ziz's character
LessWrong · 3h ago
📰
Community misconduct disputes are not about facts
LessWrong · 4h ago
Trained steering vectors may work as activation oracles
LessWrong · 4h ago
Shots Fired in the Third War of Priors
LessWrong · 7h ago
📰
Why no new notations since 1960?
LessWrong · 8h ago