🌐 Global Economy 1d ago · Tyler Cowen

Self-fulfilling misalignment?

Marginal Revolution
View Channel →
Source ↗ 👁 0 💬 0
From Anthropic:
We started by investigating why Claude chose to blackmail. We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.
And here is Alex Turner on the topic of self-fulfilling misalignment.  I raised this possibility some while ago in a Free Press column, and mainly was met with hostility.
The social return to a positive world view, and avoiding negative emotional contagion, never has been higher.
The post Self-ful

Comments (0)

Sign in to join the discussion

More Like This

📰
Will AI kill the research paper?
Marginal REVOLUTION · 10h ago
Robert Solow kicking Lucas and Sargent in the pants
Real-World Economics Review Blog · 12h ago
📰
Which are the most common everyday phenomena that we don’t properly understand?
Marginal REVOLUTION · 16h ago
📰
Saturday assorted links
Marginal REVOLUTION · 19h ago
📰
The UAP report so far
Marginal REVOLUTION · 1d ago
Public Choice Outreach!
Marginal REVOLUTION · 1d ago