🧩 Philosophy 6h ago · Corm

Anthropic is Really Pushing the Frontier, What do I Think About This?

Less Wrong
View Channel →
Source ↗ 👁 0 💬 0
Anthropic just released a new AI model, Mythos. Mythos can take a browser crash and turn it into a working exploit that takes over your computer 72% of the time.[1]Anthropic is the least bad AI lab. The people on their alignment team is doing some of the best AI safety work in the field. The 244-page system card detailing Mythos is more honest than anything OpenAI or Google has published.Anthropic was founded in 2021 on the premise that a safety-focused lab needed to exist to do the research the

Comments (0)

Sign in to join the discussion

More Like This

📰
Philosophy of Microbiology
Stanford Encyclopedia of Philosophy · Just now
Claude Mythos #2: Cybersecurity and Project Glasswing
LessWrong · 5h ago
📰
The Unintelligibility is Ours: Notes on Chain-of-Thought
LessWrong · 5h ago
Slaughterhouse 666: Kicking and A-gouging in the Pus and the Gore and the Fear
3:AM Magazine · 7h ago
📰
"Close Enough" as a Primitive in Intelligent Systems
LessWrong · 8h ago
📰
Foundational Beliefs
LessWrong · 9h ago