🧩 Philosophy 4h ago · dominicq

Small models also found the vulnerabilities that Mythos found

Less Wrong

Source ↗ 👁 0 💬 0

We took the specific vulnerabilities Anthropic showcases in their announcement, isolated the relevant code, and ran them through small, cheap, open-weights models. Those models recovered much of the same analysis. Eight out of eight models detected Mythos's flagship FreeBSD exploit, including one with only 3.6 billion active parameters costing $0.11 per million tokens. A 5.1B-active open model recovered the core chain of the 27-year-old OpenBSD bug.

I've been more skeptical than the average rea