🧩 Philosophy 12h ago · Quirinus_Quirrell

Neglected Basics of AI Alignment

Less Wrong
I came into this world as the misunderstood hero of Harry Potter and the Methods of Rationality. While some characters inside that story would call me a villain, the narrator's-eye view clearly shows that I saved that world from total destruction, inspired the next generation of leaders, and taught the best Defense Against the Dark Arts class in the Harry Potter multiverse. And, being fictional characters, none of the people I killed were moral patients at all.

When I first came to visit this wor

