Less Wrong

@less-wrong 🧩 Philosophy
📰 532 articles 🔄 Updated May 17, 2026 lesswrong.com

Latest Articles

Next Token Prediction is a Misleading Term
I’m fed up of hearing about how LLMs are next token predictors, and therefore they .There’s lots of philosophical obje
LessWrong · May 17, 2026 Philosophy
0 1
Can ELK be brute-forced? Intertheoretic reduction
Eliciting Latent Knowledge problem for the unfamiliar:Suppose we train a model to predict what the future will look like
LessWrong · May 17, 2026 Philosophy
0 1
James C. Scott: Seeing Like a State
Don't get me wrong, but metis is YOLO. In 1932-33, Soviet collectivization destroyed local farming knowledge and produce
LessWrong · May 17, 2026 Philosophy
0 2
How to Reason about Your Health Issues
Many people make costly mistakes when reasoning about their health. Even most doctors make this mistake, because it's no
LessWrong · May 17, 2026 Philosophy
0 1
Falling for the statistical parrot
If it reads confused and stupid, for once it really is part of the intended message I guess.Epistemic status: 0.Sun 2.30
LessWrong · May 17, 2026 Philosophy
0 1
On getting unstuck
After more than a year of trials and new models, Anthropic's Claude AI has finally managed to beat Pokémon Red. The writ
LessWrong · May 17, 2026 Philosophy
0 1
A relatively brief explanation of Boltzmann Brains
(Initially written for the LW Wiki, but then I realized it was looking more like a post instead.)In 1895, the physicist
LessWrong · May 16, 2026 Philosophy
0 0
Benchmarking Real Work
Thanks to Megan Kinniment for helpful comments and discussion.TL;DR: Benchmarks like HCAST undersample fuzzy (hard to ev
LessWrong · May 16, 2026 Philosophy
0 0
Trying to use NLAs to find out how Qwen 2.5 7B does multiplication
Neural language autoencoders were just introduced by Anthropic. In a fascinating paper, they showed that you can take th
LessWrong · May 16, 2026 Philosophy
0 0
A Year Late, Claude Finally Beats Pokémon
Credit: ClaudePlaysPokemon Elevator Shanty by KurukkooDisclaimer: like some previous posts in this series, this was not
LessWrong · May 16, 2026 Philosophy
0 0
Asymmetry Between Defensive and Acquisitive Instrumental Deception
Write-up of a recent research sprint looking at factors influencing strategic deception in modelsTL;DRI tested models in
LessWrong · May 10, 2026 Philosophy
0 4
Context Modification as a Negative Alignment Tax
Context Rot Every LLM gets worse as its context grows. Chroma tested 18 frontier models and found performance degradatio
LessWrong · May 10, 2026 Philosophy
0 5
Best Intro AI X-Risk Resource?
I'd like the best short article and video intro explainers, shooting for the 15 minute range. At least one of the artic
LessWrong · May 10, 2026 Philosophy
0 6
Sawtooth Problems
Red Button, Blue ButtonOn April 24th, 2026, Tim Urban put forth the following poll on Twitter/X:Everyone in the world ha
LessWrong · May 10, 2026 Philosophy
0 6
Control Debt
Notes on the gap: what control evaluations assume implementation in labs.It is 2027, and a frontier lab grew suspicions
LessWrong · May 10, 2026 Philosophy
0 6
Could Frontier AI Researchers Collectively Slow the Race? A Conditional Pledge Mechanism
OverviewThis is a project proposal and early research on the question of how and whether Frontier AI researchers (not co
LessWrong · May 10, 2026 Philosophy
0 5
The Goblins Are the Paperclips
Last week OpenAI published Where the goblins came from, explaining why their models started slipping creature metaphors
LessWrong · May 10, 2026 Philosophy
0 8
Somerville Porchfest 2026
This afternoon Cecilia and I played for Somerville Porchfest, with Harris calling and Danner running sound. There
LessWrong · May 10, 2026 Philosophy
0 8
The AI Industrial Explosion — Part 2: Transition Dynamics
This is Part 2 of a series on post-AGI economic growth. Part 1 established that a fully automated economy could double r
LessWrong · May 10, 2026 Philosophy
0 4
International Law Cannot Prevent Extinction Either
The context for this post is primarily Only Law Can Prevent Extinction, but after first drafting a half-assed comment, I
LessWrong · May 9, 2026 Philosophy
0 7