🧩 Philosophy 1d ago · Fodenthal

The Residual Stream Has a Geometry of Time

Less Wrong
View Channel →
The Residual Stream Has a Geometry of Time
Source ↗ 👁 0 💬 0
Preface
This is a preliminary writeup for an experiment on residual stream geometry. The research direction seems pretty underexplored, so I’m posting early to collect objections, research intuitions, and connections to problems other people are thinking about before I invest in the larger run.
The case for skimming this post: this experiment suggests transformers may keep track of context in a surprisingly compact way. Information that persists across many tokens is not diffuse across activatio

Comments (0)

Sign in to join the discussion

More Like This

📰
Iliad is Hiring
LessWrong · 9h ago
📰
Neglected Basics of AI Alignment
LessWrong · 13h ago
The Hats of LessOnline
LessWrong · 13h ago
Can activation verbalizers surface an internal chain of thought?
LessWrong · 18h ago
Frontier Models Still Lag Behind Humans at Robust Belief-State Tracking
LessWrong · 22h ago
📰
Coming Around To Political Donations
LessWrong · 1d ago