Retrospective on my unsupervised elicitation challenge
This post contains spoilers for the unsupervised elicitation challenge of getting Claude to get my Ancient Greek homework right.
tl;dr: Opus 4.7 one-shots it; nothing else worked.
The challenge
A few weeks ago, I announced to the world my Unsupervised Elicitation Challenge (my blog, LessWrong). I’d encourage you to read that post for the context, but the tl;dr is that there was a fill-in-the-blank exercise early on in my Ancient Greek textbook that Claude Opus 4.6 didn’t fill out correctly by