1\ Can you make this Roman-numeral equation true by moving exactly one matchstick?
1/ New preprint! Reasoning models often require hundreds of task examples and thousands of rollouts to improve on a task. How can they learn more from much less?
Introducing CORE: contrastive self-reflection for rapid, sample-efficient, and interpretable self-improvement 🧵
Video
Linas Nasvytis
Linas Nasvytis
Many of us were taught experiments are for testing hypotheses. In Ch 1 of Experimentology, our free, open methods textbook, my coauthors and I argue differently: experiments are for estimating the magnitude of causal effects.
This reframing has important consequences. 🧵
experimentology.io
We've updated the preprint of our Naturalistic Computational Cognitive Science paper (arxiv.org/abs/2502.20349) — we've tried to clarify and streamline the arguments, and added some new examples: 1/5