When and how can test-time thinking allow models to use information latent in their training data? What are the benefits and tradeoffs relative to other solutions like synthetic data augmentation? Pleased to share (after a long delay) an exploration of these issues: arxiv.org/abs/2604.01430 thread:
Jeee 🐦⬛
I am very proud of our joint effort with @sreejan.bsky.social on the project "Reason to Play"
LRMs show human-like rule discovery, and their hidden states predict human brain activity during gameplay 10x better than previous methods
Interactive demo + paper:
botcs.github.io/reason-to-pl...
🚀 PhD position in #NeuroAI & neurodevelopment 🚀
Co-supervised by Sarah Lippé and myself, to investigate visual processing & cognition abnormalities in children with neurodevelopmental disorders in a neuroAI framework.
Full project details and how to apply here: tinyurl.com/kbuyntpn
🧠🤖 📈