Inlay

🎉 New preprint: Can #LLMs automate reproducibility checks? 👉We test this on 76 social & behavioral studies ✅Our LLM pipeline arrived at original qualitative conclusions in 96% of cases 🎯We recovered the effect sizes in 41% (human reanalysts: 34%) arxiv.org/abs/2606.13670

Reproducibility in the social and behavioral sciences is typically evaluated by independent researchers who reanalyze the original data to assess whether the published findings can be recovered. Howev...