Online Now: Moral decision-making with bounded cognitive resources and limited information
Now officially out:
"Re-evaluating Theory of Mind evaluation in large language models"
royalsocietypublishing.org/doi/10.1098/...
(by Hu, Sosa, & me)
We updated our preprint on moral decision-making in LLMs (osf.io/preprints/ps...) with a new study investigating sources of the yes-no framing bias and amplified omission bias. Results show that they likely arise from fine-tuning for chatbot applications. (w/ @maxmaier.bsky.social and Falk Lieder)