//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
"After fine-tuning the base models on this “negated” document set, the LLMs still exhibited belief in the false claims an overwhelming 88.6 percent of the time, on average." arstechnica.com/ai/2026/05/l...
24d
Fine-tuning tests show "bias... toward confidently representing the claims as true."
arstechnica.com
LLMs believe false statements even after explicit warnings that they're false
annkspencer