"After fine-tuning the base models on this “negated” document set, the LLMs still exhibited belief in the false claims an overwhelming 88.6 percent of the time, on average." arstechnica.com/ai/2026/05/l...
Fine-tuning tests show "bias... toward confidently representing the claims as true."