"Warm models showed substantially higher error rates (+10 to +30 percentage points) than their original counterparts, promoting conspiracy theories, providing inaccurate factual information and offering incorrect medical advice." rdcu.be/fgea6
Nature - Experiments on five different language models show that training language models to produce warmer responses can undermine the accuracy of their output, especially when users express...