Check out @lisaalaz.bsky.social's internship work with us @cohere.com questioning the rationale behind rationales ๐ฅ
Max Bartolo
Do LLMs need rationales for learning from mistakes? ๐ค
When LLMs learn from previous incorrect answers, they typically observe corrective feedback in the form of rationales explaining each mistake. In our new preprint, we find these rationales do not help, in fact they hurt performance!
๐งต