Our paper on multilingual reasoning is accepted to Findings of #EMNLP2025! ๐ (OA: 3/3/3.5/4)
We show SOTA LMs struggle with reasoning in non-English languages; prompt-hack & post-training improve alignment but trade off accuracy.
๐ arxiv.org/abs/2505.22888
See you in Suzhou! #EMNLP
Recent Large Reasoning Models (LRMs) with thinking traces have shown strong performance on English reasoning tasks. However, their ability to think in other languages is less studied. This capability ...
[1/]๐กNew Paper
Large reasoning models (LRMs) are strong in English โ but how well do they reason in your language?
Our latest work uncovers their limitation and a clear trade-off:
Controlling Thinking Trace Language Comes at the Cost of Accuracy
๐Link: arxiv.org/abs/2505.22888