๐๏ธ ๐๐๐ฐ #๐๐ข๐๐๐ ๐๐ฉ๐ข๐ฌ๐จ๐๐ ๐๐ฎ๐ญ!
In the new #WiAIRpodcast episode with @neuranna.bsky.social, we talk about the relationship between language, thought, and intelligence, with insights from neuroscience, cognitive science, and AI research.
๐ท YouTube: youtu.be/e36ryy0Dsdo
After a break, the #WiAIR Women in AI Research Podcast is back!
Our next guest is Anna Ivanova @neuranna.bsky.social from Georgia Tech, whose research tackles a fundamental question in AI and cognitive science:
๐ง What is the relationship between language and thought?
Don't miss!
The paper evaluates 14 LLMs across 5 model families on 9 multilingual benchmarks spanning knowledge, reading comprehension, NLI, commonsense & mathematical reasoning, truthfulness, and regional knowledge. (2/5 ๐งต)
Women in AI Research - WiAIR
Women in AI Research - WiAIR
Women in AI Research - WiAIR
Neural MT metrics show the strongest alignment with downstream performance. But the proxy has limits: some specialized benchmarks, including MGSM and INCLUDE, show weaker or more variable correlations. Task-specific evaluation remains necessary. (4/5 ๐งต)