
🎧 Listen to the episode! 🎬 YouTube: www.youtube.com/watch?v=3QXH... 🎙️ Spotify: open.spotify.com/episode/1aWC... 🍎 Apple: podcasts.apple.com/ca/podcast/1... 📄 Paper: arxiv.org/pdf/2601.11778 #WiAIR #MultilingualAI #LLMs #MachineTranslation #NLProc

🎙️ 𝐍𝐞𝐰 #𝐖𝐢𝐀𝐈𝐑 𝐄𝐩𝐢𝐬𝐨𝐝𝐞 𝐎𝐮𝐭! In the new #WiAIRpodcast episode with @neuranna.bsky.social, we talk about the relationship between language, thought, and intelligence, with insights from neuroscience, cognitive science, and AI research. 📷 YouTube: youtu.be/e36ryy0Dsdo

After a break, the #WiAIR Women in AI Research Podcast is back! Our next guest is Anna Ivanova @neuranna.bsky.social from Georgia Tech, whose research tackles a fundamental question in AI and cognitive science: 🧠 What is the relationship between language and thought? Don't miss it!



100% Jailbreak Success? The Hard Truth About AI Safety, with Dr. Saadia Gabriel (Part 2)

The paper evaluates 14 LLMs across 5 model families on 9 multilingual benchmarks spanning knowledge, reading comprehension, NLI, commonsense & mathematical reasoning, truthfulness, and regional knowledge. (2/5 🧵)

Women in AI Research - WiAIR


Neural MT metrics show the strongest alignment with downstream performance. But the proxy has limits: some specialized benchmarks, including MGSM and INCLUDE, show weaker or more variable correlations. Task-specific evaluation remains necessary. (4/5 🧵)
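To make "alignment with downstream performance" concrete, here is a minimal sketch of how a correlation between an MT metric and a benchmark score could be computed across models. It assumes Pearson correlation purely for illustration; the thread does not say which correlation measure the paper uses. The `pearson` helper and all numbers are hypothetical placeholders, not results from the paper.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance and variances (unnormalized; the 1/n factors cancel out).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Hypothetical placeholder values, one entry per model.
# These are NOT numbers from the paper.
mt_metric_scores = [0.62, 0.71, 0.55, 0.80, 0.68]
benchmark_accuracy = [0.48, 0.57, 0.41, 0.69, 0.52]

r = pearson(mt_metric_scores, benchmark_accuracy)
print(f"Pearson r = {r:.3f}")
```

A value of r near 1 would mean the MT metric ranks models much like the benchmark does; a weaker or unstable r, as the post reports for MGSM and INCLUDE, is the case where task-specific evaluation is still needed.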