Inlay

//

Profile

Loading...

We're hosting a poster session at the UnConference 🌟 𝗪𝗵𝘆 𝗣𝗿𝗲𝘀𝗲𝗻𝘁? - Connect with researchers working on LLM Safety and Security - Share insights from your recently published research - Get feedback and fresh perspectives - Find new collaborators among participants

8mo

📢 𝗖𝗮𝗹𝗹 𝗳𝗼𝗿 𝗣𝗼𝘀𝘁𝗲𝗿𝘀: 𝗟𝗟𝗠 𝗦𝗮𝗳𝗲𝘁𝘆 𝗮𝗻𝗱 𝗦𝗲𝗰𝘂𝗿𝗶𝘁𝘆 𝗪𝗼𝗿𝗸𝘀𝗵𝗼𝗽 @ 𝗘𝗟𝗟𝗜𝗦 𝗨𝗻𝗖𝗼𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲 📅 December 2, 2025 📍 Copenhagen An opportunity to discuss your work with colleagues working on similar problems in LLM safety and security

8mo

✨ 𝗦𝘂𝗯𝗺𝗶𝘀𝘀𝗶𝗼𝗻 𝗜𝗻𝗳𝗼: - Quick application - Accepting posters for 2025 papers from top ML / Security venues - 𝗗𝗲𝗮𝗱𝗹𝗶𝗻𝗲: October 28, 2025 - Notifications: October 31, 2025 Submission link: docs.google.com/forms/d/e/1F... Workshop website: llmsafety-unconference.github.io

🛠️ 𝗢𝗿𝗴𝗮𝗻𝗶𝘇𝗲𝗿𝘀: @egorzverev.bsky.social, @aideenfay.bsky.social, myself, Mario Fritz, @thegruel.bsky.social Looking forward to interesting discussions in Copenhagen! #EurIPS2025 #LLMSafety #LLMSecurity #AIResearch #ELLIS #AISafety #EurIPS

(1/n) In our #ICLR2025 paper, we explore a fundamental issue that enables prompt injections: 𝐋𝐋𝐌𝐬’ 𝐢𝐧𝐚𝐛𝐢𝐥𝐢𝐭𝐲 𝐭𝐨 𝐬𝐞𝐩𝐚𝐫𝐚𝐭𝐞 𝐢𝐧𝐬𝐭𝐫𝐮𝐜𝐭𝐢𝐨𝐧𝐬 𝐟𝐫𝐨𝐦 𝐝𝐚𝐭𝐚 𝐢𝐧 𝐭𝐡𝐞𝐢𝐫 𝐢𝐧𝐩𝐮𝐭. ✅ Definition of separation 👉 SEP Benchmark 🔍 LLM evals on SEP

Sahar Abdelnabi