//
sign in
Profile
by @danabra.mov
Profile
by @dansshadow.bsky.social
Profile
by @jimpick.com
AviHandle
by @danabra.mov
AviHandle
by @dansshadow.bsky.social
AviHandle
by @katherine.computer
EventsList
by @katherine.computer
ProfileHeader
by @dansshadow.bsky.social
ProfileHeader
by @danabra.mov
ProfileMedia
by @danabra.mov
ProfilePlays
by @danabra.mov
ProfilePosts
by @danabra.mov
ProfilePosts
by @dansshadow.bsky.social
ProfileReplies
by @danabra.mov
Record
by @atsui.org
Skircle
by @danabra.mov
StreamPlacePlaylist
by @katherine.computer
+ new component
ProfilePosts




(1/n) In our #ICLR2025 paper, we explore a fundamental issue that enables prompt injections: ๐‹๐‹๐Œ๐ฌโ€™ ๐ข๐ง๐š๐›๐ข๐ฅ๐ข๐ญ๐ฒ ๐ญ๐จ ๐ฌ๐ž๐ฉ๐š๐ซ๐š๐ญ๐ž ๐ข๐ง๐ฌ๐ญ๐ซ๐ฎ๐œ๐ญ๐ข๐จ๐ง๐ฌ ๐Ÿ๐ซ๐จ๐ฆ ๐๐š๐ญ๐š ๐ข๐ง ๐ญ๐ก๐ž๐ข๐ซ ๐ข๐ง๐ฉ๐ฎ๐ญ. โœ… Definition of separation ๐Ÿ‘‰ SEP Benchmark ๐Ÿ” LLM evals on SEP
Mar 18, 2025
Egor Zverev
8mo
8mo
We're hosting a poster session at the UnConference ๐ŸŒŸ ๐—ช๐—ต๐˜† ๐—ฃ๐—ฟ๐—ฒ๐˜€๐—ฒ๐—ป๐˜? - Connect with researchers working on LLM Safety and Security - Share insights from your recently published research - Get feedback and fresh perspectives - Find new collaborators among participants
๐Ÿ“ข ๐—–๐—ฎ๐—น๐—น ๐—ณ๐—ผ๐—ฟ ๐—ฃ๐—ผ๐˜€๐˜๐—ฒ๐—ฟ๐˜€: ๐—Ÿ๐—Ÿ๐—  ๐—ฆ๐—ฎ๐—ณ๐—ฒ๐˜๐˜† ๐—ฎ๐—ป๐—ฑ ๐—ฆ๐—ฒ๐—ฐ๐˜‚๐—ฟ๐—ถ๐˜๐˜† ๐—ช๐—ผ๐—ฟ๐—ธ๐˜€๐—ต๐—ผ๐—ฝ @ ๐—˜๐—Ÿ๐—Ÿ๐—œ๐—ฆ ๐—จ๐—ป๐—–๐—ผ๐—ป๐—ณ๐—ฒ๐—ฟ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐Ÿ“… December 2, 2025 ๐Ÿ“ Copenhagen An opportunity to discuss your work with colleagues working on similar problems in LLM safety and security
โœจ ๐—ฆ๐˜‚๐—ฏ๐—บ๐—ถ๐˜€๐˜€๐—ถ๐—ผ๐—ป ๐—œ๐—ป๐—ณ๐—ผ: - Quick application - Accepting posters for 2025 papers from top ML / Security venues - ๐——๐—ฒ๐—ฎ๐—ฑ๐—น๐—ถ๐—ป๐—ฒ: October 28, 2025 - Notifications: October 31, 2025 Submission link: docs.google.com/forms/d/e/1F... Workshop website: llmsafety-unconference.github.io
๐Ÿ› ๏ธ ๐—ข๐—ฟ๐—ด๐—ฎ๐—ป๐—ถ๐˜‡๐—ฒ๐—ฟ๐˜€: @egorzverev.bsky.social, @aideenfay.bsky.social, myself, Mario Fritz, @thegruel.bsky.social Looking forward to interesting discussions in Copenhagen! #EurIPS2025 #LLMSafety #LLMSecurity #AIResearch #ELLIS #AISafety #EurIPS