PhD student @umdcs, Member of @ClipUmd lab | Earlier @AdobeResearch, @IITRoorkee
Navita Goyal
while steering methods effectively control target behavior, they substantially increase LLMs’ vulnerability to jailbreaks, revealing a failure of robust specificity. If you’re at EACL, stop by my poster at 9AM today to hear more.
Here's a link to the full paper: aclanthology.org/2026.eacl-lo...
In this work, we argue that evaluating efficacy alone isn’t enough. Steering has two sides — efficacy and specificity — yet current evaluations predominantly focus on the former. We introduce a three-part framework for specificity (general, control, robustness) and show that...
What can cognitive science learn from AI? In infinitefaculty.substack.com/p/what-cogni... I outline how AI has found that scale and richness of learning experiences fundamentally change learning & generalization — and how I believe we should rethink cognitive experiments & theories in response.
This call is still open. I am looking to recruit, as are many other faculty at Cornell. We review folders as they come in, and will send offers until all positions are filled.
Please share with your network 🙏
Woah, this is so cool! How was I not aware of this? I just set mine up to prepare for NeurIPS and I am loving it already... it made thousands of accepted papers so much more tractable to navigate
Thanks WiAIR (@wiair.bsky.social) for featuring my work on your YouTube channel. Watch the video to hear about our work on inference-time steering, and why these interventions on LLMs may not be as “precise” as they look.
AIM's 2nd round of TTK hiring - building up to 30 - is up!
📅 Ddl 12/22/25
🔬 Accessibility & Learning, plus Sustainability & Social Justice
🧑‍🏫 Associate/Full Prof*
🔗 umd.wd1.myworkdayjobs.com/en-US/UMCP/j...
*Assistant-level candidates: apply to departments, mentioning AIM in a cover letter