Inlay

ProfilePosts

New post by @michelleding.bsky.social on resources for the Brown community in the aftermath of the shooting. open.substack.com/pub/michelle...

Serena Booth (@reniebird.bsky.social) reflects on the challenge of collecting human preferences to steer AI systems and wonders if we are doing it all wrong. cntr.brown.edu/news/2025-11... This is part 1 of 3 of CNTR researcher reflections on COLM 2025.

💡We kicked off the SoLaR workshop at #COLM2025 with a great opinion talk by @michelleding.bsky.social & Jo Gasior Kavishe (joint work with @victorojewale.bsky.social and @geomblog.bsky.social ) on "Testing LLMs in a sandbox isn't responsible. Focusing on community use and needs is."

I've come to seriously value any opportunity I get to create a safe space for those coming up behind me. It's incredibly important - and I know this because I'm here, able to do this work, precisely because of those that chose to endure who knows what in order to create the right space for me ❣️

It's been a journey of nearly 3 years, but I'm very excited to announce the CNTR AISLE Portal! 🚀 cntr-aisle.org It’s a new way to review and evaluate the 1,000+ AI bills introduced in the U.S. over the last three years. Check out the Bill Library and our Profiles#AIPolicy #OpenData

Agents prioritize task completion rather than whether they should act. This is a consequence of how they are trained. My student @victorojewale.bsky.social has been investigating this and just wrote a (prize winning) paper arguing why (and how) we need a notion of "informed abstention". Link below.

How do we stop playing whack-a-mole when it comes to deepfake abuse? 🧵⚠️

It's about what's hidden, and what new deficiencies the tech carries with it. @victorojewale.bsky.social opines on the evolution of deployed AI and its limits. victorojewale.substack.com/p/from-exper...

Technologies like synthetic data, evaluations, and red-teaming are often framed as enhancing AI privacy and safety. But what if their effects lie elsewhere? In a new paper with @realbrianjudge.bsky.social at #EAAMO25, we pull back the curtain on AI safety's toolkit. (1/n) arxiv.org/pdf/2509.22872