New post by @michelleding.bsky.social on resources for the Brown community in the aftermath of the shooting. open.substack.com/pub/michelle...
Serena Booth (@reniebird.bsky.social) reflects on the challenge of collecting human preferences to steer AI systems and wonders if we are doing it all wrong. cntr.brown.edu/news/2025-11...
This is part 1 of 3 of CNTR researcher reflections on COLM 2025.
💡We kicked off the SoLaR workshop at #COLM2025 with a great opinion talk by @michelleding.bsky.social & Jo Gasior Kavishe (joint work with @victorojewale.bsky.social and
@geomblog.bsky.social
) on "Testing LLMs in a sandbox isn't responsible. Focusing on community use and needs is."
I've come to seriously value any opportunity I get to create a safe space for those coming up behind me.
It's incredibly important - and I know this because I'm here, able to do this work, precisely because of those that chose to endure who knows what in order to create the right space for me ❣️
It's been a journey of nearly 3 years, but I'm very excited to announce the CNTR AISLE Portal! 🚀 cntr-aisle.org It’s a new way to review and evaluate the 1,000+ AI bills introduced in the U.S. over the last three years. Check out the Bill Library and our Profiles#AIPolicy #OpenData
Agents prioritize task completion rather than whether they should act. This is a consequence of how they are trained. My student @victorojewale.bsky.social has been investigating this and just wrote a (prize winning) paper arguing why (and how) we need a notion of "informed abstention". Link below.
How do we stop playing whack-a-mole when it comes to deepfake abuse? 🧵⚠️
It's about what's hidden, and what new deficiencies the tech carries with it. @victorojewale.bsky.social opines on the evolution of deployed AI and its limits. victorojewale.substack.com/p/from-exper...
Technologies like synthetic data, evaluations, and red-teaming are often framed as enhancing AI privacy and safety. But what if their effects lie elsewhere?
In a new paper with @realbrianjudge.bsky.social at #EAAMO25, we pull back the curtain on AI safety's toolkit. (1/n)
arxiv.org/pdf/2509.22872